mistral.rs
Fast, flexible LLM inference engine built in Rust — supports multiple model architectures and quantization schemes for high-performance local LLM deployment.
An end-to-end RL training framework by NVIDIA for orchestrating tools and agentic workflows. Optimizes multi-step agent decision-making and tool-use policies.
A flexible framework for experimenting with heterogeneous LLM inference and fine-tuning optimizations — run large language models efficiently on consumer hardware with kernel-level optimizations.
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's chain-of-thought reasoning traces with Anthropic Claude models.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider, using Stream's edge network for ultra-low-latency realtime interactions.