KTransformers
A flexible framework for experiencing heterogeneous LLM inference and fine-tuning optimizations — run large language models efficiently on consumer hardware with kernel-level optimizations.
An end-to-end RL training framework by NVIDIA for orchestrating tools and agentic workflows. Optimizes multi-step agent decision-making and tool-use policies.
Fast, flexible LLM inference engine built in Rust — supports multiple model architectures and quantization schemes for high-performance local LLM deployment.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider, using Stream's edge network for ultra-low-latency real-time interactions.
A deep research agent framework optimized for complex research and prediction tasks, supporting multi-step reasoning and information retrieval; its MiroThinker-1.7 and MiroThinker-H1 models score 74.0 and 88.2, respectively, on the BrowseComp benchmark.