Agent Lightning
Active
Introduction
Agent Lightning is an open-source AI agent training framework from Microsoft that uses reinforcement learning to improve agent capabilities.
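ART (Agent Reinforcement Trainer) is a reinforcement learning framework that uses the GRPO algorithm to train multi-step agents, supporting on-the-job training of models such as Qwen and Llama to complete real-world tasks.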
Open-source deep research agent from Alibaba Tongyi Lab, using multi-stage iterative information retrieval and reasoning to conduct deep analysis, synthesis, and summarization of complex topics with web search and document analysis.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider, using Stream's edge network for ultra-low-latency real-time interactions.
A deep research agent framework optimized for complex research and prediction tasks. Its MiroThinker-1.7 and MiroThinker-H1 models score 74.0 and 88.2, respectively, on the BrowseComp benchmark, and it supports multi-step reasoning and information retrieval.