OpenRLHF
ActiveDescription
OpenRLHF is a high-performance agentic RL framework based on Ray and vLLM, offering PPO, DAPO, and REINFORCE++ algorithms for large-scale training of agents and vision-language models.
OpenRLHF is a high-performance agentic RL framework based on Ray and vLLM, offering PPO, DAPO, and REINFORCE++ algorithms for large-scale training of agents and vision-language models.
ART (Agent Reinforcement Trainer) trains multi-step agents for real-world tasks using GRPO reinforcement learning, enabling on-the-job training for models like Qwen, Llama, and more.
Open-source multi-agent framework from Alibaba, enabling the construction of observable and interpretable agents with rich distributed capabilities.
Agent Lightning is Microsoft's open-source training framework for AI agents, using reinforcement learning to enhance agent capabilities.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations across providers like OpenAI, Anthropic, and AWS Bedrock.