Related Projects
Agents Towards Production
18.8k · Jupyter Notebook
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
agent · framework · evaluation +2
Argilla
4.9k · Python
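Argilla is a collaboration platform for AI engineers and domain experts, supporting high-quality dataset building, human feedback collection, and model evaluation.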
evaluation · data-processing · llm +2
Hugging Face Evaluate
2.4k · Python
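The official Hugging Face library for evaluating models and datasets, offering a wide range of metrics and methods for assessing machine learning model performance and dataset quality.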
evaluation · llm · python +2
12 Factor Agents
19.4k · TypeScript
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
agent · framework · evaluation +2