Agents Towards Production
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
A CNCF Sandbox SRE Agent that automatically analyzes infrastructure logs and metrics to assist with incident diagnosis and system operations.
A comprehensive benchmark to evaluate LLMs as agents (ICLR 2024), covering operating systems, databases, knowledge graphs, digital card games and more.
OpenTelemetry instrumentation for AI observability, providing standardized tracing, metrics collection, and span definitions for LLM inference processes to help developers monitor and debug AI agent systems.