Bifrost

Active

GitHub Go Apache-2.0

Description

An observability and gateway platform for LLM applications, providing request tracing, model routing, logging, and cost analysis for agent workflows.

Key Features

Unified OpenAI-compatible API gateway connecting to 23+ LLM providers including OpenAI, Anthropic, AWS Bedrock, and Google Vertex
Automatic failover and load balancing across multiple API keys and providers with zero downtime
Semantic caching for intelligent response caching based on similarity, reducing costs and latency
Model Context Protocol (MCP) support enabling AI models to use external tools like filesystem, web search, and databases
Enterprise governance with virtual keys, hierarchical budget management, rate limiting, and OIDC user provisioning
Zero-config startup with web UI for visual configuration, real-time monitoring, and analytics

Use Cases

💡 Unifying LLM access across multiple providers with a single API endpoint for production AI applications

💡 Implementing automatic failover to maintain service availability when a primary LLM provider experiences outages

💡 Reducing LLM API costs through semantic caching and intelligent request routing across providers

💡 Adding enterprise-grade access control and budget management to AI agent deployments

💡 Building AI applications with vendor-agnostic provider switching without code changes

Quick Start

Start in 30 seconds: `npx -y @maximhq/bifrost` or `docker run -p 8080:8080 maximhq/bifrost`. Open http://localhost:8080 for the web UI. Make your first call with `curl -X POST http://localhost:8080/v1/chat/completions` using the OpenAI-compatible format.

Visit GitHub

Bifrost

Description

Key Features

Use Cases

Tags

Categories

Quick Start

Related Projects

Arize Phoenix

Observal

Braintrust

Opik

Related Articles

LLM Agent Cost Control: Semantic Caching and Model Routing in Practice