Featured Articles

Technical guides for building AI agents

Agent Evaluation and Testing: From Vibe Checks to End-to-End Pipelines

Most teams evaluate agents by checking a few examples. Real evaluation needs layered metrics, datasets that do not go stale, and judges that push back. This article provides runnable code patterns and a practical decision framework.

AgentList Team · April 28, 2026

Agent Evaluation · LLM Evaluation · Automated Testing

Agent Workflow Orchestration in Practice: Production Patterns from DAG to State Machines

Most agent workflows fail at the orchestration layer, not the model. A practical comparison of DAG, state machine, and visual builder approaches with production-ready code for error handling, human approval gates, and conditional branching.

AgentList Team · April 28, 2026

AI Agent · Workflow Orchestration · DAG

AI Coding Agents Deep Dive: Architecture Trade-offs from CLI to IDE-Integrated

A deep architectural comparison of seven open-source coding agents across three paradigms — CLI-first, IDE-integrated, and fully autonomous — examining context management, tool access, and autonomy levels to help you pick the right tool for each development scenario.

AgentList Team · April 28, 2026

AI Coding · Coding Agent · CLI

Browser Agents in Practice: Architecture and Pitfalls of AI-Controlled Browsers

Breaking down three abstraction layers for browser automation—from raw Playwright to structured extraction—with production patterns, runnable code, and common pitfalls.

AgentList Team · April 28, 2026

Browser Agent · Web Automation · Playwright

Advanced RAG: Chunking Strategies and Retrieval Optimization Trade-offs

Most RAG pipelines fail at retrieval, not generation. This article covers five chunking strategies, hybrid search, reranking pipelines, and a production-ready decision framework.

AgentList Team · April 28, 2026

RAG · Chunking · Retrieval Optimization

Designing Agent Memory Systems: From Short-Term Context to Persistent Knowledge

A deep dive into the four-layer agent memory architecture, with practical code for vector retrieval and memory compression to help you build scalable long-term memory systems.

AgentList Team · April 21, 2026

AI Agent · Memory Systems · Vector Retrieval

Building Agent Observability: From Distributed Tracing to Automated Evaluation

A systematic guide to the three pillars of agent observability — distributed tracing, metrics monitoring, and automated evaluation — for building production-grade agent monitoring.

AgentList Team · April 21, 2026

AI Agent · Observability · Distributed Tracing

AI Agent Security in Practice: From Prompt Injection to Defense in Depth

A systematic walkthrough of three major attack surfaces in AI agents, with practical code examples for prompt injection defense, tool permission scoping, and output filtering.

AgentList Team · April 21, 2026

AI Agent · Security · Prompt Injection

Sandboxing AI Agents: Isolation Strategies for Safe Code Execution

Comparing container, WebAssembly, and process-level isolation approaches, with practical code for safely executing agent-generated code.

AgentList Team · April 21, 2026

AI Agent · Sandbox · Code Execution

Building MCP Servers in Practice: Custom Tool Chains for AI Agents

Build a production-grade MCP server from scratch, covering tool definition, authentication design, and testing strategies to turn any API into an agent-ready tool.

AgentList Team · April 21, 2026

MCP · Agent · Tool Server

Featured

MCP Protocol in Practice: Building an Extensible Tool Ecosystem for Agents

From protocol modeling and server design to permission isolation, this guide shows how to build a stable tool integration layer for AI agents with MCP.

AgentList Team · February 25, 2026

MCP · Agent · Tool Calling

Featured

Agent Observability Playbook: End-to-End Tracing with Langfuse

Based on real production experience, this guide explains how to build a closed loop of tracing, evaluation, and cost analytics for AI agents with Langfuse.

AgentList Team · February 18, 2026

Langfuse · Observability · Tracing

PydanticAI in Production: Type-Driven Agent Design Patterns

Focused on structured outputs, tool calling, and error recovery, this article presents practical PydanticAI patterns for production systems.

AgentList Team · February 11, 2026

PydanticAI · Agent · Python

Web Automation Agent in Practice: Limits and Best Practices of browser-use

A practical breakdown of browser-use strengths and limits in web task automation, with strategies for stable execution and failure recovery.

AgentList Team · February 5, 2026

browser-use · Web Automation · Agent

Qdrant + RAG Retrieval Optimization Guide: From Recall to Answer Quality

Production-focused best practices for index design, filtering, reranking, and evaluation when building RAG retrieval layers with Qdrant.

AgentList Team · January 30, 2026

Qdrant · RAG · Vector Database

Featured

Building an AI Software Team with MetaGPT: From Requirements to Code Automation

An in-depth guide to how MetaGPT automates the full software development process through role-playing, with practical guidance on collaboration among the PM, Architect, and Engineer roles.

AgentList Team · March 1, 2025

MetaGPT · Multi-Agent · Software Development

Featured

Vector Database Selection Guide: Milvus vs Chroma vs Weaviate Comparison

A comprehensive comparison of popular open-source vector databases Milvus, Chroma, and Weaviate, helping you choose the best vector database for RAG applications.

AgentList Team · February 28, 2025

Vector Database · RAG · Milvus

Featured

RAG System Evaluation in Practice: Building High-Quality RAG Apps with Ragas and DeepEval

Learn how to evaluate RAG systems using Ragas and DeepEval, including measuring key metrics like faithfulness, answer relevance, and context precision.

AgentList Team · February 25, 2025

RAG · Evaluation · Ragas

Featured

Building Stateful AI Agents: A Deep Dive into Letta (MemGPT)

Learn how to build stateful AI agents with long-term memory using Letta (formerly MemGPT), solving the LLM context window limitation.

AgentList Team · February 22, 2025

Letta · MemGPT · AI Agent

Featured

AI Coding Assistant Comparison: Aider vs Continue vs Cursor

A detailed comparison of three popular AI coding assistants - Aider, Continue, and Cursor - to help you choose the best development tool based on features, experience, and pricing.

AgentList Team · February 20, 2025

AI Coding Assistant · Aider · Continue

Featured

2025 AI Agent Framework Selection Guide

An in-depth comparison of mainstream AI agent frameworks including LangChain, LangGraph, CrewAI, and AutoGen to help you choose the best development stack.

AgentList Team · February 15, 2025

AI Agent · LangChain · LangGraph

Featured

Build Your First AI Agent from Scratch

A hands-on guide to building a complete AI agent from scratch, covering environment setup, core components, and tool integration.

AgentList Team · February 10, 2025

AI Agent · Beginner Tutorial · Python

Featured

Architecture Design for Multi-Agent Collaboration Systems

A deep dive into principles, architecture patterns, and best practices for building efficient multi-agent collaboration systems.

AgentList Team · February 8, 2025

Multi-Agent · System Architecture · Collaboration Patterns

Featured

Complete Local Deployment Guide for AutoGPT

A step-by-step tutorial for installing and running AutoGPT locally, including environment setup, Docker deployment, and common troubleshooting.

AgentList Team · February 5, 2025

AutoGPT · Deployment Tutorial · Docker

RAG Explained: Giving AI Agents a Knowledge Base

An in-depth explanation of Retrieval-Augmented Generation and how to build private knowledge bases for AI agents to improve accuracy and reliability.

AgentList Team · February 1, 2025

RAG · Vector Database · Knowledge Base