AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Projects Multi-SWE-bench

Multi-SWE-bench

Stale
GitHub Python Apache-2.0

Description

A multilingual benchmark for issue resolving. Extends SWE-bench to multiple programming languages for evaluating AI agent capabilities across diverse codebases.

Tags

benchmark swe-bench multilingual evaluation software-engineering python

Categories

💻 Coding Agent
Visit GitHub

Project Metrics

Stars 336
Forks 54
Watchers 336
Issues 16
Created February 18, 2025
Last commit December 18, 2025

Deployment

Local

Related Projects

SWE-smith

664 · Python
Active

Scaling data for SWE-agents (NeurIPS 2025 D&B Spotlight). A toolkit for automatically generating large-scale training datasets for software engineering agents.

swe-agenttraining-databenchmark +3

SWE-bench

5.1k · Python
Normal

SWE-bench is a benchmark for evaluating language models on real-world GitHub issue resolution, featuring genuine problems from popular Python repositories, now a core standard for measuring AI coding agent capabilities.

evaluationpythoncoding +2

Augment SWE-bench Agent

873 · Python
Stale

Augment SWE-bench Agent is the number one open-source SWE-bench Verified implementation, demonstrating how to build high-performance software engineering agents to automatically resolve GitHub issues.

codingpythonagent +2

Trae Agent

11.6k · Python
Stale

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

coding-agentsoftware-engineeringllm +1
AgentList

The most comprehensive directory of open-source AI Agent projects. Discover and compare top Agent frameworks like LangChain, CrewAI, and more.

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community