Found 508 repositories (showing 30)
SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
rllm-org
Democratizing Reinforcement Learning for LLMs
SWE-agent
The 100-line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo, but scores >74% on SWE-bench Verified!
augmentcode
The #1 open-source SWE-bench Verified implementation
SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
langtalks
🤖 AI-powered software engineering multi-agent system with researcher and developer agents that automate code implementation through intelligent planning and execution. Built with LangGraph multi-agent workflows
SWE-bench
[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
bhouston
Simple to install, powerful command-line based AI agent system for coding.
QuantaAlpha
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.
lidangzzz
A set of coding style guidelines for Vibe Coding or SWE-Agents that maximize efficiency and improve human readability.
SWE-agent
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
Gen-Verse
RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings
OpenAutoCoder
Live-SWE-agent: live, runtime self-evolving software engineering agent
scaleapi
SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?
jd-opensource
Repository-level Repair Agent Based on SWE-Bench—JoyCode Agent
aavetis
This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates regularly.
R2E-Gym
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
QuantaAlpha
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
JARVIS-Xs
SE-Agent is a self-evolution framework for LLM code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance.
aws-samples
Autonomous SWE agent working in the cloud!
SKYLENAGE-AI
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration
OpenHands
The theory of mind module for the SWE agent
amazon-science
SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents
gso-bench
[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
giraffe-tree
Agent Base is a source-level research project on coding agents. It compares Codex CLI, OpenCode, Gemini CLI, Kimi CLI, and SWE-agent across agent loops, tools, MCP integration, context/memory handling, UI flows, web architecture, and safety controls.
microsoft
RepoLaunch is an agentic SWE tool aimed at automating the build, execution and test of GitHub repositories across programming languages and operating systems.
THUDM
[ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
logic-star-ai
We track and analyze the activity and performance of autonomous code agents in the wild.
lblankl
Container-free RL framework for training software engineering agents
CUHK-Shenzhen-SE
[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench