Search Results

Found 508 repositories(showing 30)

SWE-agent

💚95

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

18.9k

2.0k

MIT

Python

Updated 5 hours ago

agentagent-based-modelai+4

rllm

rllm-org

💛76

Democratizing Reinforcement Learning for LLMs

5.4k

538

Apache-2.0

Python

Updated 5 hours ago

agent-frameworkagentic-workflowcoding-agent+11

mini-swe-agent

SWE-agent

💛74

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

3.6k

504

MIT

Python

Updated 3 minutes ago

agentagentic-aiagentic-ai-cli+3

augment-swebench-agent

augmentcode

💛72

The #1 open-source SWE-bench Verified implementation

863

152

NOASSERTION

Python

Updated 4 days ago

SWE-Gym

🧡66

Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]

656

Apache-2.0

Jupyter Notebook

Updated 1 day ago

🤖 AI-powered software engineering multi-agent system with researcher and developer agents that automate code implementation through intelligent planning and execution. Built with LangGraph multi-agent workflows

623

123

MIT

Python

Updated 3 days ago

SWE-smith

SWE-bench

💛72

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

614

114

MIT

Python

Updated 22 hours ago

agentslanguage-modelsoftware-engineering+1

mycoder

bhouston

🧡56

Simple to install, powerful command-line based AI agent system for coding.

564

MIT

TypeScript

Updated 2 days ago

agentagenticai+8

RepoMaster

QuantaAlpha

🧡61

RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.

515

Python

Updated 7 hours ago

claude-codecode-agentgittaskbench+3

AI-Coding-Style-Guides

lidangzzz

💛71

A set of coding style guidelines for Vibe Coding or SWE-Agents that maximize efficiency and improve human readability.

480

Apache-2.0

JavaScript

Updated 6 days ago

SWE-ReX

SWE-agent

🧡67

Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.

471

108

MIT

Python

Updated 1 day ago

agentagentsai+7

Open-AgentRL

Gen-Verse

🧡66

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

433

Apache-2.0

Python

Updated 3 hours ago

agent-rlcoding-agententropy-method+8

live-swe-agent

OpenAutoCoder

🧡66

Live-SWE-agent: live, runtime self-evolving software engineering agent

351

MIT

Updated just now

agentllmself-evolving+1

SWE-bench_Pro-os

scaleapi

🧡66

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

334

MIT

Python

Updated 1 day ago

joycode-agent

jd-opensource

🧡56

Repository-level Repair Agent Based on SWE-Bench—JoyCode Agent

327

MIT

Python

Updated 1 week ago

agentaillm+2

PRarena

aavetis

🧡66

This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates regularly.

297

HTML

Updated 11 hours ago

agentsswe-agent

R2E-Gym

🧡66

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

257

Apache-2.0

Python

Updated 2 days ago

agentscoding-agentsllm+2

GitTaskBench

QuantaAlpha

🧡50

Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.

252

Python

Updated 2 weeks ago

claude-codecode-agentcode-llm+4

SE-Agent

JARVIS-Xs

💛71

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expanding the search space and escaping local optima. On SWE-bench Verified, it achieves SOTA performance

245

MIT

Python

Updated 3 days ago

claude-codecode-agentcode-fix+5

remote-swe-agents

aws-samples

🧡61

Autonomous SWE agent working in the cloud!

229

MIT-0

TypeScript

Updated 2 days ago

agentic-aibedrockgenai+2

SWE-CI

SKYLENAGE-AI

💛70

SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration

141

Apache-2.0

Python

Updated 10 minutes ago

agentbenchmarkcontinuous-integration

ToM-SWE

OpenHands

🧡65

The theory of mind module for the SWE agent

Python

Updated 16 hours ago

SWE-PolyBench

amazon-science

🧡60

SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

MIT

Python

Updated 1 week ago

gso

gso-bench

💛70

[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

MIT

Python

Updated 21 hours ago

agent-base

giraffe-tree

🧡65

Agent Base is a source-level research project on coding agents. It compares Codex CLI, OpenCode, Gemini CLI, Kimi CLI, and SWE-agent across agent loops, tools, MCP integration, context/memory handling, UI flows, web architecture, and safety controls.

HTML

Updated 3 days ago

RepoLaunch

microsoft

🧡65

RepoLaunch is an agentic SWE tool aimed at automating the build, execution and test of GitHub repositories across programming languages and operating systems.

MIT

Python

Updated 1 day ago

SWE-Dev

THUDM

❤️45

[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.

MIT

Python

Updated 1 month ago

insights

logic-star-ai

❤️45

We track and analyze the activity and performance of autonomous code agents in the wild

MIT

TypeScript

Updated 1 month ago

agentsswe-agentswe-bench

SWE-MiniSandbox

lblankl

🧡65

Container-free RL framework for training software engineering agents

Python

Updated 1 hour ago

agentagent-based-frameworkai+3

UTBoost

CUHK-Shenzhen-SE

🧡60

[ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

MIT

Python

Updated 2 weeks ago

GitHub Explorer

Search Results

SWE-agent

rllm

mini-swe-agent

augment-swebench-agent

SWE-Gym

swe-agent

SWE-smith

mycoder

RepoMaster

AI-Coding-Style-Guides

SWE-ReX

Open-AgentRL

live-swe-agent

SWE-bench_Pro-os

joycode-agent

PRarena

R2E-Gym

GitTaskBench

SE-Agent

remote-swe-agents

SWE-CI

ToM-SWE

SWE-PolyBench

gso

agent-base

RepoLaunch

SWE-Dev

insights

SWE-MiniSandbox

UTBoost

SWE-agent

rllm

mini-swe-agent

augment-swebench-agent

SWE-Gym

swe-agent

SWE-smith

mycoder

RepoMaster

AI-Coding-Style-Guides

SWE-ReX

Open-AgentRL

live-swe-agent

SWE-bench_Pro-os

joycode-agent

PRarena

R2E-Gym

GitTaskBench

SE-Agent

remote-swe-agents

SWE-CI

ToM-SWE

SWE-PolyBench

gso

agent-base

RepoLaunch

SWE-Dev

insights

SWE-MiniSandbox

UTBoost