Found 641 repositories(showing 30)
Farama-Foundation
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
microsoft
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
SJTUwbl
Multi-agent Combat Arena (UAV swarm vs UAV swarm)
elliottneilclark
rs-poker is a rust library that includes all of the poker evaluation tools that you need from hand ranking and starting card enumeration to a full agent arena for self learning.
YuhangSong
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
YuhangSong
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
upstash
Agent Trading Arena --- Three AI agents. $100k each. Real market prices. Who wins?
ByteArena
Byte Arena: Digital Playground for Intelligent Agents (star the repo to vote for us)
chongdashu
A vibe coded threejs shooter arena using threejs agent skills
xlang-ai
[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents
jiangjiechen
Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena"
wekjsdvnm
This work has been accepted to Findings of EMNLP 2025!
xrose3159
PaperPub is an academic arena where diverse AI Agents read papers daily, pick apart each other's arguments, and fiercely debate.
hilkoc
The purpose of this project is to research Artificial Intelligence and Reinforcement Learning. In the AI Arena, multiple agents can interact with a single environment. After sending its action, each each agent will receive a reward. This allows agents to learn, improve their behavior and to adapt to each other. Interesting phenomena can arise...
openclaw-trade
openclaw trading assistant| openclaw trading skill | nof1.ai & openclaw [moltbot] collaboration | We get the best practices from alpha arena trading seasons and bring it to clawdbot All top AI agents, realtime monitoring and news research, gather info from private insiders and many other! Using Hyperliquid API.
sands-lab
A Network Arena for Benchmarking AI Agents on Network Troubleshooting
abhijitmajumdar
A multi agent multi arena car simulator oriented towards Reinforcement Learning with simultaneous multi instance spawning capability
diambra
Example Agents for DIAMBRA Arena Environments
Felliks
AI plays Doom — pit Vision Language Models against demons and each other. Solo scenarios, deathmatch arena, 1-4 agents with any OpenAI-compatible API
neulab
⚔️ OpenHands PR Arena ⚔️ is a platform for evaluating and benchmarking agentic coding assistants through paired pull request (PR) generations.
xqbjs
A cross-platform, web-based Avalon game arena featuring sci-fi pixel art and Qwen-Max AI agents. Supports remote multiplayer on both mobile and desktop.
cgrivera
The AI Arena: A framework for distributed multi-agent reinforcement learning
masa-finance
The Agent Arena Subnet gamifies creating the best Twitter AI agent by having miners register agents, validators manage registrations, retrieve metrics, and score agents based on algorithms, ranking top performers in an arena-style competition.
AMD-AGI
AgentKernelArena provides an end-to-end siloed-benchmarking environment where different LLM-powered agents—such as Cursor Agent, Claude Code, Codex, SWE-agent, and GEAK—can be evaluated side-by-side on the same GPU kernel tasks, using objective and reproducible metrics.
PKU-YuanGroup
A living arena for Human vs Agent and Agent vs Agent competition, testing OpenClaw intelligence through real-time gameplay.
SatyamSingh8306
mcp_arena is a production-ready Python library for building MCP (Model Context Protocol) servers with intelligent agent orchestration and domain-specific presets.
aidenybai
what if we put a bunch of coding agents together and made them argue with each other?
Software-Engineering-Arena
Compare agents pairwise via multi‑round evaluations for SE tasks.
vistara-apps
Agent Arena SDK
varsity-tech-product
arena mcp and sdk for agent