Search Results

Found 108,540 repositories(showing 30)

BenchmarkDotNet

dotnet

💛87

Powerful .NET library for benchmarking

11.4k

1.0k

MIT

Updated 7 hours ago

benchmarkbenchmarkingc-sharp+4

benchmark

google

💚90

A microbenchmark support library

10.1k

1.8k

Apache-2.0

C++

Updated 15 hours ago

benchmark

docker-bench-security

docker

💛85

The Docker Bench for Security is a script that checks for dozens of common best-practices around deploying Docker containers in production.

9.6k

1.0k

Apache-2.0

Shell

Updated 2 hours ago

maskrcnn-benchmark

facebookresearch

💛84

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

9.4k

2.5k

MIT

Python

Updated 3 days ago

FrameworkBenchmarks

TechEmpower

💚93

Source for the TechEmpower Framework Benchmarks project

8.1k

2.1k

NOASSERTION

C++

Updated 1 day ago

benchmarkframeworkframeworkbenchmarks+2

kube-bench

aquasecurity

💚91

Checks whether Kubernetes is deployed according to security best practices as defined in the CIS Kubernetes Benchmark

8.0k

1.3k

Apache-2.0

Updated 13 hours ago

cis-benchmarkcis-kubernetes-benchmarkcis-security+5

js-framework-benchmark

krausest

💛86

A comparison of the performance of a few popular javascript frameworks

7.4k

896

Apache-2.0

JavaScript

Updated 4 hours ago

yet-another-bench-script

masonr

💛82

YABS - a simple bash script to estimate Linux server performance using fio, iperf3, & Geekbench

6.3k

557

WTFPL

Shell

Updated 7 hours ago

bashbench-scriptbenchmark+8

ReLE评测：中文AI大模型能力评测（持续更新）：目前已囊括359个大模型，覆盖chatgpt、gpt-5.2、o4-mini、谷歌gemini-3-pro、Claude-4.6、文心ERNIE-X1.1、ERNIE-5.0、qwen3-max、qwen3.5-plus、百川、讯飞星火、商汤senseChat等商用模型，以及step3.5-flash、kimi-k2.5、ernie4.5、MiniMax-M2.5、deepseek-v3.2、Qwen3.5、llama4、智谱GLM-5、GLM-4.7、LongCat、gemma3、mistral等开源大模型。不仅提供排行榜，也提供规模超200万的大模型缺陷库！方便广大社区研究分析、改进大模型。

5.8k

234

Updated 58 minutes ago

agentic-aiartificial-intelligencellm-agent+1

ann-benchmarks

erikbern

💛85

Benchmarks of approximate nearest neighbor libraries in Python

5.6k

889

MIT

Python

Updated 10 hours ago

benchmarkdockernearest-neighbors

ToolBench

OpenBMB

💛80

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

5.6k

481

Apache-2.0

Python

Updated 6 minutes ago

benchmark.js

bestiejs

💛74

A benchmarking library. As used on jsPerf.com.

5.5k

332

NOASSERTION

JavaScript

Updated 1 hour ago

Chronos

Kodezi

💛72

Kodezi Chronos is a debugging-first language model that achieves state-of-the-art results on SWE-bench Lite (80.33%) and 67% real-world fix accuracy, over six times better than GPT-4. Built with Adaptive Graph-Guided Retrieval and Persistent Debug Memory. Model available Q1 2026 via Kodezi OS.

5.0k

213

NOASSERTION

Java

Updated 16 hours ago

artificial-intelligenceautonomous-debuggingbenchmark+15

SWE-bench

💛83

SWE-bench: Can Language Models Resolve Real-world Github Issues?

4.7k

821

MIT

Python

Updated 52 minutes ago

benchmarklanguage-modelsoftware-engineering

benchmark_results

foolwood

🧡69

Visual Tracking Paper List

4.0k

1.0k

Updated 1 week ago

benchmarkdeep-learningpaper+2

deep-text-recognition-benchmark

clovaai

💛80

Text recognition (optical character recognition) with deep learning methods, ICCV 2019

3.9k

1.1k

Apache-2.0

Jupyter Notebook

Updated 16 hours ago

crnndeep-learninggrcnn+11

awesome-http-benchmark

denji

💛76

HTTP(S) benchmark tools, testing/debugging, & restAPI (RESTful)

3.7k

250

MIT

Updated 11 hours ago

awesomeawesome-listbenchmark+15

mini-swe-agent

SWE-agent

💛74

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

3.7k

513

MIT

Python

Updated just now

agentagentic-aiagentic-ai-cli+3

XcodeBenchmark

devMEremenko

💛73

XcodeBenchmark measures the compilation time of a large codebase on iMac, MacBook, and Mac Pro

3.6k

415

MIT

Swift

Updated 1 day ago

benchmarkcocoapodsswift+1

AgentBench

THUDM

💛71

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

3.3k

244

Apache-2.0

Python

Updated 1 hour ago

chatgptgpt-4llm+1

BIG-bench

google

💛74

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

3.2k

617

Apache-2.0

Python

Updated 1 day ago

auto-code-rover

AutoCodeRoverSG

💛76

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.

3.1k

329

NOASSERTION

Python

Updated 16 hours ago

derailed_benchmarks

zombocom

🧡69

Go faster, off the Rails - Benchmarks for your whole Rails app

3.1k

142

Ruby

Updated 1 day ago

benchmarks

kostya

💛76

Some benchmarks of different languages

2.9k

259

MIT

Makefile

Updated 1 day ago

benchmarkslanguages

WebBench

EZLippi

💛84

Webbench是Radim Kolar在1997年写的一个在linux下使用的非常简单的网站压测工具。它使用fork()模拟多个客户端同时访问我们设定的URL，测试网站在压力下工作的性能，最多可以模拟3万个并发连接去测试网站的负载能力。官网地址:http://home.tiscali.cz/~cz210552/webbench.html

2.8k

1.1k

GPL-3.0

Updated 1 day ago

convnet-benchmarks

soumith

💛78

Easy benchmarking of all publicly accessible implementations of convnets

2.7k

570

MIT

Python

Updated 1 day ago

benchmarking-gnns

graphdeeplearning

💛77

Repository for benchmarking graph neural networks (JMLR 2023)

2.7k

458

MIT

Jupyter Notebook

Updated 5 days ago

benchmark-frameworkdeep-learningdgl+4

cnn-benchmarks

jcjohnson

💛77

Benchmarks for popular CNN models

2.5k

406

MIT

Python

Updated 3 hours ago

go-web-framework-benchmark

smallnest

💛74

:zap: Go web framework benchmark

2.1k

229

Apache-2.0

Updated 3 days ago

benchmarkconcurrencyhttp-router-benchmark+2

nativejson-benchmark

miloyip

💛75

C/C++ JSON parser/generator benchmark

2.1k

268

MIT

JavaScript

Updated 1 day ago

GitHub Explorer

Search Results

BenchmarkDotNet

benchmark

docker-bench-security

maskrcnn-benchmark

FrameworkBenchmarks

kube-bench

js-framework-benchmark

yet-another-bench-script

chinese-llm-benchmark

ann-benchmarks

ToolBench

benchmark.js

Chronos

SWE-bench

benchmark_results

deep-text-recognition-benchmark

awesome-http-benchmark

mini-swe-agent

XcodeBenchmark

AgentBench

BIG-bench

auto-code-rover

derailed_benchmarks

benchmarks

WebBench

convnet-benchmarks

benchmarking-gnns

cnn-benchmarks

go-web-framework-benchmark

nativejson-benchmark

BenchmarkDotNet

benchmark

docker-bench-security

maskrcnn-benchmark

FrameworkBenchmarks

kube-bench

js-framework-benchmark

yet-another-bench-script

chinese-llm-benchmark

ann-benchmarks

ToolBench

benchmark.js

Chronos

SWE-bench

benchmark_results

deep-text-recognition-benchmark

awesome-http-benchmark

mini-swe-agent

XcodeBenchmark

AgentBench

BIG-bench

auto-code-rover

derailed_benchmarks

benchmarks

WebBench

convnet-benchmarks

benchmarking-gnns

cnn-benchmarks

go-web-framework-benchmark

nativejson-benchmark