Search Results

Found 89,702 repositories (showing 30)

wrk

💚95

Modern HTTP benchmarking tool

40.2k

3.0k

NOASSERTION

Updated 9 hours ago

mmdetection

open-mmlab

💚95

OpenMMLab Detection Toolbox and Benchmark

32.6k

9.8k

Apache-2.0

Python

Updated 13 hours ago

cascade-rcnn, convnext, detr, +17

hyperfine

sharkdp

💛85

A command-line benchmarking tool

27.8k

471

Apache-2.0

Rust

Updated 1 hour ago

benchmark, cli, command-line, +3

toon

toon-format

💚96

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

23.7k

1.1k

MIT

TypeScript

Updated 44 minutes ago

data-format, llm, serialization, +1

evals

openai

💚100

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

18.2k

2.9k

NOASSERTION

Python

Updated 9 hours ago

cua

trycua

💛87

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

13.4k

829

MIT

Python

Updated 59 minutes ago

agent, ai-agent, apple, +15

fashion-mnist

zalandoresearch

💚93

A MNIST-like fashion product database. Benchmark 👇

12.7k

3.1k

MIT

Python

Updated 2 hours ago

benchmark, computer-vision, convolutional-neural-networks, +8

BenchmarkDotNet

dotnet

💛87

Powerful .NET library for benchmarking

11.4k

1.0k

MIT

Updated 12 hours ago

benchmark, benchmarking, c-sharp, +4

xmrig

💚95

RandomX, KawPow, CryptoNight and GhostRider unified CPU/GPU miner and RandomX benchmark

9.9k

3.8k

GPL-3.0

Updated 21 hours ago

argon2, cpuminer, cryptocurrency, +6

mmsegmentation

open-mmlab

💚90

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

9.7k

2.8k

Apache-2.0

Python

Updated 1 minute ago

deeplabv3, image-segmentation, medical-image-segmentation, +8

autocannon

mcollina

💛77

Fast HTTP/1.1 benchmarking tool written in Node.js

8.4k

340

MIT

JavaScript

Updated 16 hours ago

FrameworkBenchmarks

TechEmpower

💚93

Source for the TechEmpower Framework Benchmarks project

8.1k

2.1k

NOASSERTION

C++

Updated 3 hours ago

benchmark, framework, frameworkbenchmarks, +2

kube-bench

aquasecurity

💚91

Checks whether Kubernetes is deployed according to security best practices as defined in the CIS Kubernetes Benchmark

8.0k

1.3k

Apache-2.0

Updated 14 hours ago

cis-benchmark, cis-kubernetes-benchmark, cis-security, +5

mmpose

open-mmlab

💚92

OpenMMLab Pose Estimation Toolbox and Benchmark.

7.5k

1.5k

Apache-2.0

Python

Updated 2 hours ago

animal-pose-estimation, benchmark, cpm, +17

bombardier

codesenberg

💛80

Fast cross-platform HTTP benchmarking tool written in Go

6.8k

332

MIT

Updated 9 hours ago

benchmarking, bombardier, cli, +9

sysbench

akopytov

💛88

Scriptable database and system performance benchmark

6.7k

1.1k

GPL-2.0

Updated 5 hours ago

benchmark, console, freebsd, +9

cleverhans

cleverhans-lab

💚90

An adversarial example library for constructing attacks, building defenses, and benchmarking both

6.4k

1.4k

MIT

Jupyter Notebook

Updated 2 days ago

benchmarking, machine-learning, security

siege

JoeDog

💛80

Siege is an HTTP load tester and benchmarking utility

6.2k

398

GPL-3.0

Updated 1 day ago

ann-benchmarks

erikbern

💛85

Benchmarks of approximate nearest neighbor libraries in Python

5.6k

889

MIT

Python

Updated 1 day ago

benchmark, docker, nearest-neighbors

benchmark.js

bestiejs

💛74

A benchmarking library. As used on jsPerf.com.

5.5k

332

NOASSERTION

JavaScript

Updated 11 hours ago

agentops

Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and CamelAI

5.4k

562

MIT

Python

Updated 3 hours ago

agent, agentops, agents-sdk, +14

criterion.rs

bheisler

💛79

Statistics-driven benchmarking library for Rust

5.4k

354

Apache-2.0

Rust

Updated 2 hours ago

benchmark, criterion, gnuplot, +2

YCSB

brianfrankcooper

💛85

Yahoo! Cloud Serving Benchmark

5.2k

2.3k

Apache-2.0

Java

Updated 2 days ago

mmaction2

open-mmlab

💛83

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

5.0k

1.3k

Apache-2.0

Python

Updated 2 hours ago

action-recognition, ava, benchmark, +15

TCN

locuslab

💛83

Sequence modeling benchmarks and temporal convolutional networks

4.5k

899

MIT

Python

Updated 11 hours ago

plow

six-ddc

💛76

A high-performance HTTP benchmarking tool that includes a real-time web UI and terminal display

4.4k

150

Apache-2.0

Updated 8 hours ago

apachebench, cli, http, +6

local-deep-research

LearningCircuit

💛73

Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and your private documents. Everything Local & Encrypted.

4.3k

408

MIT

Python

Updated 5 hours ago

academia, anthropic, arxiv, +17

CLUE

CLUEbenchmark

💛75

Chinese Language Understanding Evaluation Benchmark (中文语言理解测评基准): datasets, baselines, pre-trained models, corpus and leaderboard

4.2k

545

Python

Updated 1 day ago

albert, benchmark, bert, +12

VLMEvalKit

open-compass

💛71

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

4.0k

676

Apache-2.0

Python

Updated 3 hours ago

chatgpt, claude, clip, +16

mmpretrain

open-mmlab

💛75

OpenMMLab Pre-training Toolbox and Benchmark

3.8k

1.1k

Apache-2.0

Python

Updated 2 hours ago

beit, clip, constrastive-learning, +14
