Found 237,956 repositories (showing 30)
openai
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
EleutherAI
A framework for few-shot evaluation of language models.
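For context, the harness's README documents a `simple_evaluate` entry point; a minimal sketch under that assumption, using a small Hugging Face checkpoint and the `hellaswag` task (both the package and the task data must be downloadable):

```python
# Sketch of driving EleutherAI's lm-evaluation-harness programmatically.
# Assumes `pip install lm-eval`; model/task names here are illustrative.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                         # Hugging Face backend
    model_args="pretrained=gpt2",       # any HF causal LM checkpoint
    tasks=["hellaswag"],
    num_fewshot=0,
)
print(results["results"]["hellaswag"])  # per-task metrics
```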
:boom: React Native UI Library based on Eva Design System :new_moon_with_face::sparkles: Dark Mode
akveo
A pack of more than 480 beautifully crafted Open Source icons. SVG, Sketch, Web Font and Animations support.
akveo
:boom: Customizable Angular UI Library based on Eva Design System :new_moon_with_face::sparkles: Dark Mode
ktr0731
Evans: more expressive universal gRPC client
openai
No description available
EvolvingLMMs-Lab
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
openai
Code for the paper "Evaluating Large Language Models Trained on Code"
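That repository ships the HumanEval dataset alongside a pass@k checker; a minimal sketch of the generate-then-score loop it documents, where `generate_one_completion` is a hypothetical stand-in for a real model call:

```python
# Sketch of producing a samples file for openai/human-eval's checker.
# `generate_one_completion` is a hypothetical placeholder, not part of the library.
from human_eval.data import read_problems, write_jsonl

def generate_one_completion(prompt: str) -> str:
    # Placeholder: should return code completing the function body in `prompt`.
    return "    return None\n"

problems = read_problems()  # task_id -> {"prompt": ..., "test": ...}
samples = [
    dict(task_id=task_id, completion=generate_one_completion(p["prompt"]))
    for task_id, p in problems.items()
]
write_jsonl("samples.jsonl", samples)
# Scoring then runs via the bundled CLI:
#   evaluate_functional_correctness samples.jsonl
```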
1y0n
掩日 (YanRi) - a generator for antivirus-evasion executables
georgia-tech-db
Database system for AI-powered apps
baaivision
EVA Series: Visual Representation Fantasies from BAAI
modelscope
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
huggingface
🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
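As a quick illustration, the library follows a load/compute pattern; a minimal sketch, assuming the package is installed and the `accuracy` metric script can be fetched:

```python
# Minimal use of huggingface/evaluate: load a metric, then score predictions.
import evaluate

accuracy = evaluate.load("accuracy")
result = accuracy.compute(
    references=[0, 1, 1, 0],   # ground-truth labels
    predictions=[0, 1, 0, 0],  # model outputs
)
print(result)  # {'accuracy': 0.75}
```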
facebookresearch
A python tool for evaluating the quality of sentence embeddings.
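SentEval's documented pattern is to hand `prepare` and `batcher` callbacks to its engine; a rough sketch under that assumption, where the random-vector "encoder" is a placeholder for a real sentence embedder and `PATH_TO_DATA` must point at the downloaded transfer-task data:

```python
# Sketch of facebookresearch/SentEval usage, following its README pattern.
import numpy as np
import senteval

def prepare(params, samples):
    # Called once per task; build vocab or fit preprocessing here if needed.
    return

def batcher(params, batch):
    # Must return one embedding per sentence; random 128-d vectors as a stand-in.
    return np.random.rand(len(batch), 128)

params = {"task_path": "PATH_TO_DATA", "usepytorch": False, "kfold": 5}
se = senteval.engine.SE(params, batcher, prepare)
results = se.eval(["MR", "STS12"])  # transfer tasks to score embeddings on
print(results)
```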
huggingface
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
rmcelreath
Statistical Rethinking course at MPI-EVA from Dec 2018 through Feb 2019
Cloud-CV
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
tatsu-lab
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Veil-Framework
Veil Evasion is no longer supported, use Veil 3.0!
TTLabs
Javascript library for browser to S3 multipart resumable uploads
eva-engine
Eva.js is a front-end game engine specifically for creating interactive game projects.
evalplus
Rigorous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024
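EvalPlus's README describes dataset helpers mirroring the HumanEval flow; a sketch assuming those helpers (`get_human_eval_plus`, `write_jsonl`) are available, with a hypothetical `solve` stub in place of an actual model:

```python
# Sketch of preparing samples for evalplus scoring, per its documented helpers.
# `solve` is a hypothetical placeholder for an LLM call.
from evalplus.data import get_human_eval_plus, write_jsonl

def solve(prompt: str) -> str:
    return "    pass\n"  # placeholder completion

samples = [
    dict(task_id=task_id, solution=problem["prompt"] + solve(problem["prompt"]))
    for task_id, problem in get_human_eval_plus().items()
]
write_jsonl("samples.jsonl", samples)
# Scoring then runs via the project's CLI, e.g.:
#   evalplus.evaluate --dataset humaneval --samples samples.jsonl
```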
MLGroupJLU
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
BytedanceSpeech
No description available
oddcod3
Python antivirus evasion tool
mattpocock
Evaluate your LLM-powered apps with TypeScript
Maluuba
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
silentmatt
Mathematical expression evaluator in JavaScript
refreshdotdev
An MCP server that autonomously evaluates web applications.