Found 16,907 repositories (showing 30)
huggingface
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. YC W23
mastra-ai
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
openai
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
ConardLi
A powerful tool for creating datasets for LLM fine-tuning, RAG, and Eval
Arize-ai
AI Observability & Evaluation
allenai
Modeling, training, eval, and inference code for OLMo
AgentOps-AI
Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including CrewAI, Agno, OpenAI Agents SDK, Langchain, Autogen, AG2, and CamelAI
Kiln-AI
Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, and more.
openai
No description available
pydantic
AI observability platform for production LLM and agent systems.
PrimeIntellect-ai
Our library for RL environments + evals
MoonHighway
The code samples for Learning React by Alex Banks and Eve Porcello, published by O'Reilly Media
truera
Evaluation and Tracking for LLM Experiments and AI Agents
lmnr-ai
Laminar - open-source observability platform purpose-built for AI agents. YC S24.
modelscope
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
hegdepavankumar
Free images for EVE-NG and GNS3 containing routers, switches, firewalls, and other appliances, including Cisco, Fortigate, Palo Alto, Sophos, and more. Master the art of networking and improve your skills! Our repository provides a one-stop solution for a comprehensive hands-on experience.
cosmos72
Interactive Go interpreter and debugger with REPL, Eval, generics and Lisp-like macros
hkust-nlp
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
pyfa-org
Python fitting assistant, cross-platform fitting tool for EVE Online
dreamzero0
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
GitHamza0206
OpenSource Production ready Customer service with built in Evals and monitoring
mattpocock
Evaluate your LLM-powered apps with TypeScript
harbor-framework
Harbor is a framework for running agent evaluations and creating and using RL environments.
albertlatacz
Read Eval Print Loop for Java
openai
OpenAI Frontier Evals
superlinear-ai
🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL
facebookarchive
A read-eval-print-loop for php
hamelsmu
Skills for AI Evals to complement the course: AI Evals For Engineers & PMs
JudgmentLabs
The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.