Found 39 repositories (showing 30)
zzyfight
GenAI compliance benchmark is an evaluation benchmark for generative AI in regulated industries.
sgl-project
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
TIGER-AI-Lab
Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]
facebookresearch
GenAI Media Generation Challenge Benchmark
hiamitabha
Code to benchmark APIs available from LLM vendors and demonstrate how they work
kangruobing
VQualA 2025 GenAI-Bench AIGC Video Quality Assessment Challenge
seanbetts
Benchmark tests to evaluate LLM's marketing knowledge, understanding and capabilities
speglich
This project provides a complete Terraform infrastructure setup for benchmarking Generative AI models, specifically designed for Oracle Cloud Infrastructure (OCI). It automates the deployment of compute instances with pre-configured benchmarking tools and includes performance comparison capabilities between different AI platforms.
This project aims to provide hands-on experience with three major classes of generative models: Generative Adversarial Networks, Variational Autoencoders, and Diffusion Models. Students will implement simplified versions of each model, train them on image datasets, and compare their generated samples, training dynamics, and evaluation metrics.
enterprisebot-community
No description available
Testing-AI-Security-Dashboard-Org
No description available
Run OpenVINO GenAI LLM_BENCH in a batch
nearai
No description available
Testing-AI-Security-Dashboard-Org
No description available
pdtgct
Yet another Generative AI Performance dataset generation and benchmarking toolset.
guytonde
Energy profiling and benchmarking suite for inference optimizations across LLMs and diffusion models.
ptakpiotr
Simple TUI for running benchmarks (tasks) for locally-run AI models using Ollama
HaoZhang615
No description available
jelyoussefi
No description available
Nvillaluenga
A few benchmarks on different agentic architectural approaches
key4ng
A Rust reimplementation of genai-bench for benchmarking LLM serving systems at high concurrency with accurate timing and industry-standard metrics.
wanheo09
No description available
SumitKochar
GenAI_Benchmarking_Models
GazzoA
Benchmarking generative AI tools for literature retrieval and summarization in genomic variant interpretation
Collinsbrefo123
This repository contains a multi-model generative AI evaluation project comparing lightweight, open-source LLMs under identical inference conditions. It focuses on analyzing instruction-following behavior, response quality, and model trade-offs for practical GenAI system design.
RazumAI-ch
GxP benchmark for ALCOA+ deviation detection and audit validation
igorrazumny
A structured benchmark for evaluating Generative AI models (e.g., OpenAI GPT-4o, Claude, Gemini) on their ability to identify quality deviations in healthcare manufacturing recipes. Focuses on GxP-relevant issues, model comparison, and long-term reproducibility.
ramirez-ai-labs
Lakehouse-native evaluation framework for measuring regional Spanish LLM performance (SV vs PE) using Delta tables, Spark, and Databricks. Demonstrates Bronze/Silver/Gold architecture and production-ready GenAI evaluation patterns.
Saivinay24
Physics-grounded evaluation harness for auditing Generative Video (Optical Flow metrics).
Chenik00Anas
Measuring the cost of accuracy in generative AI models — TER I3S Lab, Université Côte d'Azur