Found 59 repositories(showing 30)
Alaboudi1
الذكاء الاصطناعي التوليدي باللغة العربية
jryebread
Like system requirements lab but for LLMs
cpldcpu
Various LLM Benchmarks
morsoli
大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标
Datastory-CN
No description available
BIMSBbioinfo
No description available
SmartyX-GmbH
No description available
kmkarakaya
No description available
red1108
No description available
anirudhs1010
https://studio--llm-judge-4sh1r.us-central1.hosted.app/
EduardoNicacio
Small/Medium LLM Benchmarks using LM Studio
LuckyJH2024
No description available
jimenaRL
No description available
BIMSBbioinfo
No description available
yannicklevederpvtl
No description available
EdisAganovic
No description available
adv-11
No description available
sumitdeyonline
No description available
NicolasDeBufala
Outil de benchmark pour évaluation automatique de pipeline d'automatisation de tâche par des LLM. Ce répertoire est associé à un article de recherche envoyé pour PFIA.
tcs211
No description available
baborok
No description available
Mikop22
This project develops a backend system to benchmark the ability of Large Language Models (LLMs) to verify the accuracy of product specifications by leveraging web search capabilities through the Perplexity AI API. The system evaluates how well LLMs, when augmented with access to external information. (see video)
ArifTansel
No description available
wubbyweb
No description available
okelot
LLM benchmark for Canadian case law
MeisterSeSe
No description available
Bai-Yu-Lan
No description available
Mark-koptev
No description available
iamthemovie
Benchmarking LLM inference performance on different hardware
LLM benchmark for Canadian case law