Found 201 repositories(showing 30)
CLUEbenchmark
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
CLUEbenchmark
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
FireRedTeam
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
ChineseGLUE
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
CBLUEbenchmark
[CBLUE1] 中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
This repository contains datasets and baselines for benchmarking Chinese text recognition.
IAAR-Shanghai
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
kakaobrain
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
thu-coai
The official repository of the paper: COLD: A Benchmark for Chinese Offensive Language Detection
X-PLUG
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
csust7zhangjm
CSUST Chinese Traffic Sign Detection Benchmark
FreedomIntelligence
CMB, A Comprehensive Medical Benchmark in Chinese
hkgsas
Benchmark Dataset of Limit Order Book in China Markets
IntelligentSystemsLab
UrbanEV is an open benchmark dataset for electric vehicle (EV) charging demand in Shenzhen, China.
THUIR
T2Ranking: A large-scale Chinese benchmark for passage ranking.
SoulmateB
CurveLanes is a new benchmark lane detection dataset with 150K lanes images for difficult scenarios such as curves and multi-lanes in traffic lane detection. It is collected in real urban and highway scenarios in multiple cities in China.
CLUEbenchmark
Python toolkit for Chinese Language Understanding(CLUE) Evaluation benchmark
DUT-lujunyu
The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark" (ACL2023).
mjq11302010044
Real-CE: A Benchmark for Chinese-English Scene Text Image Super-resolution (ICCV2023)
LAMDA-NeSy
ChinaTravel: A Real-World Benchmark for Language Agents in Chinese Travel Planning
lizhaoliu-Lec
Instance Segmentation for Chinese Character Stroke Extraction, Datasets and Benchmarks.
Dai-shen
LAiW: A Chinese Legal Large Language Models Benchmark
williamliujl
A Chinese National Medical Licensing Examination dataset and large languge model benchmarks
CLUEbenchmark
中文通用大模型开放域多轮测评基准 | An Open Domain Benchmark for Foundation Models in Chinese
OpenMOSS
[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
meituan-longcat
Omni Model Benchmark with high quality and diversity, which reveals the Compositional Law. We’re now focused on Chinese scenarios — and actively seeking partners to co-build English & multilingual versions! Let’s expand global impact together.
AI45Lab
Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.
jizijing
A Benchmark for Classical Chinese Based on a Crowdsourcing System.
Ethan-yt
古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
zjunlp
ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark