Search Results

Found 41 repositories (showing 30)

VoxDIY

pilot7747

❤️40

This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.

NOASSERTION

Python

Updated 10 months ago

crowdsourcing speech-recognition speech-synthesis

DALI-TestSet4ALT

emirdemirel

❤️40

This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.

CC0-1.0

Jupyter Notebook

Updated 2 years ago

CrowdSpeech

Toloka

❤️20

Benchmark Dataset for Crowdsourced Audio Transcription

NOASSERTION

Python

Updated 6 months ago

Enrichment_Sandbox

MaayanLab

❤️40

Benchmarking enrichment analysis algorithms using transcription factor-gene and drug-gene libraries.

GPL-3.0

Python

Updated 7 years ago

stt-bench

kalpalabs

❤️35

CLI utility for benchmarking transcription models on Indic datasets

MIT

Python

Updated 2 months ago

meeting-transcription-bench

micdarau

💛70

Benchmark transcription APIs against real meeting audio. Measure WER, diarization, latency, and cost.

MIT

Python

Updated 2 days ago

asr-benchmark

ehabmmoaty

🧡55

Benchmark ASR models (Whisper, VibeVoice-ASR, Qwen3-ASR, XEUS, Azure Speech) for Arabic/English transcription — built for Anees AI companion

Python

Updated 3 weeks ago

Whisper-Arabic-Poetry-Performance

Bilel-Eljaamii

🧡50

Benchmarking OpenAI Whisper models (tiny→turbo) for classical Arabic poetry transcription (Amr ibn Kulthum’s Mu'allaqat). Metrics: speed, accuracy, disk usage. Error analysis on diacritics (tashkeel) & archaic vocabulary. Includes Python scripts, dataset (audio samples), and visualizations. #ArabicNLP #ASR #Whisper

MIT

Python

Updated 2 months ago

arabicnlp asr openai +3

transcription-benchmarks

extrange

❤️35

Speech to text model benchmarks

Python

Updated 1 year ago

transcription whisper

State-Transcription-Benchmark

Maryland-State-Innovation-Team

❤️35

A pipeline to construct state-specific audio transcription benchmarks

Python

Updated 5 months ago

Audio-Multimodal-AI-Resources

danielrosehill

❤️35

A compilation of resources (model profiles, benchmarks, docs) for multimodal AI models with audio understanding (esp. focused on ASR and transcription use-cases)

Updated 3 months ago

asr audio-multimodal audio-text-to-text +3

semax-cognitive-testing

amanmsiddiqui

🧡55

An open-source cognitive benchmarking suite for N=1 biohacking. Features automated AI grading (Gemini 3.0), voice transcription (Whisper), and longitudinal data visualization.

Python

Updated 1 week ago

whisper_testing

jhu-sheridan-libraries

🧡50

A comprehensive testing and benchmarking suite for Whisper speech recognition models, focusing on transcription and diarization performance. This project tests C++ and Python implementations to evaluate Whisper's capabilities across different scenarios.

C++

Updated 4 days ago

React_agent_Gaia

Pandagan-85

❤️40

ReAct-based AI agent using LangGraph for GAIA benchmark evaluation. Handles audio/video transcription, web search, file analysis, and complex reasoning chains. Achieves autonomous execution on real-world assistant tasks with 15+ specialized tools.

MIT

Python

Updated 8 months ago

gaia-api langgraph reactagent

VR-editor-Positions-Available-

mting4life

❤️35

Fall into a new career at Amphion! Enjoy working with THE BEST in the country (all US-based). NOW HIRING - Home-Based Employee Status Careers: FT/PT Speech Recognition Editors for days, nights and weekends. Multiple positions using our new technology, "Triton," incorporating M*Modal and Benchmark KB. Also recruiting for experienced eScription and iChart MTs. Requires at least two years of acute-care hospital and/or clinic (at least four specialties) medical transcription/medical editing experience in addition to any formal training. Proven ability to move between accounts with urgency and accuracy a must! Up to 90% of workload will involve voice recognition editing. Work schedule you design needs to include one weekend day or night. PC and high-speed cable-modem or DSL required; no dial-up or satellite, please. Guaranteed hourly rate for the 30 days of employment. Pay per line with production bonuses (65 VBC line) and quality incentives (AHDI Book of Style utilized). Evening and weekend pay differentials. Full-time benefits include Health, Life, Dental, Vision, Flexible Spending Account, 401k, Paid Time Off, CHDS/RHDS Credential Maintenance Reimbursement, Referral Bonuses, and Direct Deposit of Paychecks (on time!). Friendly, knowledgeable, technical support staff with daily feedback from an experienced transcription management team. If you're looking for a career, not just another job, get ready to shine bright at Amphion. We have stable, new, large accounts and we are looking for contributors to our success. If you have proven work independence and an excellent quality and production history, contact us today. Amphion is poised for great things in 2014 and we'd love to include you! Become a fan of Amphion on Facebook! Amphion Medical Solutions. . . Where plenty of work gets done -- but fun makes a regular appearance! On-line application and skills assessment available to you 24/7: http://amphionms.mttest.com Equal Opportunity Employer

Updated 2 years ago

transcription-benchmarks

nuhs-projects

❤️15

Speech to text model benchmarks

Python

Updated 10 months ago

CrowdSpeech

k-rks

❤️35

Benchmark Dataset for Crowdsourced Audio Transcription

NOASSERTION

Updated 7 months ago

stt-benchmarking-tool

Erdosity

❤️40

Speech-To-Text (STT)/Transcription Services Benchmarking Tool

Apache-2.0

JavaScript

Updated 7 years ago

ML_ATB2

philip-brohan

❤️35

Solve the auto-transcription benchmark 2 with Machine learning

Python

Updated 5 years ago

agora

omar-elamin

🧡55

AI vendor eval platform — benchmark transcription vendors side-by-side

TypeScript

Updated 1 week ago

DictaBench

DevStrategist

🧡60

Benchmark voice dictation apps — measure transcription latency with precise timing metrics

MIT

Python

Updated 3 weeks ago

benchmark latency macos +4

Benchmarking_DL_tools_for_TF_prediction

Laia90

❤️30

Files for the Bachelor thesis "Benchmarking deep learning tools for transcription factor prediction".

Shell

Updated 1 year ago

transcribe-model-benchmark

searchandrescuegg

❤️35

benchmarking small large language models for use with transcribe (transcription of fire dispatch calls)

MIT

Updated 3 months ago

asrbench-cli

ASRBench

❤️40

A command-line tool for the ASRBench framework, simplifying audio transcription system benchmarking with a single config file, supporting popular and custom transcription systems

MIT

Python

Updated 9 months ago

ai asr benchmark +6

whisper-hardware-test

jensse

🧡55

A Python script to benchmark Whisper transcription performance on varying hardware (CPU vs. GPU) using Norwegian audio files.

Python

Updated 1 week ago

medical-rag-comparison

jv813yh

❤️45

A comprehensive benchmarking project designed to evaluate and compare three different Retrieval-Augmented Generation (RAG) architectures using clinical medical transcription data.

Python

Updated 1 month ago

kaggle-dataset ollama openai +2

dilbert-strip-transcriber

renganathc

🧡55

A fully reproducible and deterministic benchmark evaluating Vision Language Models on structured dilbert comic strip transcription with strictly defined evaluation rules.

Jupyter Notebook

Updated 3 weeks ago

vosk_large_whisper_small_benchmark

MehediHasan-ds

❤️35

A high-performance, offline real-time speech-to-text system optimized for CPU only. This project benchmarks and compares Vosk and Whisper.cpp models for call-quality speech transcription with minimal latency.

TypeScript

Updated 4 months ago

whisperX-batch

gracee3

🧡55

Small, practical toolkit for audio cleaning and batch transcription using WhisperX, with a simple benchmarking harness for testing ASR configurations and performance.

Python

Updated 3 weeks ago

Uzbek-ASR-Benchmarking

UmrbekAbdullayev

❤️35

This project benchmarks multiple Uzbek speech-to-text (ASR) models on the same audio files using Hugging Face pipelines. It automatically loads each model, runs transcription, and saves the output into a `/results` folder.

Python

Updated 3 months ago
