Found 13,065 repositories (showing 30)
Anjok07
GUI for a Vocal Remover that uses Deep Neural Networks.
abus-aikorea
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
stemrollerapp
Isolate vocals, drums, bass, and other instrumental stems from any song
jianchang512
An extremely simple tool for separating vocals and background music using 2stems/4stems/5stems models; operated entirely through a local web page, with no internet connection required.
Music-and-Culture-Technology-Lab
Omniscient Mozart: able to transcribe everything in the music, including vocals, drums, chords, beats, instruments, and more.
tsurumeso
Vocal Remover using Deep Neural Networks
ardha27
All-in-One Version: YouTube WAV download, vocal separation, audio splitting, training, and inference using Google Colab
nomadkaraoke
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
YARC-Official
YARG (a.k.a. Yet Another Rhythm Game) is a free, open-source, plastic guitar game that is still in development. It supports guitar (five fret), drums (plastic or e-kit), vocals, pro-keys, and more!
Eddycrack864
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
JeffreyCA
Self-hostable web app for isolating the vocal, accompaniment, bass, and drums of any song. Supports Spleeter, Demucs, BS-RoFormer. Built with React and Django.
christian-byrne
Separate stems (vocals, bass, drums, other) from audio. Recombine, tempo match, slice/crop audio
rakuri255
AI-based tool that extracts vocals, lyrics, and pitch from music to auto-generate UltraStar Deluxe, MIDI, and note files. It automates tapping, adds lyric text, pitches vocals, and creates karaoke files.
gabolsgabs
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
primaryobjects
Gender recognition by voice and speech analysis
KimberleyJensen
No description available
VocalPodcastProject
A powerful, beautiful, and simple podcast client for the modern free desktop.
Vocaluxe
Vocaluxe is an open source singing game inspired by SingStar™ and Ultrastar Deluxe.
alexcrist
A vocal pitch correction web application (like Autotune)
Lex-au
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback, and works with local LLM/TTS services via OpenAI-compatible endpoints.
atifazam
A simple JavaScript plugin to show people how to say your name correctly.
KakaruHayate
A CLI tool for splitting vocal timbre.
Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification: that is, developing two-class classifiers that can discriminate between utterances of a subject suffering from, say, vocal fold paralysis and utterances of a healthy subject.

The mathematical modeling of the human speech production system suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitude of the short-term speech spectrum. LPC-derived cepstral coefficients are guaranteed to discriminate between the contribution of the system (e.g., the vocal tract) and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) can also be derived. These traditional, so-to-speak, features will be tested against agnostic features extracted by convolutional neural networks (CNNs) (e.g., auto-encoders) [4].

The pattern recognition step will be based on Gaussian mixture model (GMM) classifiers, K-nearest neighbor classifiers, Bayes classifiers, as well as deep neural networks. The Massachusetts Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources, such as KALDI, will be used toward achieving our goal. Comparisons will be made against [6-8].
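The MFCC-plus-GMM pipeline described above can be sketched in Python. This is a minimal illustration, not the project's actual code: NumPy and scikit-learn stand in for the planned KALDI-based library, and all function names, frame sizes, and parameters here are illustrative choices.

```python
# Sketch of the pipeline above: per-frame MFCC features feed one GMM per
# class, and a whole utterance is labeled by total log-likelihood.
# Illustrative only; NumPy/scikit-learn stand in for the planned KALDI tools.
import numpy as np
from sklearn.mixture import GaussianMixture

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mfcc(signal, sr=16000, n_mfcc=13, frame_len=400, hop=160,
         n_fft=512, n_mels=26):
    """Return one MFCC vector per 25 ms frame (10 ms hop)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2 / n_fft
    # triangular mel filterbank between 0 Hz and the Nyquist frequency
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fbank = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        lo, c, hi = bins[m - 1], bins[m], bins[m + 1]
        fbank[m - 1, lo:c] = (np.arange(lo, c) - lo) / max(c - lo, 1)
        fbank[m - 1, c:hi] = (hi - np.arange(c, hi)) / max(hi - c, 1)
    logmel = np.log(power @ fbank.T + 1e-10)
    # DCT-II decorrelates the log-energies; keep the first n_mfcc terms
    n = np.arange(n_mels)
    dct = np.cos(np.pi * np.arange(n_mfcc)[:, None] * (2 * n + 1) / (2 * n_mels))
    return logmel @ dct.T                       # shape: (n_frames, n_mfcc)

def fit_gmm(feature_arrays, k=2):
    """Fit one diagonal-covariance GMM on pooled per-frame features."""
    return GaussianMixture(n_components=k, covariance_type="diag",
                           random_state=0).fit(np.vstack(feature_arrays))

def classify(frames, gmm_healthy, gmm_disordered):
    """Pick the class whose GMM gives the higher total log-likelihood."""
    if gmm_healthy.score_samples(frames).sum() > \
       gmm_disordered.score_samples(frames).sum():
        return "healthy"
    return "disordered"
```

The same per-class likelihood scoring carries over unchanged if the MFCC front end is swapped for LPC- or PLP-derived features, which is why the feature and classifier stages are kept as separate functions here.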
jatinkrmalik
Free, open-source, 100% offline voice dictation for Linux. Speak and type anywhere via whisper.cpp, Whisper & VOSK engines, GPU-accelerated, works on X11 + Wayland!
KingOfBrian
Objective-C shim layer for Speech Recognition
CitizenLabDotCo
Go Vocal is a digital democracy platform that facilitates community participation and co-creation. Participants can post ideas, contribute to discussions, or choose to vote and prioritize community projects.
SuperKogito
:sound: :boy: :girl: Voice-based gender recognition using mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM)
Do-sth-sharp
A JUCE-based Open Source DAW
ardamavi
Vocalization sign language iOS App with deep learning using CoreML.