Search Results

Found 210,246 repositories(showing 30)

transformers

huggingface

💚100

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

159.2k

32.8k

Apache-2.0

Python

Updated 19 minutes ago

audiodeep-learningdeepseek+16

yt-dlp

💚100

A feature-rich command-line audio/video downloader

156.1k

12.8k

Unlicense

Python

Updated 2 minutes ago

clidownloaderpython+4

lossless-cut

mifi

💚100

The swiss army knife of lossless video/audio editing

39.7k

1.9k

GPL-2.0

TypeScript

Updated 1 hour ago

codeccuteditor+7

bark

suno-ai

💚100

🔊 Text-Prompted Generative Audio Model

39.1k

4.7k

MIT

Jupyter Notebook

Updated 41 minutes ago

OpenVoice

myshell-ai

💚100

Instant voice cloning by MIT and MyShell. Audio foundation model.

36.2k

4.0k

MIT

Python

Updated 21 minutes ago

text-to-speechttsvoice-clone+1

diffusers

huggingface

💚100

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

33.3k

6.9k

Apache-2.0

Python

Updated 1 hour ago

deep-learningdiffusionflux+12

File Upload widget with multiple file selection, drag&drop support, progress bar, validation and preview images, audio and video for jQuery. Supports cross-domain, chunked and resumable file uploads. Works with any server-side platform (Google App Engine, PHP, Python, Ruby on Rails, Java, etc.) that supports standard HTML form file uploads.

30.8k

7.8k

MIT

PHP

Updated 14 hours ago

Seal

JunkFood02

💚91

🦭 Video/Audio Downloader for Android, based on yt-dlp

25.6k

1.1k

GPL-3.0

Kotlin

Updated 7 minutes ago

androidf-droidjetpack-compose+5

howler.js

goldfire

💚95

Javascript audio library for the modern web.

25.2k

2.3k

MIT

JavaScript

Updated 2 hours ago

audioaudio-libraryhowler+5

labelImg

HumanSignal

💚100

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

24.9k

6.6k

MIT

Python

Updated 14 hours ago

annotationsdeep-learningdetection+6

audiocraft

facebookresearch

💚95

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

23.2k

2.6k

MIT

Jupyter Notebook

Updated 2 hours ago

BackgroundMusic

kyleneideck

💚93

Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.

18.8k

755

GPL-2.0

C++

Updated 46 minutes ago

audioaudio-utilitycpp+2

BlackHole

ExistentialAudio

💚93

BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.

18.8k

786

GPL-3.0

Updated 1 hour ago

audiodriverloopback+2

buzz

chidiwilliams

💚94

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

18.6k

1.4k

MIT

Python

Updated 36 minutes ago

whisper

mediamtx

bluenviron

💚95

Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS / MPEG-TS / RTP media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.

18.4k

2.2k

MIT

Updated 37 minutes ago

gogolanghls+15

audacity

💚95

Audio Editor

16.8k

2.5k

NOASSERTION

C++

Updated 36 minutes ago

audiocross-platformeditor+2

Tone.js

Tonejs

💚95

A Web Audio framework for making interactive music in the browser.

14.6k

1.0k

MIT

TypeScript

Updated 3 minutes ago

javascriptmusicsamples+4

SadTalker

OpenTalker

💚99

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

13.7k

2.6k

NOASSERTION

Python

Updated 2 hours ago

audio-driven-talking-facecvpr2023deep-fake+6

uamp

android

💚98

A sample audio app for Android

13.2k

3.8k

Apache-2.0

Kotlin

Updated 6 hours ago

AudioKit

💚91

Audio synthesis, processing, & analysis platform for iOS, macOS and tvOS

11.3k

1.6k

MIT

Swift

Updated 6 hours ago

audioaudiokitios+7

Captura

MathewSachin

💚91

Capture Screen, Audio, Cursor, Mouse Clicks and Keystrokes

10.6k

2.0k

MIT

Updated 15 hours ago

capturechocolateydotnet+12

wavesurfer.js

katspaugh

💚90

Audio waveform player

10.2k

1.8k

BSD-3-Clause

TypeScript

Updated 1 minute ago

audiojavascriptmusic+3

moshi

kyutai-labs

💛84

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

10.0k

931

Apache-2.0

Python

Updated 12 minutes ago

pydub

jiaaro

💛86

Manipulate audio with a simple and easy high level interface

9.8k

1.1k

MIT

Python

Updated 9 hours ago

Amphion

open-mmlab

💛88

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

9.7k

805

MIT

Python

Updated 8 hours ago

audio-generationaudio-synthesisaudioldm+14

engine-sim

ange-yaghi

💛88

Combustion engine simulator that generates realistic audio.

9.3k

872

MIT

C++

Updated 2 days ago

enginesimulation

ytDownloader

aandrew-me

💛87

Desktop App for downloading Videos and Audios from hundreds of sites

9.1k

781

GPL-3.0

JavaScript

Updated 9 hours ago

appimagecompressordownloader+17

clone-voice

jianchang512

💛84

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

8.9k

984

NOASSERTION

Python

Updated 1 day ago

clonevoicespeech-analysissts+2

hallo

fudan-generative-vision

💚90

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

8.6k

1.1k

MIT

Python

Updated 17 hours ago

face-animationimage-animationvideo-animation

librosa

💛84

Python library for audio and music analysis

8.3k

1.0k

ISC

Python

Updated 56 minutes ago

audiodsplibrosa+3

GitHub Explorer

Search Results

transformers

yt-dlp

lossless-cut

bark

OpenVoice

diffusers

jQuery-File-Upload

Seal

howler.js

labelImg

audiocraft

BackgroundMusic

BlackHole

buzz

mediamtx

audacity

Tone.js

SadTalker

uamp

AudioKit

Captura

wavesurfer.js

moshi

pydub

Amphion

engine-sim

ytDownloader

clone-voice

hallo

librosa

transformers

yt-dlp

lossless-cut

bark

OpenVoice

diffusers

jQuery-File-Upload

Seal

howler.js

labelImg

audiocraft

BackgroundMusic

BlackHole

buzz

mediamtx

audacity

Tone.js

SadTalker

uamp

AudioKit

Captura

wavesurfer.js

moshi

pydub

Amphion

engine-sim

ytDownloader

clone-voice

hallo

librosa