Found 210,246 repositories(showing 30)
huggingface
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
yt-dlp
A feature-rich command-line audio/video downloader
mifi
The swiss army knife of lossless video/audio editing
suno-ai
π Text-Prompted Generative Audio Model
myshell-ai
Instant voice cloning by MIT and MyShell. Audio foundation model.
huggingface
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
blueimp
File Upload widget with multiple file selection, drag&drop support, progress bar, validation and preview images, audio and video for jQuery. Supports cross-domain, chunked and resumable file uploads. Works with any server-side platform (Google App Engine, PHP, Python, Ruby on Rails, Java, etc.) that supports standard HTML form file uploads.
JunkFood02
π¦ Video/Audio Downloader for Android, based on yt-dlp
goldfire
Javascript audio library for the modern web.
HumanSignal
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
facebookresearch
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
kyleneideck
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
ExistentialAudio
BlackHole is a modern macOS audio loopback driver that allows applications to pass audio to other applications with zero additional latency.
chidiwilliams
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
bluenviron
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS / MPEG-TS / RTP media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
audacity
Audio Editor
Tonejs
A Web Audio framework for making interactive music in the browser.
OpenTalker
[CVPR 2023] SadTalkerοΌLearning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
android
A sample audio app for Android
AudioKit
Audio synthesis, processing, & analysis platform for iOS, macOS and tvOS
MathewSachin
Capture Screen, Audio, Cursor, Mouse Clicks and Keystrokes
katspaugh
Audio waveform player
kyutai-labs
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
jiaaro
Manipulate audio with a simple and easy high level interface
open-mmlab
Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
ange-yaghi
Combustion engine simulator that generates realistic audio.
aandrew-me
Desktop App for downloading Videos and Audios from hundreds of sites
jianchang512
A sound cloning tool with a web interface, using your voice or any sound to record audio / δΈδΈͺεΈ¦webηι’ηε£°ι³ε ιε·₯ε ·οΌδ½Ώη¨δ½ ηι³θ²ζδ»»ζε£°ι³ζ₯ε½εΆι³ι’
fudan-generative-vision
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
librosa
Python library for audio and music analysis