Found 252 repositories(showing 30)
facebookresearch
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
facebookresearch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
jishengpeng
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
modelscope
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
youngsheen
[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
jishengpeng
A Survey of Spoken Dialogue Models (60 pages)
avidale
The tiniest sentence encoder for Russian language
PABannier
Port of Meta's Encodec in C/C++
ZhikangNiu
unofficial implementation of the High Fidelity Neural Audio Compression
habla-liaa
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
seastar105
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
Mikxox
No description available
MiscellaneousStuff
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
DillionLowry
Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia
voidful
No description available
lucadellalib
A collections of audio codecs with a standardized API
DBraun
JAX Implementations of Descript Audio Codec and EnCodec
BreakingY
Nvidia video hard decoding, rendering, soft/hard encoding, and writing to MP4 file ; Nvidia视频硬解码、渲染、软/硬编码并写入MP4文件
thomas-xin
A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU support.
dadaxian
通过哈夫曼树编解码原理编写解码器,实现文件的压缩与解压缩
deepanwadhwa
An experimental nanogpt fork that learns to speak Shakespeare by modeling EnCodec audio tokens.
youlanhai
文件编码转换器
Supremolink81
A TTS app where you can clone the voices of any person you wish.
Compressed using encodec librilight datasets
audiojourney
Audio-Journey: Visual+LLM-aided Audio Encodec Diffusion
stillhere
GBK/UTF-8编码转换工具 by Yong
MiscellaneousStuff
Combines VALL-E, AV-HuBERT and Encodec methods to synthesis speech from lip movements
phoenixZZZ
No description available
imbforge
No description available
lifeiteng
No description available