Search Results

Found 252 repositories(showing 30)

audiocraft

facebookresearch

💚95

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

23.2k

2.6k

MIT

Jupyter Notebook

Updated 2 hours ago

encodec

facebookresearch

💛72

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

3.9k

356

MIT

Python

Updated 1 day ago

WavTokenizer

jishengpeng

🧡57

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

1.3k

110

MIT

Python

Updated 1 week ago

acousticaudio-representationcodec+9

FunCodec

modelscope

🧡66

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

443

MIT

Python

Updated 6 days ago

audio-generationaudio-quantizationcodec+5

SimVQ

youngsheen

🧡55

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

323

MIT

Python

Updated 1 week ago

audioencodecimage+2

WavChat

jishengpeng

🧡55

A Survey of Spoken Dialogue Models (60 pages)

317

Updated 1 week ago

duplexencodecgpt-4o+11

encodechka

avidale

🧡60

The tiniest sentence encoder for Russian language

246

MIT

Python

Updated 2 weeks ago

natural-language-processingnlppython+2

encodec.cpp

PABannier

🧡50

Port of Meta's Encodec in C/C++

229

C++

Updated 1 week ago

encodec-pytorch

ZhikangNiu

💛70

unofficial implementation of the High Fidelity Neural Audio Compression

176

MIT

Python

Updated 3 days ago

audio-compressionaudio-processingencodec+1

encodecmae

habla-liaa

❤️45

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

101

Python

Updated 1 month ago

audiodeep-learningencodec+2

pflow-encodec

seastar105

❤️35

Implementation of TTS model based on NVIDIA P-Flow TTS Paper

Python

Updated 3 months ago

EnCodec_Trainer

Mikxox

❤️35

No description available

Python

Updated 1 month ago

PhoneLM

MiscellaneousStuff

❤️40

(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.

MIT

Jupyter Notebook

Updated 1 year ago

NeuralCodecs

DillionLowry

❤️40

Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia

Apache-2.0

Updated 3 months ago

audio-codecneural-audioneural-audio-codec+2

vall-e-encodec

voidful

❤️20

No description available

Python

Updated 1 year ago

audiocodecs

lucadellalib

💛70

A collections of audio codecs with a standardized API

Apache-2.0

Python

Updated 6 days ago

codecdacencodec+10

DAC-JAX

DBraun

🧡65

JAX Implementations of Descript Audio Codec and EnCodec

MIT

Python

Updated 17 hours ago

audioaudio-codecaudio-compression+2

Nvidia-Video-Codec

BreakingY

💛70

Nvidia video hard decoding, rendering, soft/hard encoding, and writing to MP4 file ; Nvidia视频硬解码、渲染、软/硬编码并写入MP4文件

MIT

Updated 5 days ago

codecdecodecencodec+3

Encodec-Stream

thomas-xin

❤️35

A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU support.

MIT

Python

Updated 4 months ago

HuffmanCodec

dadaxian

❤️35

通过哈夫曼树编解码原理编写解码器，实现文件的压缩与解压缩

C++

Updated 1 year ago

encodechuffman-coding

nanogpt-Audio

deepanwadhwa

❤️45

An experimental nanogpt fork that learns to speak Shakespeare by modeling EnCodec audio tokens.

MIT

Python

Updated 2 months ago

audioaudio-processinggpt+1

encodeconv

youlanhai

❤️30

文件编码转换器

C++

Updated 1 year ago

TTSCeleb

Supremolink81

🧡50

A TTS app where you can clone the voices of any person you wish.

MIT

Python

Updated 2 months ago

barkbeatsencodec+5

supervoice-libriheavy-encodec

ex3ndr

❤️35

Compressed using encodec librilight datasets

Jupyter Notebook

Updated 6 months ago

audiojourney.github.io

audiojourney

❤️30

Audio-Journey: Visual+LLM-aided Audio Encodec Diffusion

BSD-2-Clause

Jupyter Notebook

Updated 1 year ago

EncodeConvert

stillhere

❤️20

GBK/UTF-8编码转换工具 by Yong

Java

Updated 8 years ago

lip2speech

MiscellaneousStuff

❤️40

Combines VALL-E, AV-HuBERT and Encodec methods to synthesis speech from lip movements

MIT

Updated 2 years ago

G711_EncodecAndDecodec

phoenixZZZ

❤️25

No description available

Updated 6 years ago

encodeChIPqc

imbforge

❤️25

No description available

Updated 3 years ago

encodec

lifeiteng

❤️10

No description available

MIT

Python

Updated 8 months ago

GitHub Explorer

Search Results

audiocraft

encodec

WavTokenizer

FunCodec

SimVQ

WavChat

encodechka

encodec.cpp

encodec-pytorch

encodecmae

pflow-encodec

EnCodec_Trainer

PhoneLM

NeuralCodecs

vall-e-encodec

audiocodecs

DAC-JAX

Nvidia-Video-Codec

Encodec-Stream

HuffmanCodec

nanogpt-Audio

encodeconv

TTSCeleb

supervoice-libriheavy-encodec

audiojourney.github.io

EncodeConvert

lip2speech

G711_EncodecAndDecodec

encodeChIPqc

encodec

audiocraft

encodec

WavTokenizer

FunCodec

SimVQ

WavChat

encodechka

encodec.cpp

encodec-pytorch

encodecmae

pflow-encodec

EnCodec_Trainer

PhoneLM

NeuralCodecs

vall-e-encodec

audiocodecs

DAC-JAX

Nvidia-Video-Codec

Encodec-Stream

HuffmanCodec

nanogpt-Audio

encodeconv

TTSCeleb

supervoice-libriheavy-encodec

audiojourney.github.io

EncodeConvert

lip2speech

G711_EncodecAndDecodec

encodeChIPqc

encodec