Search Results

Found 10,619 repositories (showing 30)

annotated_deep_learning_paper_implementations

labmlai

💚100

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

66.2k

6.7k

MIT

Python

Updated 51 minutes ago

attention, deep-learning, deep-learning-tutorial, +10

GPT-SoVITS

RVC-Boss

💚100

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

56.4k

6.2k

MIT

Python

Updated 7 minutes ago

text-to-speech, tts, vits, +3

pytorch-image-models

huggingface

💚100

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

36.6k

5.1k

Apache-2.0

Python

Updated 51 minutes ago

augmix, convnext, distributed-training, +17

Retrieval-based-Voice-Conversion-WebUI

RVC-Project

💚100

Easily train a good VC model with voice data <= 10 mins!

35.1k

5.0k

MIT

Python

Updated 1 hour ago

audio-analysis, change, conversational-ai, +13

fish-speech

fishaudio

💚95

SOTA Open Source TTS

29.1k

2.5k

NOASSERTION

Python

Updated 4 minutes ago

llama, transformer, tts, +4

so-vits-svc

svc-develop-team

💚100

SoftVC VITS Singing Voice Conversion

28.0k

5.1k

AGPL-3.0

Python

Updated 2 hours ago

ai, audio-analysis, deep-learning, +14

LaTeX-OCR

lukas-blecher

💚98

pix2tex: Using a ViT to convert images of equations into LaTeX code.

16.3k

1.3k

MIT

Python

Updated 1 hour ago

dataset, deep-learning, im2latex, +14

sherpa-onnx

k2-fsa

💛89

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

11.4k

1.3k

Apache-2.0

C++

Updated 12 minutes ago

aarch64, android, arm32, +17

Amphion

open-mmlab

💛88

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

9.7k

805

MIT

Python

Updated 2 hours ago

audio-generation, audio-synthesis, audioldm, +14

so-vits-svc-fork

voicepaw

💚92

so-vits-svc fork with realtime support, improved interface and more features.

9.3k

1.2k

NOASSERTION

Python

Updated 9 hours ago

contentvec, deep-learning, gan, +12

Bert-VITS2

fishaudio

💛86

vits2 backbone with multilingual-bert

8.7k

1.3k

AGPL-3.0

Python

Updated 2 hours ago

agent, bert, bert-vits, +8

vits

jaywalnut310

💚92

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7.8k

1.4k

MIT

Python

Updated 2 hours ago

deep-learning, pytorch, speech-synthesis, +2

VITS-fast-fine-tuning

Plachtaa

💛77

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

5.0k

735

Apache-2.0

Python

Updated 2 days ago

so-vits-svc

innnky

💛74

Singing voice timbre conversion model based on VITS and SoftVC

3.8k

AGPL-3.0

Python

Updated 2 days ago

Applio

IAHispano

💛73

A simple, high-quality voice conversion tool focused on ease of use and performance.

3.1k

512

MIT

Python

Updated 53 minutes ago

ai, applio, pytorch, +11

whisper-vits-svc

PlayVoice

💛77

Core Engine of Singing Voice Conversion & Singing Voice Clone

2.9k

915

MIT

Python

Updated 3 days ago

change, diff-svc, diffusion, +7

MoeGoe

CjangCjengh

💛75

Executable file for VITS inference

2.4k

246

MIT

Python

Updated 4 days ago

Genie-TTS

High-Logic

🧡67

GPT-SoVITS ONNX Inference Engine & Model Converter

1.5k

101

MIT

Python

Updated 11 hours ago

gpt-sovits, text-to-speech, tts, +3

lightly-train

lightly-ai

🧡67

All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.

1.4k

AGPL-3.0

Python

Updated 12 hours ago

computer-vision, contrastive-learning, deep-learning, +17

ChatWaifu_Mobile

Voine

💛73

Mobile version of the anime-style AI waifu chat app

1.4k

148

MIT

C++

Updated 3 hours ago

android, chatgpt, compose, +4

emotional-vits

innnky

💛73

Emotion-controllable speech synthesis model that requires no emotion annotations, based on VITS

1.4k

169

MIT

Jupyter Notebook

Updated 3 days ago

vits_chinese

PlayVoice

🧡68

Best-practice TTS based on BERT and VITS, with some features from Microsoft's NaturalSpeech; supports ONNX streaming output!

1.2k

178

MIT

Python

Updated 1 day ago

aishell3, bert, bert-vits, +4

T2T-ViT

yitu-opensource

🧡63

ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

1.2k

176

NOASSERTION

Jupyter Notebook

Updated 1 week ago

t2t-transformer, vision-transformer, vit

DragonianVoice

PriesiaMioShirakana

💛72

A C++ inference library for multiple SVC/TTS models

1.1k

135

AGPL-3.0

Updated 4 days ago

bertvits2, diffsinger, diffsvc, +9

U-ViT

baofff

💛72

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

1.1k

MIT

Jupyter Notebook

Updated 2 hours ago

RepViT

THU-MIG

💛72

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

1.1k

Apache-2.0

Jupyter Notebook

Updated 6 days ago

vits-simple-api

Artrajz

💛72

A simple VITS HTTP API, developed by extending Moegoe with additional features.

1.0k

134

AGPL-3.0

Python

Updated 2 days ago

bert-vits2, gpt-sovits, moegoe, +3

MoeTTS

luoyily

🧡62

Speech synthesis model/inference GUI repo for galgame characters based on Tacotron2, HiFi-GAN, VITS and Diff-SVC

996

GPL-3.0

Updated 2 weeks ago

Awesome-Token-Compress

daixiangzi

🧡66

A paper list of recent works on token compression for ViT and VLM

874

Updated 19 hours ago

PyTorch-Pretrained-ViT

lukemelas

🧡57

Vision Transformer (ViT) in PyTorch

853

127

Python

Updated 2 weeks ago
