Search Results

Found 10,601 repositories(showing 30)

labelImg

HumanSignal

💚100

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.

24.9k

6.6k

MIT

Python

Updated 8 hours ago

annotationsdeep-learningdetection+6

clone-voice

jianchang512

💛84

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

8.9k

982

NOASSERTION

Python

Updated 14 hours ago

clonevoicespeech-analysissts+2

ruby_llm

crmne

💛73

One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, Azure, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, xAI, GPUStack & OpenAI compatible APIs. Agents, Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.

3.8k

417

MIT

Ruby

Updated 5 hours ago

agentsaianthropic+17

stable-audio-tools

Stability-AI

💛73

Generative models for conditional audio generation

3.7k

440

MIT

Python

Updated 16 hours ago

AsrTools

WEIFENG2333

💛76

3.2k

297

GPL-3.0

Python

Updated 1 day ago

aeneas

readbeyond

💛76

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

2.8k

272

AGPL-3.0

Python

Updated 6 hours ago

alignmentaudiocli+17

LiveCaptions-Translator

SakiRinn

💛75

Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.

2.7k

189

Apache-2.0

Updated 52 minutes ago

apiapi-integrationaudio-to-text+6

AudioMass

pkalogiros

💛70

Free full-featured web-based audio & waveform editing tool

2.3k

278

JavaScript

Updated 4 hours ago

arduino-audio-tools

pschatzmann

💛76

Arduino Audio Tools (a powerful Audio library not only for Arduino)

2.2k

347

GPL-3.0

Updated 21 hours ago

arduinoarduino-libraryaudio+11

spatial-media

google

💛72

Specifications and tools for 360º video and spatial audio.

2.1k

465

NOASSERTION

Python

Updated 1 day ago

sonobus

sonosaurus

🧡69

Source code for SonoBus, a real-time network audio streaming collaboration tool.

2.0k

164

GPL-3.0

C++

Updated 1 day ago

aaxaudioaudiounit+10

vokoscreenNG

vkohaupt

🧡63

vokoscreenNG is a powerful screencast creator in many languages to record the screen, an area or a window (Linux only). Recording of audio from multiple sources is supported. With the built-in camera support, you can make your video more personal. Other tools such as systray, magnifying glass, countdown, timer, Showclick and Halo support will help

1.4k

110

GPL-2.0

C++

Updated 1 hour ago

capturelinuxopensource+7

fansly-downloader

Avnsx

💛72

Easy to use fansly.com content downloading tool. Written in python, but ships as a standalone Executable App for Windows too. Enjoy your Fansly content offline anytime, anywhere in the highest possible content resolution! Fully customizable to download in bulk or single: photos, videos & audio from timeline, messages, collection & specific posts 👍

1.4k

GPL-3.0

Python

Updated 3 hours ago

cross-platformdatabasedatascraping+17

oTranscribe

💛73

A free & open tool for transcribing audio interviews

1.2k

213

MIT

JavaScript

Updated 3 days ago

ai-game-devtools

Yuan-ManX

💛72

Here we will keep track of the latest AI Game Development Tools, including LLM, World Model, Agent, Code, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analytics. 🔥

1.1k

110

MIT

Updated 5 hours ago

ai-platformai-toolkitaigc+8

audino

midas-research

🧡68

Open source audio annotation tool for humans

1.1k

142

NOASSERTION

TypeScript

Updated 5 days ago

annotation-toolaudio-annotationaudio-processing+4

sample-generator

Harmonai-org

🧡63

Tools to train a generative model on arbitrary audio samples

1.1k

174

MIT

Jupyter Notebook

Updated 3 weeks ago

ai-audio-datasets

Yuan-ManX

💛72

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

924

MIT

Updated 15 hours ago

aigcartificial-intelligenceaudio+6

chatgpt-cli

kardolus

🧡67

ChatGPT CLI is a powerful, multi-provider command-line interface for working with modern LLMs. It supports OpenAI, Azure, Perplexity, LLaMA, and more, with features like streaming, interactive chat, prompt files, image/audio I/O, MCP tool calls, and an experimental agent mode for safe, multi-step automation.

911

MIT

Updated 10 hours ago

agentagentic-aiazure+10

audio_shop

robertfoss

🧡51

Your friendly neighbourhood script for mangling images or video using audio editing tools

875

GPL-2.0

Shell

Updated 1 month ago

auditok

amsehili

💛72

An audio/acoustic activity detection and audio segmentation tool

844

100

MIT

Python

Updated 19 hours ago

audio-activitiesaudio-dataaudio-segmentation+3

TTS-Audio-Suite

diodiogod

🧡67

A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

832

NOASSERTION

Python

Updated 4 hours ago

ai-audioaudioaudio-editing+17

APT

rnchg

💛72

AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and other models, one-click batch intelligent processing of pictures, videos, audio, etc.

774

MIT

Updated 2 days ago

aiai-frameworkaigc+15

flymd

flyhunterl

🧡66

高性能Markdown笔记工具！免费AI，智能便签、TODO推送、本地知识库、AI小说引擎。PDF解析、自动语音笔记、录音转文本。毫秒级启动High-performance Markdown note tool! Free AI, smart notes, TODO reminders, local knowledge base, AI novel engine. PDF parsing, auto voice notes, audio-to-text. Millisecond startup.

767

NOASSERTION

JavaScript

Updated 3 hours ago

wpair-app

zalexdev

🧡67

WPair is a defensive security research tool that demonstrates the CVE-2025-36911 (eg WhisperPair) vulnerability in Google's Fast Pair protocol. This vulnerability affects millions of Bluetooth audio devices worldwide, allowing unauthorized pairing and potential microphone access without user consent.

754

Apache-2.0

Kotlin

Updated 12 hours ago

audiolib.js

jussi-kalliokoski

❤️36

audiolib.js is a powerful audio tools library for javascript.

672

JavaScript

Updated 3 months ago

mp4ff

Eyevinn

🧡67

Library and tools for working with MP4 files containing video, audio, subtitles, or metadata. The focus is on fragmented files. Includes mp4ff-info, mp4ff-encrypt, mp4ff-decrypt and other tools.

625

116

MIT

Updated 5 days ago

aacac-3adts+17

Perth

resemble-ai

💛71

Open Audio Watermarking Tool

488

MIT

Python

Updated 15 hours ago

spchcat

petewarden

🧡61

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

482

MPL-2.0

Updated 3 weeks ago

linuxraspberry-pispeech-recognition

audio-development-tools

Yuan-ManX

💛71

Audio Development Tools (ADT) is a project for advancing sound, speech, and music technologies, featuring components for machine learning, sound synthesis, speech and music generation, signal processing, game audio, digital audio workstations (DAWs), and more.

445

MIT

Updated 15 hours ago

artificial-intelligenceaudioaudio-generation+10

GitHub Explorer

Search Results

labelImg

clone-voice

ruby_llm

stable-audio-tools

AsrTools

aeneas

LiveCaptions-Translator

AudioMass

arduino-audio-tools

spatial-media

sonobus

vokoscreenNG

fansly-downloader

oTranscribe

ai-game-devtools

audino

sample-generator

ai-audio-datasets

chatgpt-cli

audio_shop

auditok

TTS-Audio-Suite

APT

flymd

wpair-app

audiolib.js

mp4ff

Perth

spchcat

audio-development-tools

labelImg

clone-voice

ruby_llm

stable-audio-tools

AsrTools

aeneas

LiveCaptions-Translator

AudioMass

arduino-audio-tools

spatial-media

sonobus

vokoscreenNG

fansly-downloader

oTranscribe

ai-game-devtools

audino

sample-generator

ai-audio-datasets

chatgpt-cli

audio_shop

auditok

TTS-Audio-Suite

APT

flymd

wpair-app

audiolib.js

mp4ff

Perth

spchcat

audio-development-tools