Found 22,297 repositories (showing 30)
pytube
Lightweight, dependency-free Python library and CLI for downloading YouTube videos, playlists, and captions.
instaloader
Download pictures (or videos) along with their captions and other metadata from Instagram.
mwaterfall
A simple iOS photo and video browser with grid view, captions and selections.
smacke
Automagically synchronize subtitles with video.
jdepoix
A Python API for fetching the transcript/subtitles of a given YouTube video. It also works for automatically generated subtitles and requires neither an API key nor a headless browser, unlike Selenium-based solutions.
vladmandic
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
karpathy
Efficient Image Captioning code in Torch, runs on GPU
webadderall
Create polished screen recordings for free. An open-source screen recorder for Mac/Windows/Linux that adds auto-zooms, animated cursors, auto-captions and more to your videos.
ashnkumar
Keras model to generate HTML code from hand-drawn website mockups. Applies an image captioning architecture to hand-drawn source images.
denizsafak
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
SakiRinn
Lightweight and powerful real-time audio/speech translation tool based on Windows LiveCaptions.
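Machine translation for subtitles: translates subtitle files (.srt, .ass, .vtt). Compared with similar products, its distinguishing feature is that you can supply your own API key, which keeps the cost lowest. Latest version 5.3.7, released September 10, 2025.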
stephengpope
The NCA Toolkit API eliminates monthly subscription fees by consolidating common API functionalities into a single FREE API. Designed for businesses, creators, and developers, it streamlines advanced media processing, including video editing and captioning, image transformations, cloud storage, and Python code execution.
krzemienski
A curated list of awesome streaming video tools, frameworks, libraries, and learning resources.
ttengwang
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
yawiii
The Prompt Assistant enables one-click access to LLM/VLM services (Zhipu, SiliconFlow, Gemini, local Ollama, Baidu, and others) for prompt translation, polishing and expansion, and image captioning. It also supports one-click preset insertion and historical prompt search. An all-in-one prompt plugin.
abb128
Linux Desktop application that provides live captioning
BMSVieira
Movie focused HTML5 Player
rushindrasinha
Automated YouTube Shorts pipeline: news → script → AI visuals → voiceover → captions → upload
jcjohnson
Dense image captioning in Torch
Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with Google's Cloud Speech API that are used in Live Transcribe.
NVlabs
[ICCV 2025] Implementation for Describe Anything: Detailed Localized Image and Video Captioning
peteanderson80
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
royshil
OBS plugin for local speech recognition and captioning using AI
rmokady
Simple image captioning model
brh55
A pure JS react-native component to render a masonry~ish layout for images with support for dynamic columns, progressive image loading, device rotation, on-press handlers, and headers/captions.
gielcobben
Get Caption, start watching.
jhc13
Tag manager and captioner for image datasets
ratwithacompiler
Closed Captioning OBS plugin using Google Speech Recognition