Search Results

Found 3,247 repositories(showing 30)

sketch-code

ashnkumar

💛77

Keras model to generate HTML code from hand-drawn website mockups. Implements an image captioning architecture to drawn source images.

5.2k

681

Python

Updated 16 hours ago

augmentationdeep-learningimage-processing+2

bottom-up-attention

peteanderson80

💛70

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

1.5k

376

MIT

Jupyter Notebook

Updated 1 day ago

caffecaptioning-imagesfaster-rcnn+5

CLIP_prefix_caption

rmokady

💛74

Simple image captioning model

1.4k

222

MIT

Jupyter Notebook

Updated 18 hours ago

CoCa-pytorch

lucidrains

🧡62

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

1.2k

MIT

Python

Updated 1 week ago

artificial-intelligenceattention-mechanismcontrastive-learning+4

joycaption

fpgaminer

💛72

JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.

1.1k

Apache-2.0

Jupyter Notebook

Updated 11 hours ago

captioningjoycaptionvlm

Hybrid RAG system combining vector search, knowledge graph (LightRAG), and cross-encoder reranking — with Docling document parsing, visual intelligence (image/table captioning), agentic streaming chat, and inline citations. Powered by Gemini or local Ollama models.

257

Python

Updated 8 hours ago

chromadbcitationdocling+13

ComfyUI-JoyCaption

1038lab

🧡66

Joy Caption is a ComfyUI node using the LLaVA model to generate stylized image captions, supporting batch processing and GGUF models.

253

GPL-3.0

Python

Updated 2 days ago

comfyuiggufjoycaption+2

Up-Down-Captioner

peteanderson80

❤️36

Automatic image captioning model based on Caffe, using features from bottom-up attention.

249

MIT

Jupyter Notebook

Updated 6 months ago

caffecaptioning-imagesimage-captioning+1

RSICD_optimal

201528014227051

🧡56

Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)

229

Updated 3 weeks ago

datasets

qapyq

FennelFetish

🧡60

An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and LoRA.

152

AGPL-3.0

Python

Updated 2 weeks ago

aiannotationautomation+12

ComfyUI-MiniCPM

1038lab

🧡50

A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captioning and visual analysis.

148

GPL-3.0

Python

Updated 2 weeks ago

comfyuicustom-nodesgguf+5

image-caption-generator

neural-nuts

❤️36

[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow

146

BSD-3-Clause

Jupyter Notebook

Updated 4 months ago

artificial-intelligencecaptioning-imagescomputer-vision+8

Deep_Learning_in_Python_2018

snrazavi

❤️46

Deep Learning workshop including image classification, face recognition, Object detection, language modelling, image captioning and neural machine translation.

140

Jupyter Notebook

Updated 1 month ago

deep-learningdeep-neural-networksface-recognition+11

one-network-many-uses

paraschopra

❤️35

Four-in-one deep network: image search, image captioning, similar words and similar images using a single model

136

Jupyter Notebook

Updated 3 months ago

clip-gpt-captioning

jmisilo

❤️40

CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.

118

MIT

Python

Updated 7 months ago

computer-visioncvdeep-learning+7

DINO-X-MCP

IDEA-Research

🧡65

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

117

Apache-2.0

TypeScript

Updated 21 hours ago

image-recognitionmcpmcp-server+2

transformer_image_caption

njchoma

❤️40

Image Captioning based on Bottom-Up and Top-Down Attention model

104

MIT

Jupyter Notebook

Updated 7 months ago

attention-modeldeep-learningimage-captioning+2

Im2txt

HughKu

❤️25

Image captioning ready-to-go inference: show and tell model compatible with Tensorflow r1.9

Python

Updated 7 months ago

im2txtimagecaptioningpython3+1

Chen-Yang-Liu

❤️40

[IEEE GRSL 2024 🔥] RSCaMa: Remote Sensing Image Change Captioning with State Space Model

Python

Updated 1 month ago

change-captioningchange-detectionmamba+1

videoCC-data

google-research-datasets

❤️40

VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automatic pipeline starting from the Conceptual Captions Image-Captioning Dataset.

CC-BY-4.0

Updated 1 year ago

LaBERT

bearcatt

❤️40

A length-controllable and non-autoregressive image captioning model.

Python

Updated 2 months ago

controllable-image-captioningeccv2020image-captioning+1

PureT

232525

❤️20

Implementation of 'End-to-End Transformer Based Model for Image Captioning' [AAAI 2022]

Jupyter Notebook

Updated 6 months ago

image_captioning_with_transformers

zarzouram

❤️35

Pytorch implementation of image captioning using transformer-based model.

MIT

Jupyter Notebook

Updated 6 months ago

beam-searchencoder-decoderimage-captioning+6

image-caption-generator

Sajid030

🧡56

Deep learning-based image captioning with Flickr8k dataset. Code includes data prep, model training, and a Streamlit app.

116

MIT

Jupyter Notebook

Updated 1 day ago

cnnimage-caption-generatorimage-processing+6

FuseCap

RotsteinNoam

❤️45

FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

MIT

Python

Updated 2 months ago

GitHub Explorer

Search Results

sketch-code

bottom-up-attention

CLIP_prefix_caption

CoCa-pytorch

joycaption

NexusRAG

ComfyUI-JoyCaption

Up-Down-Captioner

RSICD_optimal

qapyq

ComfyUI-MiniCPM

image-caption-generator

Deep_Learning_in_Python_2018

one-network-many-uses

clip-gpt-captioning

DINO-X-MCP

transformer_image_caption

Im2txt

biomedica-etl

zerolan-core

Image-Caption-Generator

MAX-Image-Caption-Generator

image_captioning

RSCaMa

videoCC-data

LaBERT

PureT

image_captioning_with_transformers

image-caption-generator

FuseCap

sketch-code

bottom-up-attention

CLIP_prefix_caption

CoCa-pytorch

joycaption

NexusRAG

ComfyUI-JoyCaption

Up-Down-Captioner

RSICD_optimal

qapyq

ComfyUI-MiniCPM

image-caption-generator

Deep_Learning_in_Python_2018

one-network-many-uses

clip-gpt-captioning

DINO-X-MCP

transformer_image_caption

Im2txt

biomedica-etl

zerolan-core

Image-Caption-Generator

MAX-Image-Caption-Generator

image_captioning

RSCaMa

videoCC-data

LaBERT

PureT

image_captioning_with_transformers

image-caption-generator

FuseCap