Found 40 repositories (showing 30)
jmisilo
CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.
Lahdhirim
Image caption generation using a hybrid CLIP-GPT2 architecture. CLIP encodes the image while GPT-2 decodes it into a natural-language caption. Modular, configurable pipelines for training, inference, and evaluation on datasets like COCO.
An image captioning system that combines CLIP with GPT-2.
MjdMahasneh
No description available
heisenberg1804
End-to-end image captioning system using CLIP ViT-B/32 for visual encoding and GPT-2 with LoRA fine-tuning for caption generation. Trained on COCO Captions (Karpathy split) with a learned mapping network bridging vision and language embedding spaces.
Noob-Coder2
An AI-powered image captioning bot that generates descriptive captions for images using the Conceptual Captions (shortened version) dataset and a CLIP-GPT architecture.
kliu128
Multimodal image + text captioning for 416k figures from arXiv. Uses CLIP + SciBERT + GPT-2 in an encoder-decoder architecture. CS224N final project.
saksham-ops
Image Captioning with CLIP and GPT-2 — Multimodal deep learning model integrating a CLIP vision encoder and a GPT-2 text decoder for image-to-text generation. Trained on 30K+ Flickr images, achieving a BLEU-4 of 5.28% and a CIDEr of 37.76%, outperforming a CNN+LSTM baseline by 12%.
Developed an end-to-end AI system that generates relevant image captions based on user-provided keywords. Integrated CLIP for image-text similarity and GPT-2 for creative text generation to produce and rank captions.
lucasmbll
Pretrained a GPT‑2 (124M) on FineWeb‑Edu (~10B tokens) and fine‑tuned it for COCO image captioning using a frozen CLIP ViT‑B/32 encoder. Explores gated middle cross‑attention, BLIP‑2 Q‑Former prefixes, and lightweight linear prefixes.
AgriCLIP is a CLIP-based vision–language model for agriculture and livestock. Trained on ALive (600k image–text pairs) with GPT-4 captions and fine-grained DINO features, it achieves 48% zero-shot accuracy, outperforming CLIP in crop, livestock, and fish classification tasks.
udaykumar1307
Image Captioning System (CLIP + GPT-2)
Vignesh010101
Image Captioning Using CLIP & GPT Models
shrnik
No description available
koushik-mahamkali
Image captioning optimized to run on low-spec hardware.
jatinpsingh
Image captioning pipeline using CLIP vision encoder and GPT-2 decoder
manugaurdl
PyTorch implementation of the ClipCap paper.
lachlanchen
Video & image captioning with OpenAI CLIP embeddings + GPT decoder
baichuanzhou
No description available
Chantoone
No description available
yunusskeete
Automated Scalable 3D Captioning with Pretrained Models (Based on Cap3D)
No description available
sauravsoni6377
An image captioning system that combines CLIP for image feature extraction and GPT-2 for generating descriptive captions.
Syntax1on
AI tool: Upload video → Auto highlights via Whisper + GPT → Clip preview w/ captions 🎬
rafat-74
Image Caption Generator using CLIP + GPT-2. A model that generates image captions using CLIP and GPT-2; developed in VS Code, tested on trained datasets, with automatic English-to-Arabic translation.
ho-edwardd
Developed a Transformer Mapper architecture to link a pretrained CLIP image model and pretrained GPT-2 language model for robust image captioning.
Anandupy
AI-based Instagram Caption Generator using DeepFace, CLIP, and GPT-2 with Emotion Detection and NLP.
theophile-lt
From-scratch GPT-2 trained on Fineweb_edu (10B tokens), extended to image captioning with frozen CLIP features and lightweight multimodal bridges.
usha1310
Built an AI-powered image captioning tool by integrating CLIP and GPT-2 using prefix mapping, with real-time demos via Gradio and Streamlit.
jaypatelp001
AI Caption & Hashtag Generator is a **Streamlit web app** that generates creative captions and trending hashtags for your images using **CLIP** and **GPT-based models**.
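A recurring design across these repositories (ClipCap, the prefix-mapping and mapping-network entries) is a small learned bridge that projects a CLIP image embedding into a sequence of GPT-2-sized "prefix" embeddings, which are prepended to the caption tokens during decoding. A minimal sketch of that bridge, assuming CLIP ViT-B/32's 512-d image embedding, GPT-2 (124M)'s 768-d token embeddings, and a prefix length of 10; the single linear layer and all names here are illustrative stand-ins for the learned mapping network, not any specific repo's code:

```python
import numpy as np

CLIP_DIM = 512    # CLIP ViT-B/32 image embedding size
GPT2_DIM = 768    # GPT-2 (124M) token embedding size
PREFIX_LEN = 10   # number of prefix "tokens" fed to GPT-2 (illustrative)

rng = np.random.default_rng(0)

# A single random linear layer standing in for the trained mapping network
# (real implementations use an MLP or a small transformer here).
W = rng.normal(scale=0.02, size=(CLIP_DIM, PREFIX_LEN * GPT2_DIM))

def map_clip_to_prefix(clip_embedding: np.ndarray) -> np.ndarray:
    """Project one CLIP image embedding to PREFIX_LEN embeddings in
    GPT-2's token-embedding space; these would be prepended to the
    caption's token embeddings before autoregressive decoding."""
    flat = clip_embedding @ W                  # (PREFIX_LEN * GPT2_DIM,)
    return flat.reshape(PREFIX_LEN, GPT2_DIM)  # (PREFIX_LEN, GPT2_DIM)

image_embedding = rng.normal(size=(CLIP_DIM,))  # stand-in for CLIP output
prefix = map_clip_to_prefix(image_embedding)
print(prefix.shape)  # (10, 768)
```

In frameworks like Hugging Face `transformers`, such a prefix is typically passed to GPT-2 via `inputs_embeds`, concatenated in front of the caption token embeddings; only the mapping network (and optionally LoRA adapters on GPT-2, as in some entries above) is trained while CLIP stays frozen.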