Found 95 repositories (showing 30)
tencent-ailab
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
amazon-science
Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"
AMAAI-Lab
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
ShiZhengyan
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Learner"
AGENDD
This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech-modality input. We followed the idea of SLAM_ASR and used the RWKV language model as the LLM; instead of writing a prompt template, we directly finetuned the initial state of the RWKV model.
ssbuild
share data, prompt data, pretraining data
DNE-Digital
Dolores is a Python library designed to improve the developer experience when working with pretrained language models. Dolores provides prompts for interacting with language models that result in interesting or useful outputs.
MedICL-VU
[ISBI 2024 Oral] ProMISe: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models
TOM-tym
Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”
hedongxiao-tju
[NeurIPS 2025] One Prompt Fits All: Universal Graph Adaptation for Pretrained Models
PRIS-CV
Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"
SufyanDanish
A comprehensive survey of Vision–Language Models: Pretrained models, fine-tuning, prompt engineering, adapters, and benchmark datasets
c-box
Code for ACL 2022 long paper: Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View
adriaciurana
Generate prompts for a pretrained LLM using a genetic algorithm (GA)
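A minimal sketch of how GA-based prompt search typically works, assuming the usual loop of selection, crossover, and mutation with elitism. The fitness function here is a toy stand-in (token overlap with an illustrative target prompt); in the real setting it would score each candidate prompt by querying the pretrained LLM, e.g. via downstream task accuracy. All names and parameters below are hypothetical, not taken from the repository.

```python
import random

rng = random.Random(0)

# Illustrative target and vocabulary; in practice fitness would come from
# evaluating the prompt with the pretrained LLM, not from a fixed target.
TARGET = "think step by step before you answer".split()
VOCAB = sorted(set(TARGET) | {"the", "question", "solve", "now", "carefully"})

def fitness(cand):
    # Toy objective: position-wise token matches against the target prompt.
    return sum(a == b for a, b in zip(cand, TARGET))

def mutate(cand):
    # Replace one token with a random vocabulary word.
    cand = list(cand)
    cand[rng.randrange(len(cand))] = rng.choice(VOCAB)
    return cand

def crossover(a, b):
    # One-point crossover between two parent prompts.
    cut = rng.randrange(1, len(a))
    return a[:cut] + b[cut:]

def evolve(pop_size=30, generations=40):
    pop = [[rng.choice(VOCAB) for _ in TARGET] for _ in range(pop_size)]
    history = []  # best fitness per generation
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        elite = pop[: pop_size // 5]  # elitism: carry the top 20% forward
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            children.append(mutate(crossover(a, b)))
        pop = elite + children
    return max(pop, key=fitness), history

best, history = evolve()
print(" ".join(best), fitness(best))
```

Because elitism always carries the current best individual into the next generation, the per-generation best fitness is monotonically non-decreasing, which makes the search easy to monitor.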
sangmichaelxie
Code for the NeurIPS 2021 paper "Why Do Pretrained Language Models Help in Downstream Tasks? An Analysis of Head and Prompt Tuning"
wangyu-sd
Molecular Chemical reActivity pretraining and prompted-finetuning enhanced molecular representation learning
zzyking
Adds image cropping for better prompting, based on the official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
IvanC987
A Flask-based GUI for latent diffusion image generation with real-time denoising (DDIM/DDPM), CFG, and advanced user controls (prompt input, img2img, upscaling, etc.). This project integrates various supporting pretrained models (Stable Diffusion VAE, CLIP, Real-ESRGAN, VGG16) into a custom Latent Diffusion Model pipeline for image synthesis.
Wxy-24
This is the implementation of [ISBI26] QwenCLIP: Boosting Medical Vision-Language Pretraining via LLM Embeddings and Prompt Tuning
kaoyuky
This is the PyTorch implementation for "Rethinking Remote Sensing Pretrained Model: Instance-Aware Visual Prompting for Remote Sensing Scene Classification". If you have any questions, please contact kys220900680@hnu.edu.cn
zhaoziheng
This is the official repository to conduct knowledge enhancement pretraining in "Large-Vocabulary Segmentation for Medical Images with Text Prompts".
This project combines object detection with voice-assistant features to help visually impaired people reach their destination and read sign boards using computer vision. The prototype performs real-time object detection with the YOLOv3 deep neural network, and the detected object and its class are conveyed to the blind person through speech. Alongside this, a voice assistant handles frequent requirements and utilities such as sending emails and retrieving information from the internet. The work combines pretrained YOLOv3 weights with the Darknet detection framework to build rapid, real-time multi-object detection suitable for a compact, portable device with minimal response time. Several prototypes have been built with blind users' different needs in mind, such as Object Recognizer for the Blind, Visual Aid for the Blind, and Google Lookup; among these approaches, computer-vision-based solutions are emerging as one of the most promising options due to their affordability and accessibility. The project ultimately aims to create a wearable visual aid for visually impaired people.
In this assignment, we’re going to finetune a pretrained Stable Diffusion model to create images based on Naruto-themed prompts. We’ll use the "small-stable-diffusion-v0" model and a dataset of Naruto-related captions. By the end, our model should generate awesome Naruto-style images from text prompts.
Lohith0204
AI Text-to-Image Generator is a deep learning–based application that converts natural language text prompts into high-quality images using pretrained diffusion models. The project demonstrates the practical use of generative AI by leveraging modern text-to-image architectures to transform user descriptions into visually meaningful outputs.
The code of our paper "Entity-related Unsupervised Pretraining with Visual Prompts for Multimodal Aspect-based Sentiment Analysis"
ExpressAI
Prompting Evaluation for Pretrained Language Models
3-Flamingo
For my paper: A Script Event Prediction Method Based on Multi-Level Joint Pretraining and Prompt Fine-Tuning
1maginat0r
[ISBI 2024 Oral] ProMISe: Prompt-driven 3D Medical Image Segmentation Using Pretrained Image Foundation Models
assignments-sliit
Caption generation using the Flickr8k dataset by @jbrownlee, and image generation from caption prompts using pretrained models
jiaolifengmi
Official PyTorch code for "Prompt-based Continual Learning for Extending Pretrained CLIP Models' Knowledge (ACMMM Asia 2024)".