Found 110 repositories (showing 30)
keepmind9
ACP-compatible AI CLI bridge to IM platforms. Connect Claude Code, Gemini, OpenCode, and other ACP-enabled tools to Discord, Telegram, Feishu, DingTalk, Weixin, QQ. Control desktop AI assistants from your phone with streaming responses, whitelist auth, and proxy - no public IP required.
g-hano
A versatile tool that leverages Google's LLM Gemini, along with HuggingFace models, to generate text and images based on user prompts. It utilizes Langchain for text generation and Hugging Face models for image generation. The project consists of a Streamlit GUI interface where users can interact with the generated content.
This is a repository for the LinkedIn Learning course Build an Image Captioning Tool for Visually Impaired Users with Gemini
millerjl1980
Open WebUI toolkit leveraging Gemini 2.0 Flash Experimental for fast and efficient image and text generation.
MrGamesKingPro
Runs Gemini OCR on images exported by VideoSubFinder, then exports the result as an SRT file.
An image editing tool that uses the Gemini 2.5 Flash preview image generation model.
ajf1016
An experiment with the Whisper model (speech to text) and the Gemini LLM. First I convert the audio file to raw text, then pass that text to the Gemini LLM along with some prompts for summarizing.
Harperbot
No description available
santiagosamuel3455
Image description prompt system
Joycai
A web UI tool that runs locally and edits your images through the Gemini image API. You need an API key from Google AI Studio.
kokodev0726
No description available
Gemini Prompt To Image is a simple web application built with plain HTML, CSS, and JavaScript. It lets users upload an image and enter creative instructions to generate a new image through the Gemini API.
Raghavan1988
Flask web application built with Gemini and Spire to talk to images in PDF
Akshitha0118
A Streamlit-based AI app that uses Google Gemini to generate intelligent descriptions and insights from images with optional user prompts.
No description available
Under development
Gemini Pro, your do-it-all AI tool, translates languages, sparks creativity, and answers questions, all while efficiently running on devices from phones to data centers, making it accessible for developers and businesses to unlock AI's potential.
This project demonstrates an advanced application integrating Large Language Models (LLMs) and large image models using Gemini Pro for generative AI. Features include text generation, image synthesis, and a user-friendly interface. Technologies used: Python, TensorFlow/PyTorch, Django/Flask, Docker, and AWS.
By Weng Fei Fung (Weng). Prompt-engineered the vibe-coding workflow to create a text-to-image AI app. Improved code generation quality through context-truncation mitigation techniques and by controlling for biases that break large codebases in the Gemini 3 Flash Preview model.
No description available
SuericZhe
No description available
xuelyes
Connects Gemini CLI with Feishu
DoctorC0de
A lightweight Node.js background service that connects your local gemini-cli agent to Feishu (Lark). This allows you to chat with and control your local Gemini agent directly through Feishu messages.
R-Apricity
No description available
junAD89
No description available
DivyaGuddeti
Model that describes the given image in text format.
JuN-front
No description available
sparkzzt
No description available
Oleksii-Poltavets
No description available
TTD Gemini Image Toolkit: A powerful command-line interface (CLI) for creative image workflows. Leverage Google Gemini AI to generate stunning images from text prompts, add custom text overlays, and perform generative refinement or editing on existing images. Streamline your content creation with AI-powered image manipulation and generation.