Found 351 repositories (showing 30)
deepset-ai
Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.
lancedb
Resources, examples & tutorials for multimodal AI, RAG and agents using vector search and LLMs
dusty-nv
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
debnsuma
A Practical Course on Embeddings, RAG, Multimodal Models, and Agents with Amazon Nova.
ocean-luna
[ACM MM2025] Official code of "HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation"
joeseesun
📚 2025 AI Engineering Reading List - 120+ curated AI research papers covering LLMs, Multimodal, RAG, Agents, Diffusion Models and more
framersai
Build autonomous AI agents with adaptive intelligence and emergent behaviors. Multimodal RAG included.
Zhang-986
An intelligent AI agent system that brings the world of Stardew Valley to life through conversational AI, RAG vector databases, and multimodal analysis.
AhmedAl93
Agent-based implementation of RAG that incorporates AI agents into the pipeline to orchestrate its components and perform actions beyond simple retrieval and generation, overcoming the limitations of the non-agentic pipeline.
Code4Delphi
AI in Practice: LLM, RAG, MCP, Agents, Function Calling, Multimodal, TTS/STT, and more
Multimodal RAG on Sber's annual reports
tokisaka23
RxLM-Med: A multimodal clinical AI agent featuring System 2 reasoning, cross-lingual hierarchical RAG (BM25 + FAISS + RRF), deterministic medical calculation engine, and Traffic Light Protocol (TLP) safety alignment — built on Qwen-VL with LoRA fine-tuning, SFT/DPO alignment, and INT4 quantization for real-world lab report interpretation.
Bionic-AI-Solutions
No description available
marceloeatworld
Cloudflare-native AI agent — 13 tools, codemode, 5-layer memory, self-learning, multimodal I/O. Telegram, Discord & WhatsApp bots. Web search, image gen, TTS, document RAG, calendar, task delegation. 100% serverless on Workers + Durable Objects.
Tianyu-yang-anna
No description available
No description available
JGallegoPerez
A multi-agent, multimodal system that can incorporate RAG, memory, and planning, and can take input from traditional ML algorithms (e.g. CNNs).
maxmoundas
Course on LLMs and Prompt Engineering. Covers LLM fundamentals, training, evaluation, prompting techniques, RAG, multimodal capabilities, agents, MCP, and LLM-powered software engineering tools.
Vijaysingh1621
An agentic multimodal RAG assistant with voice queries, PDF/image understanding, web search, and grounded answers.
emedinac
A growing collection of hands-on projects pushing the boundaries of GenAI: covering LLMs, agents, RAG variants, multimodal systems, and general generative models. Built to explore, test, and create at the edge of what’s possible.
Offline-first, multimodal AI agent for Windows 11 + WSL2 — integrates voice, vision, OCR, RAG search, and terminal control into one secure local brain. Powered by Ollama (Qwen 7B), Whisper, Piper TTS, and FAISS, it lets your PC listen, see, read, and act — all privately, offline, and auditable.
Kirushikesh
A simple MultiModal RAG Agent on Text and Tables using LangChain and Docling.
fork123aniket
Multimodal Agentic GenAI Workflow – Seamlessly blends retrieval and generation for intelligent storytelling
genbounty
Damn Vulnerable AI Application - For LLM Red Team Training. LLM testing, RAG testing, Multimodal testing, Agent testing, LLM payload generation
mmqtx
Cardio-X: An Open-Source AI Cardiologist Agent. Multimodal ECG analysis with Grad-CAM explainability, RAG-enhanced reasoning, and LLM-powered diagnostics.
FLotfiGit
A curated collection of experiments and frameworks exploring the intersection of Agentic AI and Generative AI, including reasoning driven LLM agents, retrieval augmented generation (RAG), multimodal synthesis, and autonomous decision making systems.
openbot-chat
OpenBot is an open-source AI agent development platform that combines low-code agility with professional-grade AI engineering capabilities by integrating visual agent orchestration, multimodal model management, and enterprise-ready RAG pipelines.
hgsanyang
🎵 A personal digital tuner that understands your soul | A multimodal AI music agent powered by Hybrid RAG, Neo4j, and Long-term Memory.
This project focuses on developing a real-time, intelligent healthcare assistant powered by Agentic RAG (Retrieval-Augmented Generation) and the LangChain framework. Designed to provide multimodal support—text, image, and document-based inputs—it aims to assist medical personnel and individuals in remote areas.
salilkadam
Advanced RAG system with Graphiti knowledge graph integration, multimodal processing capabilities, and vector database support