Search Results

Found 1,147 repositories(showing 30)

promptfoo

💚95

Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration. Used by OpenAI and Anthropic.

19.4k

1.7k

MIT

TypeScript

Updated just now

cici-cdcicd+15

Weave

liaotxcn

🧡60

A highly efficient, secure, and stable application development platform with excellent performance, easy scalability, and deep integration of AI capabilities such as LLM, AI Chat, RAG, and Agents.高效、安全、稳定的服务研发平台，具备良好性能，同时易扩展，深度集成LLM、AIChat、RAG、Agent等AI能力

255

Updated 20 hours ago

agentaichatdocker+8

purecpp

pureai-ecosystem

🧡60

This repository contains a Retrieval-Augmented Generation (RAG) framework developed in C++ for high performance and scalability, with CUDA support for computational acceleration.

116

MIT

C++

Updated 3 weeks ago

aws-agentic-document-assistant

aws-samples

🧡55

An agent based LLM assistant that extends RAG with batch entity extraction and SQL querying to improve performance on multi-step and analytical questions.

MIT-0

Jupyter Notebook

Updated 1 week ago

awsbedrockcdk+3

rag-with-rl

FareedKhan-dev

🧡50

Maximizing the Performance of a Simple RAG using RL

MIT

Jupyter Notebook

Updated 2 months ago

llmopenaipython+3

RocketRAG

TheLion-ai

❤️40

RocketRAG is a high-performance Retrieval-Augmented Generation (RAG) system designed with a focus on speed, simplicity, and extensibility. Built on top of state-of-the-art libraries, it provides both CLI and web server capabilities for seamless integration into any workflow.

NOASSERTION

Python

Updated 1 month ago

MaxKB4j

taishan666

🧡65

MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly draws on MaxKB, Dify and FastGPT, and combines the advantages of the two into one project. It is redesigned and developed using the high-performance, high-stability and secure reliable JAVA language.

GPL-3.0

Java

Updated 1 day ago

agentaichatgpt+2

DualStack-Agent

Yuliu11

🧡65

Agentic-RAG Framework: A high-performance knowledge retrieval system featuring Hybrid Search (FAISS + BM25), RRF Re-ranking, and MySQL/Redis persistent storage.

Python

Updated 5 days ago

promptfoo-action

promptfoo

🧡60

The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

MIT

TypeScript

Updated 7 hours ago

actionsllmprompt-engineering+2

Korean-Embedding-Model-Performance-Benchmark-for-Retriever

ssisOneTeam

❤️40

Korean Sentence Embedding Model Performance Benchmark for RAG

Jupyter Notebook

Updated 1 month ago

hyperparameter-tuningkorean-sentence-embeddinglangchain-python+3

DynaRAG

Predixus

❤️45

A simple, robust, RAG engine that focuses on speed and performance

BSD-3-Clause

Updated 1 week ago

Superior-RAG-for-Complex-PDFs-using-LlamaParse

Ashufet

💛70

I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on it with high performance.

Apache-2.0

Jupyter Notebook

Updated 4 days ago

diRAGnosis

AstraBert

❤️40

Diagnose the performance of your RAG🩺

MIT

Python

Updated 8 months ago

dockerevaluation-frameworkfastapi+9

MIRAGE

nlpai-lab

🧡60

MIRAGE is a light benchmark to evaluate RAG performance.

Apache-2.0

Python

Updated 3 weeks ago

Aquiles-RAG

Aquiles-ai

🧡50

Is a high-performance Augmented Recovery-Generation (RAG) solution based on Redis, Qdrant or PostgreSQL. It offers a high-level interface using FastAPI REST APIs

Apache-2.0

Python

Updated 2 months ago

api-restembeddingsfastapi+12

rust-local-rag

ksaritek

🧡60

🦀 High-performance local RAG server in Rust that integrates with Claude Desktop via MCP. Search PDF documents privately using Ollama embeddings - no external API calls.

Rust

Updated 6 days ago

doc-embedder

a-agmon

❤️45

A high-performance RAG indexing pipeline implemented in Rust using LanceDB and Candle

Rust

Updated 1 month ago

RAG-Performance

SciPhi-AI

🧡55

Measuring RAG solutions throughput and latency

MIT

Python

Updated 2 weeks ago

benchmarkingbenchmarkshaystack+6

vector-graph-rag

zilliztech

🧡65

Graph RAG with pure vector search, achieving SOTA performance in multi-hop reasoning scenarios.

MIT

Python

Updated 7 hours ago

graph-raggraphragknowledge-base+3

RAG_nfcorpus

keeeevinShen

❤️40

a project aim to improve the RAG performance on BelR/nfcorpus dataset

MIT

Jupyter Notebook

Updated 5 months ago

FRED-Data-Analysis

hoangsonww

🧡60

💸 A comprehensive AI-powered data explorer that combines FRED economic data & insights with vector search, regression analysis, and interactive RAG chatbot via Pinecone Vector DB, OpenAI, Claude, and Gemini. Built with TypeScript, React, and Express for seamless full-stack performance.

MIT

TypeScript

Updated 3 days ago

anthropicchartjsclaude-ai+17

HR-Toolkit

Islam-Reda-13

❤️40

A high-performance RAG system built to manage and retrieve knowledge from 4,000+ HR documents.

Apache-2.0

Python

Updated 3 months ago

guidance-for-high-speed-rag-chatbots-on-aws

aws-solutions-library-samples

❤️35

This Guidance demonstrates how to develop a high-performance Retrieval-Augmented Generation (RAG) chatbot by combining Amazon Aurora PostgreSQL-Compatible Edition with the open-source pgvector extension.

MIT-0

Python

Updated 4 months ago

RAG-Framework-Evaluation

oztrkoguz

❤️35

This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.

Apache-2.0

Python

Updated 8 months ago

autogenautogen-ragcrewai+9

vectrain

philpher0x

❤️45

Vectrain is a high-performance, modular, plug-and-play RAG pipeline that ingests data, generates vector embeddings, and stores them in vector databases for semantic search, recommendations, and analytics.

MIT

Updated 1 month ago

aiembedding-vectorsembeddings+12

falcon-evaluate

Praveengovianalytics

❤️35

Falcon Evaluate is an open-source Python library aims to revolutionise the LLM - RAG evaluation process by offering a low-code solution. Our goal is to make the evaluation process as seamless and efficient as possible, allowing you to focus on what truly matters.This library aims to provide an easy-to-use toolkit for assessing the performance, bias

MIT

Python

Updated 1 year ago

evaluation-metricsgenerative-ailarge-language-models+3

AIE7-Demo-Day-Project

neomatrix369

❤️25

RagCheck is a proactive corpus quality assessment tool that analyses RAG application document collections before deployment, identifying content gaps and providing specific recommendations to improve query performance. The platform transforms reactive corpus fixes into proactive quality assurance, helping organisations achieve as high as 85% score.

TypeScript

Updated 1 month ago

qualityragtool

symrank

analyticsinmotion

💛70

🐍📦 High-performance cosine similarity ranking for Retrieval-Augmented Generation (RAG) pipelines.

Apache-2.0

Python

Updated 1 day ago

cosine-similarityfoundational-modelslarge-language-models+6

local_rag_pipeline

dronefreak

💛70

An advanced, fully local, and GPU-accelerated RAG pipeline. Features a sophisticated LLM-based preprocessing engine, state-of-the-art Parent Document Retriever with RAG Fusion, and a modular, Hydra-configurable architecture. Built with LangChain, Ollama, and ChromaDB for 100% private, high-performance document Q&A.

MIT

Python

Updated 5 days ago

chromadbdocument-qadocument-query-search+17

ai00-mem

Ai00-X

🧡50

A high-performance memory system for AI applications with advanced RAG capabilities, vector-graph hybrid storage, and adaptive learning. Built with Rust for scalability and efficiency.

MIT

Rust

Updated 2 months ago

GitHub Explorer

Search Results

promptfoo

Weave

purecpp

aws-agentic-document-assistant

rag-with-rl

RocketRAG

MaxKB4j

DualStack-Agent

promptfoo-action

Korean-Embedding-Model-Performance-Benchmark-for-Retriever

DynaRAG

Superior-RAG-for-Complex-PDFs-using-LlamaParse

diRAGnosis

MIRAGE

Aquiles-RAG

rust-local-rag

doc-embedder

RAG-Performance

vector-graph-rag

RAG_nfcorpus

FRED-Data-Analysis

HR-Toolkit

guidance-for-high-speed-rag-chatbots-on-aws

RAG-Framework-Evaluation

vectrain

falcon-evaluate

AIE7-Demo-Day-Project

symrank

local_rag_pipeline

ai00-mem

promptfoo

Weave

purecpp

aws-agentic-document-assistant

rag-with-rl

RocketRAG

MaxKB4j

DualStack-Agent

promptfoo-action

Korean-Embedding-Model-Performance-Benchmark-for-Retriever

DynaRAG

Superior-RAG-for-Complex-PDFs-using-LlamaParse

diRAGnosis

MIRAGE

Aquiles-RAG

rust-local-rag

doc-embedder

RAG-Performance

vector-graph-rag

RAG_nfcorpus

FRED-Data-Analysis

HR-Toolkit

guidance-for-high-speed-rag-chatbots-on-aws

RAG-Framework-Evaluation

vectrain

falcon-evaluate

AIE7-Demo-Day-Project

symrank

local_rag_pipeline

ai00-mem