Found 175 repositories (showing 30)
brandonstarxel
This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation. It allows users to compare different chunking methods and includes implementations of several novel chunking strategies.
sighsmile
conlleval in Python (script for chunking/NER evaluation)
mburaksayici
RAG boilerplate with semantic/propositional chunking, hybrid search (BM25 + dense), LLM reranking, query enhancement agents, CrewAI orchestration, Qdrant vector search, Redis/Mongo sessioning, Celery ingestion pipeline, Gradio UI, and an evaluation suite (Hit-Rate, MRR, hybrid configs).
bobmatnyc
AI-powered code review CLI with multiple providers (Gemini, Claude, OpenAI). Features 95%+ token reduction via semantic chunking, 7 review types (security/performance/evaluation), multi-language support, interactive fixes, and developer skill assessment.
This repository contains code implementing a RAG approach over company policy data, along with an evaluation of the RAG solution and smart chunking techniques.
This repository contains the results of automatic glossary term extraction and clustering from the CrowdRE requirements dataset, focusing on two qualitative attributes: feature and benefit. Each entry in the original CrowdRE dataset has six attributes (role, feature, benefit, domain, tags, and date-time of creation); since we are interested in extracting domain-specific terms, we use only the feature and benefit attributes. The reduced dataset used in our experiments, containing only these two attributes, is provided in the file "CrowdRE Requirements Dataset.csv". The original CrowdRE dataset was developed by P. K. Murukannaiah et al. and can be accessed as "The smarthome crowd requirements dataset", https://crowdre.github.io/murukannaiah-smarthome-requirements-dataset/, April 2017. We computed and report a ground truth set for a random subset of 100 requirement specifications, manually identifying 120 ground truth glossary terms grouped into 30 overlapping clusters. Because no benchmark or gold standard exists for ground truth extraction and clustering on the CrowdRE dataset, the terms were formulated from the best judgment of the project members in an unbiased manner. The file "Ground Truth Clusters.docx" lists the ground truth glossary terms together with the manually formulated, semantically similar clusters (clusters are separated by the ###### symbol). The 120 manually identified terms also appear in the third column of the file "Extracted Glossary Terms (With and Without WordNet Removal) and Ground Truth Glossary Terms.csv".
Using a mature text chunking approach, we extracted 143 and 292 glossary terms from the CrowdRE dataset with and without removing words listed in the WordNet lexical database (https://wordnet.princeton.edu/), respectively; the results appear in the first and second columns of "Extracted Glossary Terms (With and Without WordNet Removal) and Ground Truth Glossary Terms.csv". The extracted glossary terms were embedded using two resources: a domain-specific corpus most closely related to the CrowdRE dataset (the Wikipedia Home Automation category to a maximum depth of two, "https://en.wikipedia.org/wiki/Category:Home_automation"), and pre-trained FastText word vectors trained with subword information on the UMBC webbase corpus, the statmt.org news dataset, and Wikipedia 2017 (T. Mikolov, E. Grave, P. Bojanowski, C. Puhrsch, A. Joulin, "Advances in Pre-Training Distributed Word Representations"; https://fasttext.cc/docs/en/english-vectors.html). The purpose of this training is to derive clusters by forming a similarity matrix over the extracted glossary terms: each entry holds the semantic similarity score (cosine similarity) between the FastText word vectors of a pair of terms. Two clustering algorithms, K-Means and EM, were then applied. The automatically formulated clusters for the random subset of 100 requirement specifications with ground truth are shown in "Automated Ideal (Ground Truth) Clusters.docx" and "Automated Extraction and Clustering.docx", respectively. Note: there exist at most n/2 clusters for n glossary terms. To evaluate the efficacy of the clustering algorithms, we use commonly used performance metrics (precision, recall, and F-score).
Evaluation graphs plotting the area under the curve (AUC) and the normalized AUC scores for all clustering algorithms, trained on the two datasets, are shown in two separate files, "Cluster Plots.docx" and "Extraction +Clustering Plots.docx", respectively.
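The pipeline described above (embed each glossary term, build a cosine-similarity matrix, then cluster it with K-Means and EM) can be sketched as follows. This is a minimal illustration, not the repository's code: the term list is hypothetical, random 300-dimensional vectors stand in for the FastText embeddings, and the n/2 cluster bound follows the note above.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.mixture import GaussianMixture

# Hypothetical glossary terms; the real pipeline uses terms extracted
# from the CrowdRE feature/benefit attributes.
terms = ["thermostat", "temperature sensor", "door lock",
         "smart lock", "light dimmer", "motion detector"]

# Stand-in for FastText embeddings (the fasttext.cc English vectors are
# 300-dimensional); random vectors keep this sketch self-contained.
rng = np.random.default_rng(0)
vectors = rng.normal(size=(len(terms), 300))

# Cosine-similarity matrix over the term vectors.
unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
sim_matrix = unit @ unit.T  # sim_matrix[i, j] in [-1, 1]

# Per the note above: at most n/2 clusters for n glossary terms.
n_clusters = len(terms) // 2

# K-Means on the similarity rows; EM clustering via a Gaussian mixture.
kmeans_labels = KMeans(n_clusters=n_clusters, n_init=10,
                       random_state=0).fit_predict(sim_matrix)
em_labels = GaussianMixture(n_components=n_clusters,
                            random_state=0).fit_predict(sim_matrix)

print(dict(zip(terms, kmeans_labels)))
```

With real FastText vectors, semantically close terms (e.g. "door lock" and "smart lock") would get high cosine similarity and tend to land in the same cluster; the resulting labels can then be scored against the manual ground truth clusters with precision, recall, and F-score.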
AgBigdataLab
Chunk-Factory is a fast, efficient text chunking library with real-time evaluation.
stranger00135
Automatically discover the best RAGFlow chunking parameters for each document type. Two-phase optimization with LLM-based evaluation.
Leo310
Assess the effectiveness of chunking strategies in RAG systems via a custom evaluation framework.
ahmeabd
AutoChunker Paper Implementation: Structured Text Chunking and its Evaluation
sjordan1975
Systematic evaluation of RAG configurations for financial document search. Tests 66 combinations of chunking, embedding, and retrieval strategies against a 160-page annual report. Winner: sentence-based chunking with no overlap (MRR 0.83)
Lightweight RAG system using MiniLM embeddings and GPT-2/Ollama for Wikipedia question answering. Uses the rag-datasets/rag-mini-wikipedia dataset for training and evaluation, with document chunking, ChromaDB-based semantic search, and context-aware answer generation evaluated via EM and F1 metrics.
satish860
Evaluation Pipeline for Semantic Chunking
DocSlicer
RAG chunking benchmark suite - evaluation code for docslicer.ai
johntmunger
Retrieval-grounded LLM architecture using semantic chunking, vector-backed search, citation mapping, and evaluation-driven refinement.
dubistdu
RAG retrieval evaluation: boundary-aware chunking, synthetic question generation, vector retrieval metrics (Recall@K, MRR), and sanity checks.
denys-yu
Research codebase for studying chunking strategies in Retrieval-Augmented Generation (RAG), with reproducible experiments, indexing methods, and QA-based evaluation.
Jackie7ii
Standalone implementation of ACT (Action Chunking with Transformers) on LIBERO simulation benchmarks. Supports training, in-training rollout eval, and full evaluation with video saving.
usb1998
Retrieval-Augmented Generation pipeline for legal document question answering using MPNet embeddings, cross-encoder reranking, optimized chunking, extractive answering, and detailed ROUGE/BLEU evaluation.
dannyblaker
Complete guide to document chunking from basics to production. Includes 7 chunking strategies (character, word, sentence, token-based, recursive, semantic), RAG implementation, evaluation metrics, and comprehensive documentation. Perfect for NLP, LLM applications, and RAG systems. Learn with working code examples and best practices.
moelsaka01
End-to-end Retrieval Augmented Generation (RAG) platform with document ingestion, chunking, embedding-based retrieval, FastAPI backend, themed UI, index metadata endpoint, evaluation metrics, and Docker deployment.
vanshksingh
Advanced Retrieval-Augmented Generation (RAG) techniques with modular implementations of hierarchical indexing, adaptive retrieval, semantic chunking, and explainable retrieval. Includes evaluation scripts and sample datasets for benchmarking.
vidhij23
This repository provides a comprehensive suite for an agentic framework with Retrieval-Augmented Generation (RAG), document processing, and evaluation, with a focus on maternal health. It includes modular RAG pipelines, document chunking, vector store management, evaluation scripts, and a rich set of Jupyter notebooks for experimentation and analysis.
gitanjaligilhotra1-lab
Beginner-to-advanced Generative AI knowledge base covering AI/ML fundamentals, LLMs, prompting, embeddings & vector databases, RAG (chunking, retrieval), agents (A2A), MCP, LangChain/LlamaIndex, fine-tuning (LoRA), evaluation, optimization, and enterprise GenAI systems.
Ravisir21
Prototype and evaluation of a RAG Q&A system using Ambedkar corpus. Built with LangChain, ChromaDB, HuggingFace embeddings, and Ollama LLM. Includes retrieval, semantic, and answer quality metrics with chunking analysis for performance optimization.
AI-Solutions-KK
Model-agnostic NLP pipeline that converts PDFs into clean, chunked, training-ready datasets for BERT, LoRA, QLoRA, and semantic pair training. Includes OCR fallback, noise cleaning, chunking, multi-format dataset export, and automatic evaluation reports.
FelipeRochaMartins
Soulsborne RAG is an end‑to‑end Retrieval‑Augmented Generation system for Soulsborne games, showcasing modern RAG practices (scraping, LLM‑based chunking/refinement, vector search, contextualization, query expansion, reranking, and evaluation) with local/remote models.
achrafjarrou
Production-ready RAG system for LVMH 2023 financial analysis. FastAPI + ChromaDB + Groq LLM. Full pipeline: intelligent chunking, vector search, re-ranking, caching, metrics. 85% accuracy, 234 ms latency. Automated tests, Docker, golden-dataset evaluation. Python 3.11 | LangChain | MLOps
sankarbaseone
A modular Retrieval-Augmented Generation (RAG) engine for building enterprise AI assistants. Supports document ingestion, chunking, embeddings, vector search, and LLM-based answer generation. Includes evaluation tools and an extensible architecture for chatbots, knowledge bases, and AI copilots.