Found 4,592 repositories(showing 30)
huggingface
State-of-the-Art Text Embeddings
BlinkDL
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
OpenBMB
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
huggingface
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
embeddings-benchmark
MTEB: Massive Text Embedding Benchmark
ddangelov
Top2Vec learns jointly embedded topic, document and word vectors.
huggingface
Efficient few-shot learning with Sentence Transformers
beir-cellar
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
MinishLab
Fast State-of-the-Art Static Embeddings
yongzhuo
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
IntelLabs
Efficient Retrieval Augmentation and Generation Framework
dmayboroda
On-premises conversational RAG with configurable containers
kennethleungty
Running Llama 2 and other Open-Source LLMs on CPU Inference Locally for Document Q&A
BaranziniLab
Empower Large Language Models (LLM) using Knowledge Graph based Retrieval-Augmented Generation (KG-RAG) for knowledge intensive tasks
octimot
An editing tool that uses AI to transcribe, understand content and search for anything in your footage, integrated with ChatGPT and other AI models
wangyuxinwhy
unified embedding model
fynnfluegge
Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format
AIAnytime
This is a medical bot built using Llama2 and Sentence Transformers. The bot is powered by Langchain and Chainlit. The bot runs on a decent CPU machine with a minimum of 16GB of RAM.
huggingface
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
voidism
Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"
dpressel
MinT: Minimal Transformer Library and Tutorials
yosungho
Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization (Line Transformer)
sfzhou5678
An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)
ricardorei
Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP.
davidberenstein1957
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.
jhgan00
한국어 사전학습 모델을 활용한 문장 임베딩
labteral
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
ALucek
Optimize Document Retrieval with Fine-Tuned KnowledgeBases
yuanzhoulvpi2017
sentence-transformers to onnx 让sbert模型推理效率更快
rag-wtf
Open Source Text Embedding Models with OpenAI Compatible API