Found 70 repositories (showing 30)
taylorwilsdon
Since OpenAI and friends refuse to give us a max_ctx param in /models, here's the current context window, input token, and output token limits for OpenAI (API), Anthropic, Qwen, DeepSeek, Llama, Phi, Gemini, and Mistral
yash9439
Transform any codebase, web page, or document into an optimized LLM prompt. CodeToPrompt intelligently compresses code and filters content to overcome context window limits.
alexandephilia
Context Limiter & Output Vetter for context bloat. A highly specialized, structure-aware JSON compressor built specifically to intercept and compress MCP responses before they annihilate your LLM's context window.
taylorbayouth
A Python script to compress large text files for LLM context windows, optimizing the ratio of essential information to tokens used. It offers various compression techniques (key points, glossary terms, paraphrasing, etc.) to fit important content within token limits, reducing the risk of losing context and improving clarity and impact.
Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities"
danwritecode
Solving context limits when working with LLMs by implementing a "chunkable" proc macro on your prompt structs.
Abzaek
Export ChatGPT conversations to Markdown to bypass message limits and seamlessly transfer context between LLMs.
paradite
Information on LLM models, context window token limit, output token limit, pricing and more.
bradAGI
The complete guide to free AI/LLM inference APIs — 20+ providers, 81+ verified $0 models, rate limits, context windows, and code examples
zhuangziGiantfish
Unable to Forget: Proactive Interference Reveals Working Memory Limits in LLMs Beyond Context Length
hilyfux
Stop AI Coding from forgetting. A knowledge graph–driven memory layer for LLMs (ChatGPT, Claude, Codex, DeepSeek, Gemini), enabling persistent long-term memory beyond context window limits. Build smarter AI agents with structured context, better consistency, and scalable multi-step reasoning across complex coding workflows.
zircote
Claude Code plugin for processing documents 100x larger than context limits using the Recursive Language Model pattern. Rust-powered chunking, hybrid semantic + BM25 search, and sub-LLM orchestration.
GPUforLLM
Accurate VRAM calculator for Local LLMs (Llama 4, DeepSeek V3, Qwen 2.5). Calculates GGUF quantization, GQA context overhead, and offloading limits
FilippoLeone
The LLM Code Prompter is a command-line utility designed to generate structured prompts from code repos for GPT-4 models, leveraging the maximum context limit.
ElliotOne
Deterministic context budgeting for LLM prompts, demonstrating stable prompt packing within fixed token limits.
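The "deterministic context budgeting" entry above describes packing prompt sections into a fixed token budget with stable results. A minimal sketch of that idea follows; the function names, the priority-based ordering, and the whitespace "tokenizer" are illustrative assumptions, not this repository's actual API:

```python
def pack_prompt(sections, budget, count_tokens):
    """Pack prompt sections into a fixed token budget.

    Sections are (name, priority, text) tuples; sorting by (priority, name)
    makes the packing order stable, so the same inputs always pack the same way.
    """
    packed, remaining = [], budget
    for name, priority, text in sorted(sections, key=lambda s: (s[1], s[0])):
        cost = count_tokens(text)
        if cost <= remaining:          # keep a section only if it fits whole
            packed.append(name)
            remaining -= cost
    return packed, remaining

# Illustrative usage with a naive whitespace token counter (an assumption).
sections = [
    ("history", 2, "turn one turn two"),   # 4 tokens, lowest priority
    ("system", 0, "be terse"),             # 2 tokens, highest priority
    ("query", 1, "what is up"),            # 3 tokens
]
packed, left = pack_prompt(sections, budget=7,
                           count_tokens=lambda t: len(t.split()))
```

Here "system" and "query" fit within the 7-token budget, "history" is dropped, and the result is identical on every run.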
Fast CLI tool for counting tokens with LLM context limit comparison
vilsonrodrigues
Context management for LLM agents. Persistent memory that survives context limits.
drandrewlaw
Intelligent conversation compaction for LLM applications. Never hit context window limits again. Works with any LLM provider.
sezginpaydas
Persistent vector-based memory for local LLMs using PostgreSQL to overcome context window limits.
glpayson
Summarize long LLM chat conversations using rolling summarization to preserve context and continue conversations past token limits
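The rolling-summarization pattern the entry above describes (keep recent turns verbatim, fold older ones into a summary) can be sketched as below; `summarize` stands in for an LLM summarization call, and the whitespace token counter is a placeholder assumption:

```python
def rolling_summarize(messages, budget, count_tokens, summarize):
    """Keep the newest messages verbatim within `budget` tokens;
    fold everything older into a single rolling summary."""
    summary, kept, used = "", [], 0
    overflowed = False
    for msg in reversed(messages):          # walk newest to oldest
        cost = count_tokens(msg)
        if not overflowed and used + cost <= budget:
            kept.append(msg)
            used += cost
        else:
            overflowed = True               # keep the verbatim window contiguous
            summary = summarize(summary, msg)  # an LLM call in a real system
    kept.reverse()                          # restore chronological order
    return summary, kept

# Toy stand-ins for illustration only.
count = lambda m: len(m.split())
concat = lambda s, m: (m + " " + s).strip()    # real code would call an LLM here
summary, recent = rolling_summarize(
    ["hello there friend", "how are you", "fine thanks and you"],
    budget=6, count_tokens=count, summarize=concat,
)
```

With a 6-token budget, only the latest message survives verbatim and the two older ones are folded into the summary.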
rudramadhabofficial
An infinite-context LLM agent inspired by MIT's Recursive Language Models (RLM). Uses autonomous tool-use and recursion to navigate and analyze massive datasets without RAG or context limits.
Sahith59
A local-first backend AI framework that solves LLM context limits by dynamically routing queries, branching conversations independently, and compressing old memories mathematically using ChromaDB
vishal-labade
AI Evals v2 is a structured, reproducible LLM evaluation framework that isolates behavioral reliability from memory capacity. It introduces controlled experiment families, a Memory Compliance Score (MCS), and context-cliff analysis to quantify when reliability failures stem from scale vs. context limits.
ArthurusDent
Optimal Ollama is a cross-platform benchmarking and tuning tool designed to find the "Sweet Spot" for your local LLMs. It helps you determine the maximum context window a model can handle on your specific hardware before performance degrades or memory limits are exceeded.
roycrisses
A CLI tool + importable library that analyzes any codebase, finds the most relevant files to a user's question, trims content to fit any LLM's token limit, and outputs a ready-to-use context block (to clipboard or directly to an LLM API).
Aryan-202
An intelligent optimization engine that dynamically adjusts LLM selection, context size, and token limits based on real-time hardware telemetry to maximize inference efficiency and prevent resource bottlenecks.
cheikh2shift
Library for getting LLM context window limits
0pfleet
Generate documents with exact token counts to test LLM context window limits
varriaza
Find the Pareto Frontier for open LLMs across model, quantization and context limits
PsiClawOps
Active context window probing for LLM providers — finds real enforced limits, not just advertised ones
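Active probing of an enforced limit, as the last entry describes, typically amounts to a binary search over probe sizes. This is a minimal sketch under that assumption; `accepts(n)` stands in for sending an n-token probe request and checking whether the provider rejects it:

```python
def probe_context_limit(accepts, lo=1, hi=2_000_000):
    """Binary-search the largest probe size (in tokens) that `accepts` allows.

    Assumes acceptance is monotone: if n tokens are rejected,
    every larger n is rejected too.
    """
    if not accepts(lo):
        return 0                      # even the smallest probe is rejected
    while lo < hi:
        mid = (lo + hi + 1) // 2      # bias upward so the loop terminates
        if accepts(mid):
            lo = mid                  # mid fits; the limit is at least mid
        else:
            hi = mid - 1              # mid rejected; the limit is below mid
    return lo

# Simulated provider that silently enforces a 131072-token window.
limit = probe_context_limit(lambda n: n <= 131_072)
```

Against a real API, each `accepts` call costs a request, so the binary search finds the enforced limit in roughly log2(hi) probes (about 21 here) rather than thousands of linear attempts.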