Found 16 repositories(showing 16)
montevive
🚀 Autocache - Intelligent Anthropic API Cache Proxy Automatically inject cache-control fields into Claude API requests to reduce costs by up to 90% and latency by up to 85%. Works as a transparent drop-in replacement for popular AI platforms like n8n, Flowise, Make.com, LangChain, and LlamaIndex—no code changes required
KevinZhao
Local reverse proxy between Claude Code CLI and AWS Bedrock — fixes prompt caching TTL, Agent SDK missing cache, thinking/effort optimization, and model ID mapping
OneGoToAI
**Pruner** is a lightweight CLI wrapper for Claude Code that automatically reduces your API token usage through local proxy interception, context pruning, and prompt caching — without changing how you work.
voipmonitor
Proxy that lets Claude Code talk to self-hosted sglang/vLLM backends with KV cache normalization and vision routing
metaphori-ai
Claude Code Cache Bug Fix using mitmproxy!
d3soxyephedrine
Claude Max OAuth proxy — unlocks 1M context, prompt caching, and API-only features for Claude Code on Replit and beyond
AlexGS74
Normalizing HTTP proxy between Claude Code and vLLM — rips out cache busters for stable KV cache prefix (22% → 95%+ hit rate)
brilliantrough
自动给 anthropic claude 模型添加缓存,可结合 one api 等项目将 anthropic claude API 转换成自带缓存的 OpenAI 格式 API -- 此项目主要由 claude code + glm-4.6 进行开发和维护
acampkin95
claude-cache-proxy module from Projects collection
ialV
an auto-cache proxy for claude
jeremyeder
Local Squid caching proxy for Claude Code sessions
itsHabib
Project based course to build a Cache Proxy, built with claude code teams, for learning purposes
magno73
OpenAI-compatible proxy for Claude Code CLI — use your Claude Pro/Max subscription as an API endpoint. Supports sessions, prompt caching, streaming (SSE), and extended thinking.
BitCool232
Serverless proxy backend for SHealth iOS app — sits between the app and Anthropic Claude API for key security, caching, and rate limiting
timholm
Cost-optimizing reverse proxy for Claude/LLM APIs. Routes each request to the cheapest model that can handle it. Semantic caching, DAG workflows, health-aware backends, ML-assisted routing.
brianwu02
An AI-first development environment for solo developers. Run multiple AI coding agents (Claude, Gemini) in parallel across tmux sessions, with all the infrastructure you need to ship — database, cache, object storage, email, reverse proxy, and monitoring.
All 16 repositories loaded