🚀 LLM inference engine in Swift/Metal. Loads GGUF and safetensors models directly: no conversion step, no C++, pure Swift.
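The direct-loading claim implies parsing the GGUF container in Swift. As a hedged illustration (not the repo's actual API; names `GGUFHeader` and `readGGUFHeader` are invented here), a minimal header probe following the public GGUF spec, which puts the magic bytes "GGUF", a little-endian uint32 version, a uint64 tensor count, and a uint64 metadata key-value count at the start of the file:

```swift
import Foundation

// Fixed 24-byte GGUF header, per the public GGUF spec.
// This is a standalone sketch, not code from the repo.
struct GGUFHeader {
    let version: UInt32
    let tensorCount: UInt64
    let metadataKVCount: UInt64
}

enum GGUFError: Error { case truncated, badMagic }

func readGGUFHeader(at url: URL) throws -> GGUFHeader {
    let data = try Data(contentsOf: url)
    guard data.count >= 24 else { throw GGUFError.truncated }
    // Magic bytes "GGUF" identify the container.
    guard data.prefix(4) == Data("GGUF".utf8) else { throw GGUFError.badMagic }

    // All header integers are little-endian; read them unaligned.
    func load<T: FixedWidthInteger>(at offset: Int, as _: T.Type) -> T {
        data.withUnsafeBytes { raw in
            T(littleEndian: raw.loadUnaligned(fromByteOffset: offset, as: T.self))
        }
    }
    return GGUFHeader(
        version: load(at: 4, as: UInt32.self),
        tensorCount: load(at: 8, as: UInt64.self),
        metadataKVCount: load(at: 16, as: UInt64.self)
    )
}
```

After the header, a real loader would walk the metadata key-value pairs and tensor descriptors; the sketch stops at the fixed-size prefix, which is enough to validate a file before mapping its tensors.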
Stars: 25 | Forks: 0 | Watchers: 25 | Open Issues: 0
158 commits (most recent first):
b889dcc  chore: clean up repo structure for public-facing release
f59aec4  feat: add simple EdgeRunner facade API with stream and generate methods
a69219e  fix: pass dense V buffer to decode kernel for 4k-16k contexts
bbbc0b1  fix: populate dense V cache during prefill for all context lengths
154fd75  fix: populate dense V cache during prefill for all context lengths
1d65194  fix: dense-V decode path for turboquant quality regression
edb1eac  fix: disable unimplemented turboquant hybrid-V paths
c9d8280  merge: turboquant KV-cache integration from perf2-turboquant-isolation
26fbb4c  chore: commit local perf2 changes before turboquant merge
b9df8ec  perf: use dense attention for long turboquant prefill — ttft 624029 → 504410 ms with 33.24 tok/s preserved
40d5994  perf: increase 16k row thinning again — 27.08 → 33.21 tok/s (+22.6%)
7b5fec6  perf: double 16k row thinning — 24.32 → 27.08 tok/s (+11.3%)
f4317c0  perf: keep only top tile value row at 16k — 22.87 → 24.32 tok/s (+6.3%)
eff8c12  perf: fix 16k sparse score scaling — 22.33 → 22.87 tok/s (+2.4%)
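One commit above adds an EdgeRunner facade with stream and generate methods. The commit log names only the type and the two methods, so everything else in this usage sketch (the initializer, parameter names, and async/throwing signatures) is an assumption:

```swift
// Hypothetical usage of the EdgeRunner facade; signatures are guessed,
// "model.gguf" is a placeholder path.
let runner = try EdgeRunner(modelPath: "model.gguf")

// One-shot generation: return the full completion as a String.
let reply = try await runner.generate("Summarize GGUF in one sentence.")
print(reply)

// Streaming: yield tokens incrementally, e.g. via AsyncThrowingStream.
for try await token in runner.stream("Write a haiku about Metal.") {
    print(token, terminator: "")
}
```

A facade like this would hide model loading, KV-cache setup, and the Metal kernel dispatch behind two calls, which matches the "simple facade" framing in the commit message.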