Minimal, zero-dependency LLM inference in pure C11. CPU-first with NEON/AVX2 SIMD. Flash MoE (pread + LRU expert cache). TurboQuant 3-bit KV compression (8.9x less memory per session). 20+ GGUF quant formats. Compiles to WASM.
Stars: 10 · Forks: 2 · Watchers: 10 · Open issues: 0
166 commits
GPU performance: Q4 matvec rewrite, compute COPY, flash attention, zero-copy weights (a9bfc5a)
Enable TQ on draft model, update CLAUDE.md for prompt cache + TQ docs (222e973)
Fix TQ cache reset, add full-turn prompt caching, verify WASM TQ path (1fbef8e)
TQ-aware prompt cache + audit fixes: bn_alloc, size_t casts, warning cleanup (6790097)
Q4_0 AVX2: inline dot_i8_float, eliminate redundant integer accumulator (b44bb05)
Q4_K GPU matvec: integer accumulation for NEON-matched precision (b313320)
Fix all audit findings: C1-C2 critical, H1-H4 high, M1-M7 medium (f340670)
Fix SSM GPU shaders: add per-layer state and conv_state offsets (87c5cb1)
Q4_0 AVX2: float-domain accumulation eliminates per-block integer hsum (92abd82)
Revert Q-gated debug changes, keep CPU fallback for correctness (0d27068)