mac code — Claude Code, but it runs on your Mac for free. 35B AI agent at 30 tok/s via Apple Silicon flash-paging. $0/month.
Stars: 544
Forks: 55
Watchers: 544
Open Issues: 4
111 commits
eedeeac  mlx-gemma4: KV sharing is ACTIVATION reuse, not weight sharing
8953aa6  mlx-gemma4: add KV sharing + PLE, E2B loading progress
b464ec0  Gemma 4 Q8 benchmark: 0.27 tok/s on 16 GB (llama.cpp: 0 tok/s)
14180ed  GEMMA 4 WORKS! RMSNorm fix: raw weights, no +1 offset
eb02d96  Add Gemma 4-26B MoE to tiny-bit-terminal compare + models
aeedd98  Add Gemma 4-26B MoE to tiny-bit-terminal model options
f7bc5c1  Gemma 4: architecture confirmed correct, need expert FFN fix
0e8f6f4  Gemma 4: add post_feedforward_layernorm after dense+moe combine
0c1fd60  Gemma 4: fix RMSNorm for GGUF weights + attention fixes
c3bbeac  Fix missing post_attention_layernorm in Gemma 4 model
bf82a9f  Gemma 4 MLX engine: forward pass works, needs weight mapping fix