azarash1
This application optimizes Large Language Models (LLMs) for Apple Silicon devices using the MLX framework. It incorporates the latest research in model quantization, pruning, and memory management to enable users to run larger models on lower-spec laptops.