Qwen3.6-27B-3bit-mlx · Hugging Face: 3 & 5 mixed quant for RAM poor Mac users.

Posted by JLeonsarmiento@reddit | LocalLLaMA | View on Reddit | 19 comments

Just dropped a 3bit mixed quant (5bit for embeds and prediction layers) for Mac users.

There was only one 3 bit version of this model (from Unsloth), but it was very heavy and painfully slow.

This one is twice as fast, and in my own agentic tests equally good.