Qwen3 Next imatrix GGUFs up!

Posted by noneabove1182@reddit | LocalLLaMA | View on Reddit | 18 comments

Just figured I'd post in case anyone's looking for imatrix and IQ quants

https://huggingface.co/bartowski/Qwen_Qwen3-Next-80B-A3B-Instruct-GGUF

https://huggingface.co/bartowski/Qwen_Qwen3-Next-80B-A3B-Thinking-GGUF

As usual this also uses my PR/fork for slightly more optimized MoE quantization

https://github.com/ggml-org/llama.cpp/pull/12727