QWEN3.6 + ik_llama is fast af

Posted by _BigBackClock@reddit | LocalLLaMA | View on Reddit | 33 comments

QWEN3.6 + ik_llama is fast af

running qwen3.6 UD_Q_4_K_M on 16GB vram + 32GB ram with 200k cw @50+ tok/s