Gemma 4 and Qwen 3.6 with q8_0 and q4_0 KV cache: KL divergence results

Posted by oobabooga4@reddit | LocalLLaMA | View on Reddit | 67 comments