Gemma 4 on Llama.cpp should be stable now

Posted by ilintar@reddit | LocalLLaMA | View on Reddit | 137 comments

With the merging of https://github.com/ggml-org/llama.cpp/pull/21534, all of the fixes to known Gemma 4 issues in Llama.cpp have been resolved. I've been running Gemma 4 31B on Q5 quants for some time now with no issues.

Runtime hints:

Have fun :)