Shoutout to Gemma4 as a conversational assistant / agent

Posted by goldcakes@reddit | LocalLLaMA | View on Reddit | 22 comments

I'm seriously impressed by Gemma4 26B A4B. On my M5 MacBook Pro (so still not that much memory bandwidth by GPU standards), it's blazingly fast and it's a very good generalist / everyday local LLM.

It has a little bit of personality to its responses, and seems to perform decently for everything: creative writing, debugging and coding, random chats, image recognition and classification, etc.

I tried Qwen3.6 35B A3B, and the coding performance feels close (slight lead for Qwen; but it's bigger params so I have less free RAM), but it's definitely not as good as Gemma outside of coding tasks, and generally feels bit more 'robotic' to chat to and work with.