Went to the monthly AI dev meetup

Posted by nathandreamfast@reddit | LocalLLaMA | View on Reddit | 38 comments

Usual crowd. Everyone's on Claude or Codex, nobody's really sure how any of it actually works, and that's fine, that's the vibe. Then there's this guy. The Claude guy. You know the type even before he speaks. First thing he wants to know is what I'm running. I tell him: GLM, custom multi-agent setup, local small LLM routing traffic between GLM 5.1, Kimi K2.6, MiMo v2.5-Pro and a few OpenRouter models, all hitting a bleeding edge llama.cpp build I access over WireGuard wherever I am. He looks at me like I'm speaking another language. "So... not Opus?" Not Opus. Not Codex. Not anything with a pricing page and a friendly little UI. He doesn't know what to do with this information. Someone throws out a challenge. Build a working browser game, go. I paste the prompt in, agents fan out and start doing their thing, and I close my laptop lid. That's the whole move. Years of refining this XFCE4 setup means they just keep working with the lid down. Autonomously. While I get a coffee. I crack the lid once to check progress and the guy next to me is staring at the compaction logs scrolling past. "What is that." I tell him it's Qwen3.6-35B-A3B-uncensored-heretic-Q5_K_S.gguf doing over 200 tokens per second just eating through context compaction on local hardware. He goes quiet. Fair enough. The Claude guy is not having a good time. Toggling between plan mode and build mode. Sweating a bit. The kind of focused where you can tell things aren't going well but he hasn't admitted it yet. My Telegram pings. App's done, deployed, playable in the browser. I didn't touch anything after I closed the lid. His screen is half a game that doesn't work. He stares at it, closes the laptop, and walks straight out without a word. One of his mates looks over at me. "You just made a big mistake today buddy." I thought about it for a second. "Don't mess with local LLM guys bro." Nobody said anything after that.