"Actually wait" ... the current thinking SOTA open source

Posted by FPham@reddit | LocalLLaMA | View on Reddit | 34 comments

I'm trying GLM 5.1 but is it just me or the thing really just works by over-cranking thinking to almost ridiculous heights?

It has been for last 20 minutes writing novellas about what it is going to do with all, Uhm, Actually wait, but no..., and I really just asked it to write an owner draw CButton with different colors.

Now don't get me wrong, at the end it seems to get there - but I'm just having my own "Actually wait" thinking moment:

Is this the way they made it so smart?

While the other models like Claude (the $20 is now just a total test mode ripoff - the tokens get spent in 15 minutes then you wait for hours) or ChatGPT (I currently prefer codex lately over CC, honestly it feels as smart) simply give you the answer almost right away for such simple things.

Edit, 30 minutes and > 100k tokens and now it starts writing CThemedButtonCtrl