> if only there is a way to easily distill weights
It's not an unsolved problem, we know know how to do it in general and who has experience with it, etc.
Just a matter of getting enough compute together.
I mean, just get together with people and chip in 5 bucks.
Maybe a kickstarter (we will hire X and Y who are known in the community to know what they are doing and use Z for infrastructure to rent and X thinks it will take W amount of compute and time) ?
It's the other half of the problem even if it is KoFi rather than KickStarter/GoFundMe: nailing the ballpark amounts down, assuming that (a) compute price fluctuates like hell (b) distillations can fail sometimes (c) taxes and red tape insanity when foreigners wants to pitch in cash.
Seems like you are just trying to look for reasons why it wouldn't work.
I'm saying it just needs some people to put in some effort to figure out what would work.
Lived long enough to see these thing recurring consistently, that I want to plant my face on the table when no dream-team steps up. It's how waiting for a savior is complementary to running into a burning building.
Also for some damn reason DashCon 1 appears in my mind. Hmmm
A better 4.5V but they state in the readme that they know it has flaws for text and they didn't release text benchmarks.
Not saying it's bad, but for me it implies they don't think it's a superset of GLM-4.5-Air
Diagrams in the reasoning/planning stage, cool. That's a first.
https://media.discordapp.net/attachments/1451755268789768192/1452707589744889997/image.png?ex=694acadf&is=6949795f&hm=f1c5a42ea847a6f85e7cd7ba49639ae383dcbedb5765d8323acc471c524deac5&=&format=webp&quality=lossless
Result:
https://chat.z.ai/space/v08umaevwcn0-art
Prompt: Create a user friendly, attractive web radio app that will play free SomaFM streams. Make it fully featured. Use your web search tool functionality to identify the correct station endpoints, 'album art', etc.
I'm not going to even evaluate it with their API if I can't eventually transition to on-prem or to a provider that better suits my needs. For that to even be on the table they'd need to crush Sonnet or something.
Yeah, glm 4.7 is not better than minimax 2.1 for certain coding tasks, perhaps even worse, but someone should test them both more to assess them further..
I've been testing 4.7 for the last hour, and it's incredible. Python and HTML: all tasks solved. About 2,000 lines of code in Python and 1,200 in HTML+CSS, etc. Maximum 2 runs and everything was fine.
I haven't tried 4.7 with CLI agentic coding tools yet. GLM-4.6 had an issue with not really understanding how to optimally use tools for performing a task, especially in comparison to M2. Is that addressed?
GLM-4.6 was actually worse at tool calling than GLM-4.5-Air for me. It's still a good model though, I just had to prompt it more to encourage tool calling.
"real programming"
*Asks it to two shot a greenfield project of a small game*
What do you think is more common in industry? Backend/frontend? Or small games in a greenfield codebase?
Just because they're interpreted languages doesn't diminish the incredible and amazing things you can do with them.
(Thinking specifically about Python...)
Don't be "that guy" and let people be excited.
Also, I bet it's a hell of a lot better at C, Kotlin/Java, Swift, and probably any language than I am and I'm getting paid lots of money to do it.
More power in the hands of people who don't need to go through all the shit I went through is great. Can't wait until it completely outclasses any engineer (instead of just 90% of us). Then we can focus on the actual complex issues instead of just the code to get us to the resolution.
Vibe coders are excited about models just to vibe code a... language that's supposed to be easier for humans. Sure, okay. Failure of imagination. If you have an all-powerful AI that can do the coding part for you surely it can do what you can't. But no vibe coders want a pansy AI that's just like them
If you're not "vibe coding" all of the simple shit we do as part of our job you are wasting insane amounts of time.
Great coders don't make great engineers. Great problem-solvers do.
So yeah, keep your head in the sand. Label anyone who uses AI as a "vibe coder" and keep your gatekeeping up. The rest of us are running circles around our peers and getting more done in much easier ways than ever.
Look down your nose at people who will soon be outperforming you all you want. One day you'll look around and realize the entire industry has changed and you're stuck clutching your pearls.
Gemini 3 pro, or Deepseek 3.2 Speciale. I try breaking a game security and Claude only throw "I see" "I found the problem..." Then start to write a lot of .md files and code that nothing related to real problem.
This is what I've found too, I use LLMs for research and have found that Claude models are REALLY bad at deeply thinking through deep, abstract concepts. Opus 4.5 is dumber than even old reasoning models like R1 and o1 in this regard. They are very good at creating boilerplate SWE-type code though.
I honestly cannot relate. Maybe it's because I told it to write everything in mermaid graphs and data flows and stick to data-oriented programming, or maybe it's because I told it to break down everything into tasks and also criticise itself, or maybe it's because I gave it an .MD file I wrote by hand which was up to my standards and told it to read that if it needs style guidance. But the .md files it produces for me are short and to the point. Usually I get it to plan around the end goal, then tell it to translate its plan to an .md and then tick off one task after another
Not that I am not happy about all the chinese releases, but if you look at uncontaminated benchmarks like [swe-rebench] (https://swe-rebench.com/) you see a big gap between GLM 4.6 and GPT 5.x models instead if the 2% on swe-bench verified. Don't trust benchmarks companies can perform themselves.
That's still a very respectable showing for GLM 4.6 and represents probably where I'd put it given my experience with it. I'd wager GLM 4.7 will be significantly higher than DeepSeek 3.2 when they test it
Good open source model, but bad business practice. Their paid model got nerf to infinity, though GLM 4.6 was actually a good model if you can pay from other providers.
https://preview.redd.it/q9czibbqgt8g1.png?width=1162&format=png&auto=webp&s=118b4808a3e8fa8dc176a95cb7085d5cd392c2dc
Loved the transparency of the model. I always go for the more extreme or philosophical on personal life questions, and the model gave me the best response possible, no filters on what was being recommended. No other model has ever suggested anything like this.
Runpod most probably, or GColab if you are on Pro.
On Runpod you'd need multiple GPUs though, something like 4x6000 Pros Blackwells for respectable context windows and sick speeds.
tbh, using it with dcp in opencode just blew me away!
[https://github.com/Opencode-DCP/opencode-dynamic-context-pruning](https://github.com/Opencode-DCP/opencode-dynamic-context-pruning)
Tried the model on [https://chat.z.ai/](https://chat.z.ai/) . For a particular test puzzle question (appears in the computer game Indiana Jones and the Great Circle ) it failed, that 4.6 could answer. A bit depressing, meaning that the improvement are a bit more on the benchmaxxing side?
Your post is getting popular and we just featured it on our Discord! [Come check it out!](https://discord.gg/PgFhZ8cnWW)
You've also been given a special flair for your contribution. We appreciate your post!
*I am a bot and this action was performed automatically.*
Flashbacks to that time where you'd download something from kazaa over dial-up, and after a few hours of waiting you'd get ... not the movie you wanted :D
I like how they compare to OpenAI's flagship but Anthropic's one-step-down model.
Come on guys, real people using Claude today are using Opus, not Sonnet. Don't be misleading in your evals.
How dare you.
Here on LOCALllama, we praise every model, even those 99% of us cant run.
We are here for the benchmark publish by the model maker, they are Gospel.
Get with the program dude, this is the bot and marketing time, not the /r/LocalLLaMA of old.
GLM 4.7 is generally available, I just tried it with my super cheap subscription plan.
For anyone looking to subscribe, they are currently offering stacked discounts 50%+(20-30%)+10% for black Friday deals.
Use link [https://z.ai/subscribe?ic=OUCO7ISEDB](https://z.ai/subscribe?ic=OUCO7ISEDB)
131 Comments
Any-Conference1005@reddit
LagOps91@reddit
Any-Conference1005@reddit
LagOps91@reddit
Emotional-Baker-490@reddit
SilentLennie@reddit
TomLucidor@reddit
SilentLennie@reddit
TomLucidor@reddit
SilentLennie@reddit
TomLucidor@reddit
SilentLennie@reddit
TomLucidor@reddit
SilentLennie@reddit
TomLucidor@reddit
abnormal_human@reddit
Karyo_Ten@reddit
bbjurn@reddit
RickyRickC137@reddit
Tall-Ad-7742@reddit
ttkciar@reddit
jacek2023@reddit
JustinPooDough@reddit
jacek2023@reddit
_VirtualCosmos_@reddit
fanhed@reddit
Zyj@reddit
_VirtualCosmos_@reddit
Zyj@reddit
_VirtualCosmos_@reddit
TheRealMasonMac@reddit
_VirtualCosmos_@reddit
Emotional-Baker-490@reddit
pilibitti@reddit
kimodosr@reddit
Recoil42@reddit
duboispourlhiver@reddit
thrownawaymane@reddit
duboispourlhiver@reddit
thrownawaymane@reddit
duboispourlhiver@reddit
pmttyji@reddit
Long_comment_san@reddit
LagOps91@reddit
Long_comment_san@reddit
DingyAtoll@reddit
martinsky3k@reddit
DingyAtoll@reddit
usernameplshere@reddit
AnticitizenPrime@reddit
Square_Quarter516@reddit
Arindam_200@reddit
GTHell@reddit
AnticitizenPrime@reddit
No_Conversation9561@reddit
coder543@reddit
zmarty@reddit
ForsookComparison@reddit
power97992@reddit
usernameplshere@reddit
dan_goosewin@reddit
zmarty@reddit
power97992@reddit
thatsnot_kawaii_bro@reddit
Repulsive_Educator61@reddit
Dany0@reddit
LegacyRemaster@reddit
TheRealMasonMac@reddit
Karyo_Ten@reddit
SuperChewbacca@reddit
Dany0@reddit
thatsnot_kawaii_bro@reddit
RickDripps@reddit
Dany0@reddit
jazir555@reddit
RickDripps@reddit
AlwaysLateToThaParty@reddit
Purple-Programmer-7@reddit
Professional_Price89@reddit
MrMrsPotts@reddit
Professional_Price89@reddit
buppermint@reddit
Fuzzy_Independent241@reddit
Dany0@reddit
wittlewayne@reddit
Mkengine@reddit
Dany0@reddit
decentralize999@reddit
GTHell@reddit
RandomThoughtsAt3AM@reddit
TomLucidor@reddit
Mochila-Mochila@reddit
seppe0815@reddit
TomLucidor@reddit
mivog49274@reddit
Shir_man@reddit
KvAk_AKPlaysYT@reddit (OP)
Shir_man@reddit
KvAk_AKPlaysYT@reddit (OP)
KvAk_AKPlaysYT@reddit (OP)
ParadigmComplex@reddit
KvAk_AKPlaysYT@reddit (OP)
Kompicek@reddit
unbrained_01@reddit
SilentLennie@reddit
kimodosr@reddit
Goldandsilverape99@reddit
KvAk_AKPlaysYT@reddit (OP)
KvAk_AKPlaysYT@reddit (OP)
Goldandsilverape99@reddit
WithoutReason1729@reddit
serige@reddit
ResidentPositive4122@reddit
AlbeHxT9@reddit
abnormal_human@reddit
mantafloppy@reddit
SlaveZelda@reddit
Nicoolodion@reddit
DHasselhoff77@reddit
dan_goosewin@reddit
waste2treasure-org@reddit
ReallyFineJelly@reddit
coder543@reddit
pmttyji@reddit
Zyj@reddit
Different_Fix_2217@reddit
Minute-Act-4943@reddit
JLeonsarmiento@reddit
jacek2023@reddit
No_Conversation9561@reddit
doradus_novae@reddit