StepJumpy4782

Gemma 4 31B GGUF quants ranked by KL divergence (unsloth, bartowski, lmstudio-community, ggml-org)

Posted by oobabooga4@reddit | LocalLLaMA | View on Reddit | 103 comments

StepJumpy4782@reddit

Wow, super cool, great work. Could you explain more about what the long-document dataset and prompts were? That's a notable use case for me, and your data showing it performing the worst is interesting. How are you running these? Along with the latest Qwen you said you were working on, could we get some analysis like this for the really big ones (GLM 5.1 just came out :))? Is there anything the community could help with in that regard?

R.I.P. MCP (Model Context Protocol) 2024-2026 - Killed by curl

Posted by jorgeiblanco@reddit | LocalLLaMA | View on Reddit | 42 comments

llama.cpp's new parser breaks tons of models, it's staying that way, here's how to fix it

Posted by refulgentis@reddit | LocalLLaMA | View on Reddit | 40 comments

StepJumpy4782@reddit

Meh, I don't mind posts like these, because I'm not staying in the loop on llama.cpp changes even though I pull the latest daily. What are people doing to stay on top of things? The release notes are fairly sparse. But yeah, you need to relax. Just use an older version and wait for the fix, or patch it as you did.

American closed models vs Chinese open models is becoming a problem.

Posted by __JockY__@reddit | LocalLLaMA | View on Reddit | 622 comments

StepJumpy4782@reddit

lmaoo. One thing that comes to mind is backdoors of various kinds. A model could be trained so that its answers to specific prompts the enemy is expected to use are intentionally really bad, include obscure vulns, give bad advice, etc. Whether that's actually happening remains to be seen. I certainly have my doubts. It's open after all, and a finding like that would instantly destroy much of the trust they have built up.

AMA With Z.AI, The Lab Behind GLM-4.7

Posted by zixuanlimit@reddit | LocalLLaMA | View on Reddit | 418 comments

StepJumpy4782@reddit

A bit out of the loop with the latest happenings, will give 4.7 a go. What specifically makes GLM 4.7 stand out compared to everyone else? What more can we expect with future releases (closed and open)? And more specifically, what future areas of research are you guys most interested in learning about?