dampflokfreund
-
Issues with Gemma 4 tool calling - abrupt gen ending despite the model telling me it wants to do X.
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 22 comments
-
Enough with perplexity and KLD! BenchLocal benchmarks real use cases and is easy to use for everyone
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 0 comments
-
Bartowski vs Unsloth for Gemma 4
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 79 comments
-
Notice Qwen 3.5 reprocessing the prompt every time, taking long to answer for long prompts? That's actually because of its architecture.
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 48 comments
-
Qwen 3.5 MXFP4 quants are coming - confirmed by Junyang Lin
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 70 comments
-
In my experience, the QAT Gemma 3 quants by stduhpf still perform the best.
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 28 comments
-
5060 chokes on VRAM just by playing Cyberpunk with raytracing enabled at 1080p with DLSS!
Posted by dampflokfreund@reddit | hardware | View on Reddit | 195 comments
-
One of the most annoying symptoms of low VRAM is not talked about nearly enough
Posted by dampflokfreund@reddit | hardware | View on Reddit | 4 comments
-
Honestly, THUDM might be the new star on the horizon (creators of GLM-4)
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 68 comments
-
PSA: Gemma 3 QAT gguf models have some wrongly configured tokens
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 50 comments
-
Every time
Posted by dampflokfreund@reddit | LocalLLaMA | View on Reddit | 0 comments
-
TechPowerUp-Benchmark: 5700 XT starts Alan Wake 2, but it is significantly slower than even an RTX 3050 due to lack of Mesh Shading.
Posted by dampflokfreund@reddit | hardware | View on Reddit | 149 comments