windows_error23

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 302 comments

A man arrives at the office in the morning where several of his co-workers are already chatting next to the coffee machine. He listens to them for a bit, then suddenly asks: "How big are penguins?"

Posted by Aldaron23@reddit | Jokes | View on Reddit | 119 comments

Gemma 4 has been released

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 702 comments

DeepSeekOCR & codefuse-ai/F2LLM-v2 are ready on llama.cpp

Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 5 comments

Gemini Pro leaks its raw chain of thought, gets stuck in an infinite loop, narrates its own existential crisis, then prints (End) thousands of times

Posted by Powerful-Signal6312@reddit | LocalLLaMA | View on Reddit | 95 comments

OpenAI to acquire Astral

Posted by Useful-Macaron8729@reddit | Python | View on Reddit | 391 comments

OpenAI to acquire Astral

Posted by Useful-Macaron8729@reddit | Python | View on Reddit | 391 comments

OpenAI to acquire Astral

Posted by Useful-Macaron8729@reddit | Python | View on Reddit | 391 comments

OpenAI to acquire Astral

Posted by Useful-Macaron8729@reddit | Python | View on Reddit | 391 comments

OpenAI to acquire Astral

Posted by Useful-Macaron8729@reddit | Python | View on Reddit | 391 comments

Breaking : The small qwen3.5 models have been dropped

Posted by Illustrious-Swim9663@reddit | LocalLLaMA | View on Reddit | 334 comments

Qwen3 VL 30b a3b is pure love

Posted by Njee_@reddit | LocalLLaMA | View on Reddit | 91 comments

windows_error23@reddit

But the model is originally in bf16? If my understanding is correct, the fp32 mmproj is for people with hardware that doesn't support bf16 so that they can use it in full precision as an alternative instead of the quantized f16. Could be wrong on this.

Qwen3-VL GGUF!

Posted by khubebk@reddit | LocalLLaMA | View on Reddit | 52 comments

Qwen 3 !!!

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 460 comments

Google Ironwood TPU (7th generation) introduction

Posted by zimmski@reddit | LocalLLaMA | View on Reddit | 76 comments

Samsung introduces Gauss2: A Multimodal Generative AI model in three sizes (Compact, Balanced, Supreme)

Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 16 comments

Wen GGUF?

Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 52 comments