jacek2023

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

Let us let Google know that we want the Gemma 4 124b

Posted by seamonn@reddit | LocalLLaMA | View on Reddit | 39 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

jacek2023@reddit (OP)

[https://developers.googleblog.com/gemma-4-12b-the-developer-guide/](https://developers.googleblog.com/gemma-4-12b-the-developer-guide/)

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

jacek2023@reddit (OP)

it's a valid question but you should ask it on X 😉 [https://x.com/osanseviero/status/2062205174785921438](https://x.com/osanseviero/status/2062205174785921438)

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

ggml-org/gemma-4-12b-it-GGUF · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 13 comments

jacek2023@reddit (OP)

[https://www.reddit.com/r/LocalLLaMA/comments/1tvtn6m/googlegemma412b\_hugging\_face/](https://www.reddit.com/r/LocalLLaMA/comments/1tvtn6m/googlegemma412b_hugging_face/)

ggml-org/gemma-4-12b-it-GGUF · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 13 comments

jacek2023@reddit (OP)

[https://www.reddit.com/r/LocalLLaMA/comments/1tvtn6m/googlegemma412b\_hugging\_face/](https://www.reddit.com/r/LocalLLaMA/comments/1tvtn6m/googlegemma412b_hugging_face/)

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 171 comments

ggml-org/gemma-4-12b-it-GGUF · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 13 comments

Qwen 3.7 Plus just briefly appeared and then disappeared on OpenRouter.

Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 28 comments

How does the new abliteration tool Apostate compare with others? - Abliterlitics

Posted by nathandreamfast@reddit | LocalLLaMA | View on Reddit | 12 comments

Calling it now Microsoft is buying Unsloth.

Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 290 comments

Macbook M5 Pro 24GB or 48GB

Posted by Resident_Bell_4457@reddit | LocalLLaMA | View on Reddit | 66 comments

New Microsoft models are not open, right?

Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 7 comments

jacek2023@reddit

They wrote "locally" https://preview.redd.it/rvwzohoj415h1.png?width=605&format=png&auto=webp&s=9293f7f04f846eb88d442cf52636dc30fcac30a9

Calling it now Microsoft is buying Unsloth.

Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 290 comments

Would you consider getting an NVIDIA RTX Spark laptop?

Posted by gamblingapocalypse@reddit | LocalLLaMA | View on Reddit | 157 comments

StepFun 3.5 MTP by pwilkin · Pull Request #23274 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments

StepFun 3.5 MTP by pwilkin · Pull Request #23274 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

I know… I know… But how to replace ChatGPT locally?

Posted by Thin_Pollution8843@reddit | LocalLLaMA | View on Reddit | 11 comments

Ignoring benchmarks, how do the newest local models (gemma 4 31B, 26BA4B, Qwen 3.6) “feel” to you? What do you think they compare to?

Posted by opoot_@reddit | LocalLLaMA | View on Reddit | 42 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

Is agenting usage increasing CPU usage for you?

Posted by superloser48@reddit | LocalLLaMA | View on Reddit | 10 comments

Dual rtx 3090 build

Posted by Sufficient_Phone_242@reddit | LocalLLaMA | View on Reddit | 68 comments

Is agenting usage increasing CPU usage for you?

Posted by superloser48@reddit | LocalLLaMA | View on Reddit | 10 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

jacek2023@reddit (OP)

You commented on my post, I shared someone’s benchmarks for people considering B70s. I replied to your comment saying that buying B70s might be easier than buying four 3090s. I already have three 3090s.

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

jacek2023@reddit (OP)

I’ve been trying to buy a fourth 3090 for a long time, but prices are rising and availability is very low. At this point, I think buying four B70s would be easier than finding 3090s

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments

jacek2023@reddit (OP)

I have no idea, but I see SYCL pull requests in llama.cpp, so I assume the backend is still being improved. These benchmarks at least establish a baseline. GPU works and it’s a much more affordable than 5090 (to run big models you need VRAM first and speed is often less crucial)

I hate to be this guy but: Any good, recent CODING models in the 70-80B range?

Posted by ParaboloidalCrest@reddit | LocalLLaMA | View on Reddit | 112 comments

I hate to be this guy but: Any good, recent CODING models in the 70-80B range?

Posted by ParaboloidalCrest@reddit | LocalLLaMA | View on Reddit | 112 comments

jacek2023@reddit

I have multiple finetunes of GLM Air but I don't use it for coding. Currently I use Qwen 3.6 27B with pi and I have zero problems with it - actually Claude Code (with Opus) started to annoy me with its speed.

I hate to be this guy but: Any good, recent CODING models in the 70-80B range?

Posted by ParaboloidalCrest@reddit | LocalLLaMA | View on Reddit | 112 comments

Genuinely what do we do about the bot comments in this sub

Posted by Borkato@reddit | LocalLLaMA | View on Reddit | 102 comments

Added an old 2070 Super to my rig and I can't go back...worse, now I need more

Posted by PferdOne@reddit | LocalLLaMA | View on Reddit | 46 comments

MTP is nice and all, but what about PP speeds?

Posted by milpster@reddit | LocalLLaMA | View on Reddit | 30 comments

jacek2023@reddit

It's very important to minimize prompt processing (number of tokens to process), make sure you use latest llama.cpp and you "preserve thinking", this way my prompt processing is fast

Someone out there likely needs this: TP vs PP for 2 identical GPUs

Posted by xspider2000@reddit | LocalLLaMA | View on Reddit | 33 comments

Someone out there likely needs this: TP vs PP for 2 identical GPUs

Posted by xspider2000@reddit | LocalLLaMA | View on Reddit | 33 comments

Someone out there likely needs this: TP vs PP for 2 identical GPUs

Posted by xspider2000@reddit | LocalLLaMA | View on Reddit | 33 comments

next MiniMax will be released in ~10 Days

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 52 comments

jacek2023@reddit (OP)

Yes, I probably won't be able to run this model, but I wanted to point out that MiniMax is still open, while Qwen’s current status is unknown.