jacek2023

google/gemma-4-12B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 99 comments
ui: Mermaid Diagrams in chat + interactive preview by allozaur · Pull Request #24032 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 6 comments
Holo3.1 35B/9B/4B/0.8B (Qwen 3.5 finetunes)

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 12 comments
ggml-org/gemma-4-12b-it-GGUF · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 13 comments
Intel Arc Pro B70 llama.cpp benchmarks posted

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
StepFun 3.5 MTP by pwilkin · Pull Request #23274 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments
LiquidAI/LFM2.5-8B-A1B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 83 comments
ui: Add Thinking mode toggle with reasoning effort levels + improvements for Chat Form Add Action UI by allozaur · Pull Request #23434 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 9 comments
GitHub - google-gemma/gemma-skills: Skills for the Gemma and model/agent interactions

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 7 comments
next MiniMax will be released in ~10 Days

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 52 comments
Add EXAONE 4.5 implementations by nuxlear · Pull Request #21733 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 2 comments
llama: use f16 mask for FA to save VRAM by am17an · Pull Request #23764 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 78 comments
Gryphe/Pantheon-Reasoning-27B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 18 comments
llama : website + unified `llama` binary · ggml-org/llama.cpp · Discussion #23875

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 15 comments
llama.app - Official home for llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 1 comments
webui: support video files as input by foldl · Pull Request #22830 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 10 comments
Is harness a new buzzword?

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 122 comments
Qwen/Qwen-Image-Bench · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 12 comments
Nemotron-Labs-Diffusion from NVIDIA

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 38 comments
mistralai/Mistral-Medium-3.5-128B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 322 comments
Mistral-Medium-3.5-128B-Q3_K_M on 3x3090 (72GB VRAM)

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 23 comments
LGAI-EXAONE/K-EXAONE-236B-A23B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 66 comments
server: fix checkpoints creation by jacekpoplawski · Pull Request #22929 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 40 comments
llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 54 comments
Qwen is cooking hard

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 225 comments
This is where we are right now, LocalLLaMA

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 491 comments
karpathy / autoresearch

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 96 comments
LatitudeGames/Equinox-31B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 36 comments
Tencent Hy 30B/7B/1.8B

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 29 comments
Move to backend sampling for MTP draft path by gaugarg-nv · Pull Request #23287 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 37 comments
[WIP] Gemma 4 MTP

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 50 comments
oops

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 0 comments
backend-agnostic tensor parallelism has been merged into llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 60 comments
inclusionAI/Ring-2.6-1T · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 39 comments
unsloth/MiMo-V2.5-GGUF · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
llama + spec: MTP Support by am17an · Pull Request #22673 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 7 comments
Which Gemma model do you want next?

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 116 comments
meantime on r/vibecoding

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
Qwen/WebWorld 32B/14B/8B (Qwen3 finetune)

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 10 comments
FINAL-Bench/Darwin-36B-Opus · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments
server, webui: support continue generation on reasoning models by ServeurpersoCom · Pull Request #22727 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 3 comments
it's time to update your Gemma 4 GGUFs

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 119 comments
feat: Add Mimo v2.5 model support by AesSedai · Pull Request #22493 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 39 comments
Gemma 4 on Android phones

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 31 comments
examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 22 comments
feat: add MiMo v2.5 vision by AesSedai · Pull Request #22883 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 8 comments
Desktop pets for AI coding agents

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 4 comments
AI has replaced programmers… totally.

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 297 comments
inclusionAI/Ling-2.5-1T · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 23 comments
model: add sarvam_moe architecture support by sumitchatterjee13 · Pull Request #20275 · ggml-org/llama.cpp

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 5 comments