jacek2023
-
google/gemma-4-12B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 99 comments
-
ui: Mermaid Diagrams in chat + interactive preview by allozaur · Pull Request #24032 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 6 comments
-
Holo3.1 35B/9B/4B/0.8B (Qwen 3.5 finetunes)
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 12 comments
-
ggml-org/gemma-4-12b-it-GGUF · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 13 comments
-
Intel Arc Pro B70 llama.cpp benchmarks posted
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
-
StepFun 3.5 MTP by pwilkin · Pull Request #23274 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments
-
LiquidAI/LFM2.5-8B-A1B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 83 comments
-
ui: Add Thinking mode toggle with reasoning effort levels + improvements for Chat Form Add Action UI by allozaur · Pull Request #23434 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 9 comments
-
GitHub - google-gemma/gemma-skills: Skills for the Gemma and model/agent interactions
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 7 comments
-
next MiniMax will be released in ~10 Days
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 52 comments
-
Add EXAONE 4.5 implementations by nuxlear · Pull Request #21733 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 2 comments
-
llama: use f16 mask for FA to save VRAM by am17an · Pull Request #23764 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 78 comments
-
Gryphe/Pantheon-Reasoning-27B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 18 comments
-
llama : website + unified `llama` binary · ggml-org/llama.cpp · Discussion #23875
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 15 comments
-
llama.app - Official home for llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 1 comments
-
webui: support video files as input by foldl · Pull Request #22830 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 10 comments
-
Is harness a new buzzword?
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 122 comments
-
Qwen/Qwen-Image-Bench · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Nemotron-Labs-Diffusion from NVIDIA
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 38 comments
-
mistralai/Mistral-Medium-3.5-128B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 322 comments
-
Mistral-Medium-3.5-128B-Q3_K_M on 3x3090 (72GB VRAM)
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 23 comments
-
LGAI-EXAONE/K-EXAONE-236B-A23B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 66 comments
-
server: fix checkpoints creation by jacekpoplawski · Pull Request #22929 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 40 comments
-
llama: avoid copying logits during prompt decode in MTP by am17an · Pull Request #23198 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 54 comments
-
Qwen is cooking hard
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 225 comments
-
This is where we are right now, LocalLLaMA
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 491 comments
-
karpathy / autoresearch
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 96 comments
-
LatitudeGames/Equinox-31B · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 36 comments
-
Tencent Hy 30B/7B/1.8B
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 29 comments
-
Move to backend sampling for MTP draft path by gaugarg-nv · Pull Request #23287 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 37 comments
-
[WIP] Gemma 4 MTP
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 50 comments
-
oops
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 0 comments
-
backend-agnostic tensor parallelism has been merged into llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 60 comments
-
inclusionAI/Ring-2.6-1T · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 39 comments
-
unsloth/MiMo-V2.5-GGUF · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 59 comments
-
llama + spec: MTP Support by am17an · Pull Request #22673 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 7 comments
-
Which Gemma model do you want next?
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 116 comments
-
meantime on r/vibecoding
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
-
Qwen/WebWorld 32B/14B/8B (Qwen3 finetune)
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 10 comments
-
FINAL-Bench/Darwin-36B-Opus · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 21 comments
-
server, webui: support continue generation on reasoning models by ServeurpersoCom · Pull Request #22727 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 3 comments
-
it's time to update your Gemma 4 GGUFs
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 119 comments
-
feat: Add Mimo v2.5 model support by AesSedai · Pull Request #22493 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 39 comments
-
Gemma 4 on Android phones
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 31 comments
-
examples : add llama-eval by ggerganov · Pull Request #21152 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 22 comments
-
feat: add MiMo v2.5 vision by AesSedai · Pull Request #22883 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Desktop pets for AI coding agents
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 4 comments
-
AI has replaced programmers… totally.
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 297 comments
-
inclusionAI/Ling-2.5-1T · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 23 comments
-
model: add sarvam_moe architecture support by sumitchatterjee13 · Pull Request #20275 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 5 comments