Formal-Exam-8767
The Future of Free & Local Models: Training Co-Ops? Professional Orgs? Churches?
Posted by liftheavyscheisse@reddit | LocalLLaMA | View on Reddit | 18 comments
The Future of Free & Local Models: Training Co-Ops? Professional Orgs? Churches?
Posted by liftheavyscheisse@reddit | LocalLLaMA | View on Reddit | 18 comments
Formal-Exam-8767@reddit
Macbook M5 Pro 24GB or 48GB
Posted by Resident_Bell_4457@reddit | LocalLLaMA | View on Reddit | 69 comments
Formal-Exam-8767@reddit
Macbook M5 Pro 24GB or 48GB
Posted by Resident_Bell_4457@reddit | LocalLLaMA | View on Reddit | 69 comments
Formal-Exam-8767@reddit
I burned a weekend making the models "remember" me. The fix had nothing to do with trying to run bigger models locally
Posted by shbong@reddit | LocalLLaMA | View on Reddit | 20 comments
Formal-Exam-8767@reddit
Would you consider getting an NVIDIA RTX Spark laptop?
Posted by gamblingapocalypse@reddit | LocalLLaMA | View on Reddit | 167 comments
Formal-Exam-8767@reddit
Direct 100.0 t/s on Strix Halo with Qwen3 30B-A3B. Can anyone reproduce or beat this?
Posted by JSVD2@reddit | LocalLLaMA | View on Reddit | 22 comments
Formal-Exam-8767@reddit
Are GPUs getting cheaper?
Posted by iMakeSense@reddit | LocalLLaMA | View on Reddit | 11 comments
Formal-Exam-8767@reddit
Another shout out to llama.cpp build b9455 2x3090
Posted by Fabulous_Fact_606@reddit | LocalLLaMA | View on Reddit | 43 comments
Formal-Exam-8767@reddit
For those of you running vllm locally for inference what quantifications do you use
Posted by Limp_Classroom_2645@reddit | LocalLLaMA | View on Reddit | 4 comments
Formal-Exam-8767@reddit
Qwen 3.6-35B-A3B with 977 tk/s prompt processing and 262k context window on Intel Arc B70 Pro
Posted by Atomynos_Atom@reddit | LocalLLaMA | View on Reddit | 44 comments
Formal-Exam-8767@reddit
Intel Arc Pro B70 llama.cpp benchmarks posted
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
Formal-Exam-8767@reddit
Intel Arc Pro B70 llama.cpp benchmarks posted
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 48 comments
Formal-Exam-8767@reddit
RTX Spark does not have 600GB/s Bandwith
Posted by rpiguy9907@reddit | LocalLLaMA | View on Reddit | 194 comments
Formal-Exam-8767@reddit
RTX Spark does not have 600GB/s Bandwith
Posted by rpiguy9907@reddit | LocalLLaMA | View on Reddit | 194 comments
Formal-Exam-8767@reddit
Stop asking what model to run. There are literally only two.
Posted by Wrong_Mushroom_7350@reddit | LocalLLaMA | View on Reddit | 571 comments
Formal-Exam-8767@reddit
Cheap V100 32gb
Posted by MachineZer0@reddit | LocalLLaMA | View on Reddit | 28 comments
Formal-Exam-8767@reddit
Someone out there likely needs this: TP vs PP for 2 identical GPUs
Posted by xspider2000@reddit | LocalLLaMA | View on Reddit | 33 comments
Formal-Exam-8767@reddit
NVIDIA RTX Spark — Slim Laptops & Small Desktops
Posted by zxyzyxz@reddit | LocalLLaMA | View on Reddit | 56 comments
Formal-Exam-8767@reddit
NVIDIA RTX Spark — Slim Laptops & Small Desktops
Posted by zxyzyxz@reddit | LocalLLaMA | View on Reddit | 56 comments
Formal-Exam-8767@reddit
Why is there no community project for training your own LLM from scratch on consumer hardware?
Posted by tevlon@reddit | LocalLLaMA | View on Reddit | 69 comments
Formal-Exam-8767@reddit
Models still being vulnerable to Prompt Injection is actually a huge architectural red flag...
Posted by Comrade_Mugabe@reddit | LocalLLaMA | View on Reddit | 39 comments
Formal-Exam-8767@reddit
Custom 4x RTX PRO 6000 Blackwell server vs Dell GB300 for ~30 fine-tuned production pipelines — looking for honest input on direction
Posted by Consistent_Wash_276@reddit | LocalLLaMA | View on Reddit | 72 comments
Formal-Exam-8767@reddit
RTX5080 vs RTX 3090 ?
Posted by DarkAndrei@reddit | LocalLLaMA | View on Reddit | 48 comments
Formal-Exam-8767@reddit
Not sure if this was posted. But I think it's highly relevant to us.
Posted by Paradigmind@reddit | LocalLLaMA | View on Reddit | 194 comments
Formal-Exam-8767@reddit
Poor performance on RX 9070 XT
Posted by WhatererBlah555@reddit | LocalLLaMA | View on Reddit | 25 comments
Formal-Exam-8767@reddit
Poor performance on RX 9070 XT
Posted by WhatererBlah555@reddit | LocalLLaMA | View on Reddit | 25 comments
Formal-Exam-8767@reddit
Stop pretending self-hosting is cheaper. It's not. We do it for different reasons and we should say so.
Posted by Napster3301@reddit | LocalLLaMA | View on Reddit | 88 comments
Formal-Exam-8767@reddit
llama.cpp oom issue
Posted by TheTerrasque@reddit | LocalLLaMA | View on Reddit | 23 comments
Formal-Exam-8767@reddit
server: fix checkpoints creation by jacekpoplawski · Pull Request #22929 · ggml-org/llama.cpp
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 40 comments
Formal-Exam-8767@reddit
New Release of ROCm based MLX LLM Engine - lemon-mlx-engine
Posted by GeramyL@reddit | LocalLLaMA | View on Reddit | 35 comments
Formal-Exam-8767@reddit
Gemma 4 thinks I'm gaslighting it when I talk about Gemma 4 line of models
Posted by Jorlen@reddit | LocalLLaMA | View on Reddit | 14 comments
Formal-Exam-8767@reddit
Gemma 4 thinks I'm gaslighting it when I talk about Gemma 4 line of models
Posted by Jorlen@reddit | LocalLLaMA | View on Reddit | 14 comments
Formal-Exam-8767@reddit
AMD Powers Next-Generation Agent Computers with New Ryzen AI Halo Developer Platform and Ryzen AI Max PRO 400 Series Processors
Posted by Baumpaladin@reddit | LocalLLaMA | View on Reddit | 66 comments
Formal-Exam-8767@reddit
AMD BC-250 and the search for Cheap Compute
Posted by dugganmania@reddit | LocalLLaMA | View on Reddit | 41 comments
Formal-Exam-8767@reddit
[WIP] Gemma 4 MTP
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 50 comments
Formal-Exam-8767@reddit
[WIP] Gemma 4 MTP
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 50 comments
Formal-Exam-8767@reddit
Intel's Crescent Island PCB Leaks, Showing a Massive Xe3P GPU, 16-Pin Connector, 160GB LPDDR5X as Intel Sidesteps the HBM Shortage
Posted by FullstackSensei@reddit | LocalLLaMA | View on Reddit | 89 comments
Formal-Exam-8767@reddit
Intel's Crescent Island PCB Leaks, Showing a Massive Xe3P GPU, 16-Pin Connector, 160GB LPDDR5X as Intel Sidesteps the HBM Shortage
Posted by FullstackSensei@reddit | LocalLLaMA | View on Reddit | 89 comments
Formal-Exam-8767@reddit
What is the point of MoE models, beyond being faster?
Posted by ihatebeinganonymous@reddit | LocalLLaMA | View on Reddit | 135 comments
Formal-Exam-8767@reddit
Qwen 3.6 27B on 24GB VRAM setup: backend comparisons, quant choice and settings (llama.cpp, ik_llama.cpp, BeeLlama, vllm)
Posted by VolandBerlioz@reddit | LocalLLaMA | View on Reddit | 130 comments
Formal-Exam-8767@reddit
Reliable Open Source LLM as a Service
Posted by pravictor@reddit | LocalLLaMA | View on Reddit | 10 comments
Formal-Exam-8767@reddit
Different gpu mixed node
Posted by Force88@reddit | LocalLLaMA | View on Reddit | 10 comments
Formal-Exam-8767@reddit
The "the future is fictional" problem of many local LLMs
Posted by PromptInjection_@reddit | LocalLLaMA | View on Reddit | 54 comments
Formal-Exam-8767@reddit
Turboquant+MTP for ROCm(Llama CPP)
Posted by DrBearJ3w@reddit | LocalLLaMA | View on Reddit | 16 comments
Formal-Exam-8767@reddit
I've seen a lot of folks ask "can local LLMs actually do anything useful?"
Posted by NoWorking8412@reddit | LocalLLaMA | View on Reddit | 121 comments
Formal-Exam-8767@reddit
Estimate inference speed of local Qwen3.6-35B on Mac M5...
Posted by Altruistic-Dust-2565@reddit | LocalLLaMA | View on Reddit | 17 comments
Formal-Exam-8767@reddit
DIY market declining amid high RAM prices
Posted by Terminator857@reddit | LocalLLaMA | View on Reddit | 114 comments
Formal-Exam-8767@reddit
how you justify your spending time and resource for Local LLM to your love one ?
Posted by Merchant_Lawrence@reddit | LocalLLaMA | View on Reddit | 6 comments
Formal-Exam-8767@reddit
Strix Halo Clustering (Hardware Setup Discussion)
Posted by Thanks-Suitable@reddit | LocalLLaMA | View on Reddit | 20 comments