RipperFox
how would you set up a local llm server for a business of 7 people?
Posted by snowieslilpikachu69@reddit | LocalLLaMA | View on Reddit | 60 comments
How do you start your Llama.cpp server?
Posted by Citadel_Employee@reddit | LocalLLaMA | View on Reddit | 40 comments
RipperFox@reddit
How do you start your Llama.cpp server?
Posted by Citadel_Employee@reddit | LocalLLaMA | View on Reddit | 40 comments
RipperFox@reddit
How do you start your Llama.cpp server?
Posted by Citadel_Employee@reddit | LocalLLaMA | View on Reddit | 40 comments
RipperFox@reddit
Hard freakin' decision..Blackwell 96G or Mac Studio 256G
Posted by HyPyke@reddit | LocalLLaMA | View on Reddit | 212 comments
RipperFox@reddit
Hard freakin' decision..Blackwell 96G or Mac Studio 256G
Posted by HyPyke@reddit | LocalLLaMA | View on Reddit | 212 comments
RipperFox@reddit
Qwen 3.6 27B Makes Huge Gains in Agency on Artificial Analysis - Ties with Sonnet 4.6
Posted by dionysio211@reddit | LocalLLaMA | View on Reddit | 177 comments
RipperFox@reddit
Why are we actually sampling reasoning and output the same way?
Posted by ReporterWeary9721@reddit | LocalLLaMA | View on Reddit | 21 comments
RipperFox@reddit
Forgive my ignorance but how is a 27B model better than 397B?
Posted by No_Conversation9561@reddit | LocalLLaMA | View on Reddit | 286 comments
RipperFox@reddit
Forgive my ignorance but how is a 27B model better than 397B?
Posted by No_Conversation9561@reddit | LocalLLaMA | View on Reddit | 286 comments
RipperFox@reddit
Qwen 3.6 35B crushes Gemma 4 26B on my tests
Posted by Lowkey_LokiSN@reddit | LocalLLaMA | View on Reddit | 116 comments
RipperFox@reddit
DGX Spark just arrived — planning to run vLLM + local models, looking for advice
Posted by dalemusser@reddit | LocalLLaMA | View on Reddit | 90 comments
RipperFox@reddit
Gemma 4 31B — 4bit is all you need
Posted by tolitius@reddit | LocalLLaMA | View on Reddit | 75 comments
RipperFox@reddit
openrouter/elephant-alpha is 99% Chinese, likely Qwen 3 Nex
Posted by Winter_Put_6046@reddit | LocalLLaMA | View on Reddit | 5 comments
RipperFox@reddit
What Is Elephant-Alpha ???
Posted by One_Title_3656@reddit | LocalLLaMA | View on Reddit | 117 comments
RipperFox@reddit
What Is Elephant-Alpha ???
Posted by One_Title_3656@reddit | LocalLLaMA | View on Reddit | 117 comments
RipperFox@reddit
What Is Elephant-Alpha ???
Posted by One_Title_3656@reddit | LocalLLaMA | View on Reddit | 117 comments
RipperFox@reddit
What Is Elephant-Alpha ???
Posted by One_Title_3656@reddit | LocalLLaMA | View on Reddit | 117 comments
RipperFox@reddit
Qwen3.5-35B-A3B-Uncensored-FernflowerAI-GGUF
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 203 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
What Is Elephant-Alpha ???
Posted by One_Title_3656@reddit | LocalLLaMA | View on Reddit | 117 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Gemma 4 has a systemic attention failure. Here's the proof.
Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 150 comments
RipperFox@reddit
Shipped local LLM-powered SQL generation in a desktop app - Qwen2.5-Coder, fully on-device, with auto self-healing
Posted by Pitiful_Comedian_834@reddit | LocalLLaMA | View on Reddit | 4 comments
RipperFox@reddit
It looks like there are no plans for smaller GLM models
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments
RipperFox@reddit
Where is MiniMax M2.7?
Posted by lolwutdo@reddit | LocalLLaMA | View on Reddit | 18 comments
RipperFox@reddit
GLM 5.1 tops the code arena rankings for open models
Posted by Auralore@reddit | LocalLLaMA | View on Reddit | 146 comments
RipperFox@reddit
Gemma 4, llama.cpp, tool calls, and tool results - ChatGPT fixed it for me
Posted by TheProgrammer-231@reddit | LocalLLaMA | View on Reddit | 48 comments
RipperFox@reddit
Choice for agentic LLM or help optimize Qwen3.5-35B-A3B for 24GB VRAM
Posted by marivesel@reddit | LocalLLaMA | View on Reddit | 24 comments
RipperFox@reddit
Research: how do you handle persistent context/memory with local models?
Posted by Mammoth_Resolve4418@reddit | LocalLLaMA | View on Reddit | 2 comments
RipperFox@reddit
Get 30K more context using Q8 mmproj with Gemma 4
Posted by Sadman782@reddit | LocalLLaMA | View on Reddit | 18 comments
RipperFox@reddit
Harmonic-9B - Two-stage Qwen3.5-9B fine-tune (Stage 2 still training)
Posted by Crampappydime@reddit | LocalLLaMA | View on Reddit | 7 comments
RipperFox@reddit
Don’t buy the DGX Spark: NVFP4 Still Missing After 6 Months
Posted by Secure_Archer_1529@reddit | LocalLLaMA | View on Reddit | 196 comments
RipperFox@reddit
What should a new SysAdmin know first?
Posted by drake90001@reddit | sysadmin | View on Reddit | 65 comments
RipperFox@reddit
Llama.cpp developers right now
Posted by ML-Future@reddit | LocalLLaMA | View on Reddit | 99 comments
RipperFox@reddit
Copaw-9B (Qwen3.5 9b, alibaba official agentic finetune) is out
Posted by kironlau@reddit | LocalLLaMA | View on Reddit | 64 comments
RipperFox@reddit
Copaw-9B (Qwen3.5 9b, alibaba official agentic finetune) is out
Posted by kironlau@reddit | LocalLLaMA | View on Reddit | 64 comments
RipperFox@reddit
Copaw-9B (Qwen3.5 9b, alibaba official agentic finetune) is out
Posted by kironlau@reddit | LocalLLaMA | View on Reddit | 64 comments
RipperFox@reddit
I'm building a benchmark comparing models for an agentic task. Are there any small models I should be testing that I haven't?
Posted by nickl@reddit | LocalLLaMA | View on Reddit | 40 comments
RipperFox@reddit
I'm building a benchmark comparing models for an agentic task. Are there any small models I should be testing that I haven't?
Posted by nickl@reddit | LocalLLaMA | View on Reddit | 40 comments
RipperFox@reddit
The Hardest Thing: Building and Running the UNIX Kernel from Original Sources
Posted by MatchingTurret@reddit | linux | View on Reddit | 34 comments
RipperFox@reddit
Customer doesn't understand the difference between a HDD and a SSD
Posted by FreaksLP3000@reddit | talesfromtechsupport | View on Reddit | 30 comments