-
Struggling with Qwen3.6 27B / 35B locally (3090): slow responses, breaking code; looking for better setup + auto model switching
Posted by Clean_Initial_9618@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Prompt injection benchmark: delimiter + strict prompt took Gemma 4 from 21% to 100% defense rate (15 models, 6100+ tests)
Posted by User_Deprecated@reddit | LocalLLaMA | View on Reddit | 4 comments
-
I made a voice controlled Tic-Tac-Toe game as a learning project
Posted by dabiggmoe2@reddit | LocalLLaMA | View on Reddit | 1 comment
-
Qwen3.6 27B FP8 runs with 200k tokens of BF16 KV cache at 80 TPS on a single RTX 5000 PRO 48GB
Posted by __JockY__@reddit | LocalLLaMA | View on Reddit | 62 comments
-
Interested in agents but clueless noob. Please help
Posted by Silver-Champion-4846@reddit | LocalLLaMA | View on Reddit | 37 comments
-
AMD Strix Halo refresh with 192GB!
Posted by mindwip@reddit | LocalLLaMA | View on Reddit | 142 comments
-
How much will it cost to host something like Qwen3.6 35B A3B in the cloud?
Posted by Euphoric_North_745@reddit | LocalLLaMA | View on Reddit | 140 comments
-
Follow-up: Qwen3.6-27B on 1× RTX 3090 — pushing to ~218K context + ~50–66 TPS, tool calls now stable (PN12 fix)
Posted by AmazingDrivers4u@reddit | LocalLLaMA | View on Reddit | 65 comments
-
Running Qwen-3.6-35B-A3B locally is very slow
Posted by Sad-Duck2812@reddit | LocalLLaMA | View on Reddit | 3 comments
-
Open source models are going to be the future on Cursor, OpenCode etc.
Posted by _maverick98@reddit | LocalLLaMA | View on Reddit | 144 comments
-
Questions about revisiting local LLM roleplay.
Posted by newbuildertfb@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Local Rag SDK
Posted by DetectiveMindless652@reddit | LocalLLaMA | View on Reddit | 10 comments
-
We are finally there: Qwen3.6-27B + agentic search; 95.7% SimpleQA on a single 3090, fully local
Posted by ComplexIt@reddit | LocalLLaMA | View on Reddit | 84 comments
-
White House Considers Vetting A.I. Models Before They Are Released
Posted by fallingdowndizzyvr@reddit | LocalLLaMA | View on Reddit | 365 comments
-
Benching local Qwen as a Codex validator, co-agent, and challenger
Posted by robert896r1@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Llama.cpp MTP support now in beta!
Posted by ilintar@reddit | LocalLLaMA | View on Reddit | 231 comments
-
DeepSeek V4 Pro matches GPT-5.2 on FoodTruck Bench, our agentic benchmark — 10 weeks later, ~17× cheaper
Posted by Disastrous_Theme5906@reddit | LocalLLaMA | View on Reddit | 15 comments
-
About Kimi K2.6
Posted by Exact_Law_6489@reddit | LocalLLaMA | View on Reddit | 42 comments
-
AMD and Nvidia cards on the same rig
Posted by deathcom65@reddit | LocalLLaMA | View on Reddit | 4 comments
-
Qwen3.6-27B vs 35B, I prefer 35B but more people here post about 27B...
Posted by Snoo_27681@reddit | LocalLLaMA | View on Reddit | 162 comments
-
Which model for 32GB M2 Max?
Posted by segdy@reddit | LocalLLaMA | View on Reddit | 14 comments
-
Dual 3090 setup - performance optimization
Posted by PaMRxR@reddit | LocalLLaMA | View on Reddit | 41 comments
-
Trying to train tiny LLMs on a length-constrained Reddit post summarization task using GRPO on 3x Mac Minis - updates!
Posted by East-Muffin-6472@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Peanut - Text to Image Model (Open Weights coming soon)
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 12 comments
-
Secondary PC options
Posted by UniqueIdentifier00@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Advice needed on eGPU and Mini PC
Posted by Kulidc@reddit | LocalLLaMA | View on Reddit | 19 comments
-
APEX MoE quants update: 25+ new models since the Qwen 3.5 post + new I-Nano tier
Posted by mudler_it@reddit | LocalLLaMA | View on Reddit | 35 comments
-
A Dark-Money Campaign Is Paying Influencers to Frame Chinese AI as a Threat
Posted by pmttyji@reddit | LocalLLaMA | View on Reddit | 162 comments
-
1080 Ti in 2026 - 11GB is still (barely) enough to stay relevant
Posted by srodland01@reddit | LocalLLaMA | View on Reddit | 9 comments
-
Strix Halo, Debian 13@6.16.12&6.17.8, Qwen3Coder-Q8 CTX<=131k, llama.cpp@Vulkan&ROCm, Power & Efficiency
Posted by Educational_Sun_8813@reddit | LocalLLaMA | View on Reddit | 21 comments
-
Built a Voice Agents from Scratch GitHub tutorial: mic > Whisper > local LLM (GGUF) > Kokoro > speaker, fully local, no API keys
Posted by purellmagents@reddit | LocalLLaMA | View on Reddit | 11 comments
-
MTPLX | 2.24x faster TPS | The native MTP inference engine for Apple Silicon
Posted by YoussofAl@reddit | LocalLLaMA | View on Reddit | 30 comments
-
The more I use it, the more I'm impressed
Posted by ComfyUser48@reddit | LocalLLaMA | View on Reddit | 88 comments
-
PFlash: 10x prefill speedup over llama.cpp at 128K on a RTX 3090
Posted by sandropuppo@reddit | LocalLLaMA | View on Reddit | 90 comments
-
Why is no open weight model inference provider hosting Mimo-v2.5 or Mimo-v2.5-pro?
Posted by True_Requirement_891@reddit | LocalLLaMA | View on Reddit | 30 comments
-
vibevoice.cpp: Microsoft VibeVoice (TTS + long-form ASR with diarization) ported to ggml/C++, runs on CPU/CUDA/Metal/Vulkan, no Python at inference
Posted by mudler_it@reddit | LocalLLaMA | View on Reddit | 4 comments
-
Llama.cpp quantization is broken
Posted by Ok-Importance-3529@reddit | LocalLLaMA | View on Reddit | 53 comments
-
Qwen 3.6 27B looping problem
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 12 comments
-
As MTP prepares to land in llama.cpp: models that support MTP
Posted by segmond@reddit | LocalLLaMA | View on Reddit | 24 comments
-
vLLM Just Merged TurboQuant Fix for Qwen 3.5+
Posted by havenoammo@reddit | LocalLLaMA | View on Reddit | 19 comments
-
A simple "hack" to speed up prompt processing for Qwen 3.5/3.6 in LM Studio
Posted by GrungeWerX@reddit | LocalLLaMA | View on Reddit | 14 comments
-
Best Open Source Voice Cloning if you have lots of reference audio?
Posted by SlaveToBuy@reddit | LocalLLaMA | View on Reddit | 20 comments
-
Anyone else struggling with multi-GPU stability when running larger local models?
Posted by Lyceum_Tech@reddit | LocalLLaMA | View on Reddit | 21 comments
-
Do cheap 32GB V100s still make sense for homelab AI?
Posted by SKX007J1@reddit | LocalLLaMA | View on Reddit | 50 comments
-
ROG Flow Z13 best laptop for local LLMs?
Posted by Bombarding_@reddit | LocalLLaMA | View on Reddit | 36 comments
-
Qwen 3.6 27B BF16 vs Q4_K_M vs Q8_0 GGUF evaluation
Posted by gvij@reddit | LocalLLaMA | View on Reddit | 152 comments
-
I made Soprano-80M: Stream ultra-realistic TTS in <15ms, up to 2000x realtime, and <1 GB VRAM, released under Apache 2.0!
Posted by eugenekwek@reddit | LocalLLaMA | View on Reddit | 107 comments
-
FastDMS: 6.4X KV-cache compression running faster than vLLM BF16/FP8
Posted by randomfoo2@reddit | LocalLLaMA | View on Reddit | 14 comments
-
Building an LLM Quants Testing Site/Resource - Sharing a few insights from the first month, so you can share your thoughts and wishes for the future.
Posted by norms_are_practical@reddit | LocalLLaMA | View on Reddit | 3 comments
-
First-time GPU buyer. Got an RTX 5000 Pro. Was it a bad decision compared to two 3090s?
Posted by Valuable-Run2129@reddit | LocalLLaMA | View on Reddit | 96 comments