ClimateBoss
Terrible speeds with LM Studio? (Is LM Studio bad?)
Posted by HugoCortell@reddit | LocalLLaMA | View on Reddit | 81 comments
Terrible speeds with LM Studio? (Is LM Studio bad?)
Posted by HugoCortell@reddit | LocalLLaMA | View on Reddit | 81 comments
ClimateBoss@reddit
Replacing $200/mo Cursor subscription with local Ollama + Claude API. Does this hybrid Mac/Windows setup make sense?
Posted by grohmaaan@reddit | LocalLLaMA | View on Reddit | 27 comments
ClimateBoss@reddit
Terrible speeds with LM Studio? (Is LM Studio bad?)
Posted by HugoCortell@reddit | LocalLLaMA | View on Reddit | 81 comments
ClimateBoss@reddit
RTX 6000 build / drive and fan questions
Posted by Direct_Bodybuilder63@reddit | LocalLLaMA | View on Reddit | 47 comments
ClimateBoss@reddit
RTX 6000 build / drive and fan questions
Posted by Direct_Bodybuilder63@reddit | LocalLLaMA | View on Reddit | 47 comments
ClimateBoss@reddit
The Synthetic Data Playbook: Generating Trillions of the Finest Tokens
Posted by joelinho95@reddit | LocalLLaMA | View on Reddit | 10 comments
ClimateBoss@reddit
High school student seeking advice: Found an architectural breakthrough that scales a 17.6B model down to 417M?
Posted by Appropriate-Scar3116@reddit | LocalLLaMA | View on Reddit | 210 comments
ClimateBoss@reddit
High school student seeking advice: Found an architectural breakthrough that scales a 17.6B model down to 417M?
Posted by Appropriate-Scar3116@reddit | LocalLLaMA | View on Reddit | 210 comments
ClimateBoss@reddit
High school student seeking advice: Found an architectural breakthrough that scales a 17.6B model down to 417M?
Posted by Appropriate-Scar3116@reddit | LocalLLaMA | View on Reddit | 210 comments
ClimateBoss@reddit
Which multi GPU for local training? v100, MI50, RTX 2080 22gb?
Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 6 comments
ClimateBoss@reddit (OP)
How to do Batching in Llama.cpp ? Speed goes down LOL?
Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 8 comments
ClimateBoss@reddit (OP)
Dual Tesla M40 12GiB Qwen 3.5 results (Ollama Ubuntu)
Posted by Ok-Internal9317@reddit | LocalLLaMA | View on Reddit | 3 comments
ClimateBoss@reddit
MLX vs GGUF (Unsloth) - Qwen3.5 122b-10b
Posted by waescher@reddit | LocalLLaMA | View on Reddit | 37 comments
ClimateBoss@reddit
Mac Studio 512GB RAM Option Disappears Amid Global DRAM Shortage
Posted by fairydreaming@reddit | LocalLLaMA | View on Reddit | 6 comments
ClimateBoss@reddit
Lads, time to recompile llama.cpp
Posted by muxxington@reddit | LocalLLaMA | View on Reddit | 56 comments
ClimateBoss@reddit
MLX vs GGUF (Unsloth) - Qwen3.5 122b-10b
Posted by waescher@reddit | LocalLLaMA | View on Reddit | 37 comments
ClimateBoss@reddit
How do I figure out -b batch size to increase token speed?
Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 4 comments
ClimateBoss@reddit (OP)
Which model to use for coding: qwen3.5 or qwen2.5-coder?
Posted by Mashic@reddit | LocalLLaMA | View on Reddit | 25 comments
ClimateBoss@reddit
[totally not an ad] combine 2x MCIO into 1x PCIe x16 adapter
Posted by MelodicRecognition7@reddit | LocalLLaMA | View on Reddit | 28 comments
ClimateBoss@reddit
Axe - a precision agentic coder. large codebases. zero bloat. terminal-native. precise retrieval. powerful inference. open-sourced.
Posted by EmbarrassedAsk2887@reddit | LocalLLaMA | View on Reddit | 14 comments
ClimateBoss@reddit
Current state of Qwen3.5-122B-A10B
Posted by kevin_1994@reddit | LocalLLaMA | View on Reddit | 37 comments
ClimateBoss@reddit
ik_llama.cpp Reasoning not working with GLM Models
Posted by KulangetaPestControl@reddit | LocalLLaMA | View on Reddit | 12 comments
ClimateBoss@reddit
VibeHQ, Orchestrate multiple Claude Code / Codex / Gemini CLI agents collaborate like a real company team. 7 agents built a hospital system from one prompt.
Posted by GGwithRabbit@reddit | LocalLLaMA | View on Reddit | 9 comments
ClimateBoss@reddit
LongCat-Flash-Lite 68.5B maybe a relatively good choice for a pure instruct model within the 24GB GPU VRAM constraint.
Posted by Sad-Pickle4282@reddit | LocalLLaMA | View on Reddit | 10 comments
ClimateBoss@reddit
Ubuntu or Debian? Speed difference on llama.cpp tokens?
Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 8 comments
ClimateBoss@reddit (OP)
How to generate songs using CofmyUi rtx 5060ti 16gb Tutorial
Posted by Legion10008@reddit | LocalLLaMA | View on Reddit | 2 comments
ClimateBoss@reddit
Is microsoft going to train LLM on this? Github is clearly getting destroyed.
Posted by FPham@reddit | LocalLLaMA | View on Reddit | 106 comments
ClimateBoss@reddit
Recommendations for a affordable prebuilt PC to run 120B LLM locally?
Posted by TechnologyLumpy5937@reddit | LocalLLaMA | View on Reddit | 20 comments
ClimateBoss@reddit
LightMem (ICLR 2026): Lightweight and Efficient Memory-Augmented Generation — 10×+ gains with 100× lower cost
Posted by zxlzr@reddit | LocalLLaMA | View on Reddit | 15 comments
ClimateBoss@reddit
Recommendations for a affordable prebuilt PC to run 120B LLM locally?
Posted by TechnologyLumpy5937@reddit | LocalLLaMA | View on Reddit | 20 comments
ClimateBoss@reddit
Vellium v0.4 — alternative simplified UI, updated writing mode and multi-char improvements
Posted by Possible_Statement84@reddit | LocalLLaMA | View on Reddit | 20 comments
ClimateBoss@reddit
Completed my 64GB VRAM rig - dual MI50 build + custom shroud
Posted by roackim@reddit | LocalLLaMA | View on Reddit | 49 comments
ClimateBoss@reddit
Completed my 64GB VRAM rig - dual MI50 build + custom shroud
Posted by roackim@reddit | LocalLLaMA | View on Reddit | 49 comments
ClimateBoss@reddit
Completed my 64GB VRAM rig - dual MI50 build + custom shroud
Posted by roackim@reddit | LocalLLaMA | View on Reddit | 49 comments
ClimateBoss@reddit
Completed my 64GB VRAM rig - dual MI50 build + custom shroud
Posted by roackim@reddit | LocalLLaMA | View on Reddit | 49 comments
ClimateBoss@reddit
Best Qwen3.5-35B-A3B GGUF for 24GB VRAM?!
Posted by VoidAlchemy@reddit | LocalLLaMA | View on Reddit | 83 comments
ClimateBoss@reddit
Best Qwen3.5-35B-A3B GGUF for 24GB VRAM?!
Posted by VoidAlchemy@reddit | LocalLLaMA | View on Reddit | 83 comments
ClimateBoss@reddit
What language large models can I run on a 5060 laptop with 32GB of RAM?
Posted by Smart-Cap-2216@reddit | LocalLLaMA | View on Reddit | 4 comments
ClimateBoss@reddit
4xP100 in NVlink how to get the most out of them?
Posted by Simple_Library_2700@reddit | LocalLLaMA | View on Reddit | 3 comments
ClimateBoss@reddit
What language large models can I run on a 5060 laptop with 32GB of RAM?
Posted by Smart-Cap-2216@reddit | LocalLLaMA | View on Reddit | 4 comments
ClimateBoss@reddit
Qwen 3 coder next ud-q8-xl F16 filling up the two orin rpc mesh!
Posted by braydon125@reddit | LocalLLaMA | View on Reddit | 10 comments
ClimateBoss@reddit
Qwen 3 coder next ud-q8-xl F16 filling up the two orin rpc mesh!
Posted by braydon125@reddit | LocalLLaMA | View on Reddit | 10 comments
ClimateBoss@reddit
I created yet another coding agent - Its tiny and fun (atleast for me), hope the community finds it useful
Posted by Weird_Search_4723@reddit | LocalLLaMA | View on Reddit | 42 comments
ClimateBoss@reddit
I created yet another coding agent - Its tiny and fun (atleast for me), hope the community finds it useful
Posted by Weird_Search_4723@reddit | LocalLLaMA | View on Reddit | 42 comments
ClimateBoss@reddit
How to Prompt Caching with llama.cpp?
Posted by ClimateBoss@reddit | LocalLLaMA | View on Reddit | 13 comments
ClimateBoss@reddit (OP)
I created yet another coding agent - Its tiny and fun (atleast for me), hope the community finds it useful
Posted by Weird_Search_4723@reddit | LocalLLaMA | View on Reddit | 42 comments
ClimateBoss@reddit
CXMT has been offering DDR4 chips at about half the prevailing market rate
Posted by johnnyApplePRNG@reddit | LocalLLaMA | View on Reddit | 16 comments
ClimateBoss@reddit
TeichAI/GLM-4.7-Flash-Claude-Opus-4.5-High-Reasoning-Distill-GGUF · Hugging Face
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 66 comments
ClimateBoss@reddit
llama.cpp PR to implement IQ*_K and IQ*_KS quants from ik_llama.cpp
Posted by TKGaming_11@reddit | LocalLLaMA | View on Reddit | 75 comments