-
Qwen 3 Max Official Pricing
Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 0 comments
-
EmbeddingGemma - 300M parameter, state-of-the-art for its size, open embedding model from Google
Posted by curiousily_@reddit | LocalLLaMA | View on Reddit | 66 comments
-
This is not funny...this is simply 1000000% correct
Posted by theundertakeer@reddit | LocalLLaMA | View on Reddit | 100 comments
-
Qwen 3 Max Official Benchmarks (possibly open sourcing later..?)
Posted by Trevor050@reddit | LocalLLaMA | View on Reddit | 27 comments
-
How do I run AI locally? And what is the most efficient model / software?
Posted by 24_1378@reddit | LocalLLaMA | View on Reddit | 6 comments
-
PSA: Make sure your API ports aren't exposed to the open internet
Posted by nooclear@reddit | LocalLLaMA | View on Reddit | 67 comments
-
Unsloth just released their GGUF of Kimi-K2-Instruct-0905!
Posted by TheAndyGeorge@reddit | LocalLLaMA | View on Reddit | 37 comments
-
LongPage: 300 full novels with reasoning traces for training better writing LLMs
Posted by Senior_Evidence_3793@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Rant..
Posted by LingonberryMore960@reddit | LocalLLaMA | View on Reddit | 4 comments
-
New kimi-k2 on Fiction.liveBench
Posted by fictionlive@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Converted my unused laptop into a family server for gpt-oss 20B
Posted by Vaddieg@reddit | LocalLLaMA | View on Reddit | 86 comments
-
Kwai-Klear/Klear-46B-A2.5B-Instruct: Sparse-MoE LLM (46B total / only 2.5B active)
Posted by paf1138@reddit | LocalLLaMA | View on Reddit | 3 comments
-
New AI Dungeon Models: Wayfarer 2 12B & Nova 70B
Posted by NottKolby@reddit | LocalLLaMA | View on Reddit | 33 comments
-
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more.
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 458 comments
-
Samantha ai for complete is control
Posted by Musclenerd06@reddit | LocalLLaMA | View on Reddit | 0 comments
-
I made a "reasoning version" of K2 0905 by getting Qwen 3 235B to do the reasoning, then once it exits, I switched to model to K2 and let it continue, and it works great.
Posted by Longjumping_Spot5843@reddit | LocalLLaMA | View on Reddit | 0 comments
-
A Cursor-like coding platform is launching an AMA with 2,000 free credits each, might be worth a look guys
Posted by lucienbaba@reddit | LocalLLaMA | View on Reddit | 3 comments
-
Qwen 3 max
Posted by LeatherRub7248@reddit | LocalLLaMA | View on Reddit | 63 comments
-
Kimi-K2-Instruct-0905 Released!
Posted by Dr_Karminski@reddit | LocalLLaMA | View on Reddit | 175 comments
-
Advice a beginner please!
Posted by SailAway1798@reddit | LocalLLaMA | View on Reddit | 32 comments
-
Where is theBloke?
Posted by holistic-engine@reddit | LocalLLaMA | View on Reddit | 28 comments
-
Qwen released API of Qwen3-Max-Preview (Instruct)
Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Testing World Knowledge; and What Reasoning Does To It (regarding airliners, specifically)
Posted by airbus_a360_when@reddit | LocalLLaMA | View on Reddit | 22 comments
-
Current SOTA Text to Text LLM?
Posted by 1GewinnerTwitch@reddit | LocalLLaMA | View on Reddit | 8 comments
-
List of open models released or updated this week on this sub, just in case you missed one.
Posted by aifeed-fyi@reddit | LocalLLaMA | View on Reddit | 19 comments
-
Multiple GPUs and supplying power to the PCIe slots
Posted by hainesk@reddit | LocalLLaMA | View on Reddit | 18 comments
-
Is there any way to make llm convert the english words in my xml file into their meaning in my target language?
Posted by FatFigFresh@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Folks any views on using LLMs like Gemma 3 12b 27b for Embeddings ?
Posted by Leather-Departure-38@reddit | LocalLLaMA | View on Reddit | 8 comments
-
Is there any all-in-one app like LM Studio, but with the option of hosting a Web UI server?
Posted by HRudy94@reddit | LocalLLaMA | View on Reddit | 57 comments
-
Best gpu setup for under $500 usd
Posted by milesChristi16@reddit | LocalLLaMA | View on Reddit | 74 comments
-
Qwen3 latest and most powerful language model
Posted by darkpigvirus@reddit | LocalLLaMA | View on Reddit | 6 comments
-
Which (1 or 2-story) frame to use for 7 GPU rig?
Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 0 comments
-
LocalScore - Local LLM Benchmark
Posted by sipjca@reddit | LocalLLaMA | View on Reddit | 18 comments
-
Th AI/LLM race is absolutely insane
Posted by No-Underscore_s@reddit | LocalLLaMA | View on Reddit | 141 comments
-
Two RTX 6000 Pro Blackwell..what's it get you?
Posted by SteveRD1@reddit | LocalLLaMA | View on Reddit | 23 comments
-
Struggling with OpenRouter sessions, tried something different
Posted by Any-Marionberry4035@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Best model for speech to text Transcription for including filler words ?
Posted by Similar-Camp9685@reddit | LocalLLaMA | View on Reddit | 1 comments
-
Title: Is Anthropic’s new restriction really about national security, or just protecting market share?
Posted by LuozhuZhang@reddit | LocalLLaMA | View on Reddit | 26 comments
-
Anyone here with an AMD AI Max+ 395 + 128GB setup running coding agents?
Posted by Admirable_Reality281@reddit | LocalLLaMA | View on Reddit | 36 comments
-
Anyone else been using the new nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 model?
Posted by kevin_1994@reddit | LocalLLaMA | View on Reddit | 28 comments
-
Seems new model qwen 3 max preview is already available on qwen chat
Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 8 comments
-
PC for local LLM inference/GenAI development
Posted by JMarinG@reddit | LocalLLaMA | View on Reddit | 4 comments
-
Looking to buy a 2nd laptop
Posted by SilverRegion9394@reddit | LocalLLaMA | View on Reddit | 4 comments
-
What is the best inference model you have tried at 64gb VRAM and 128gb VRAM?
Posted by seoulsrvr@reddit | LocalLLaMA | View on Reddit | 5 comments
-
Current best uncensored model?
Posted by Accomplished-Feed568@reddit | LocalLLaMA | View on Reddit | 209 comments
-
VibeVoice quantized to 4 bit and 8 bit with some code to run it...
Posted by teachersecret@reddit | LocalLLaMA | View on Reddit | 21 comments
-
What is the name of that tool??? [HELP]
Posted by Vaguely_Smart_Cookie@reddit | LocalLLaMA | View on Reddit | 2 comments
-
Environments Hub walkthrough: Your Language Model needs better (open) environments to learn
Posted by anakin_87@reddit | LocalLLaMA | View on Reddit | 0 comments
-
Looking for ressoruces and team for AGI
Posted by LahmeriMohamed@reddit | LocalLLaMA | View on Reddit | 8 comments
-
I've made some fun demos using the new kimi-k2-0905
Posted by Dr_Karminski@reddit | LocalLLaMA | View on Reddit | 21 comments