TheaterFire

Login with Reddit

Currently browsing tags:

  • LocalLLaMA
  • Qwen 3 Max Official Pricing

    Posted by entsnack@reddit | LocalLLaMA | View on Reddit | 0 comments

  • EmbeddingGemma - 300M parameter, state-of-the-art for its size, open embedding model from Google

    Posted by curiousily_@reddit | LocalLLaMA | View on Reddit | 66 comments

  • This is not funny...this is simply 1000000% correct

    Posted by theundertakeer@reddit | LocalLLaMA | View on Reddit | 100 comments

  • Qwen 3 Max Official Benchmarks (possibly open sourcing later..?)

    Posted by Trevor050@reddit | LocalLLaMA | View on Reddit | 27 comments

  • How do I run AI locally? And what is the most efficient model / software?

    Posted by 24_1378@reddit | LocalLLaMA | View on Reddit | 6 comments

  • PSA: Make sure your API ports aren't exposed to the open internet

    Posted by nooclear@reddit | LocalLLaMA | View on Reddit | 67 comments

  • Unsloth just released their GGUF of Kimi-K2-Instruct-0905!

    Posted by TheAndyGeorge@reddit | LocalLLaMA | View on Reddit | 37 comments

  • LongPage: 300 full novels with reasoning traces for training better writing LLMs

    Posted by Senior_Evidence_3793@reddit | LocalLLaMA | View on Reddit | 2 comments

  • Rant..

    Posted by LingonberryMore960@reddit | LocalLLaMA | View on Reddit | 4 comments

  • New kimi-k2 on Fiction.liveBench

    Posted by fictionlive@reddit | LocalLLaMA | View on Reddit | 1 comments

  • Converted my unused laptop into a family server for gpt-oss 20B

    Posted by Vaddieg@reddit | LocalLLaMA | View on Reddit | 86 comments

  • Kwai-Klear/Klear-46B-A2.5B-Instruct: Sparse-MoE LLM (46B total / only 2.5B active)

    Posted by paf1138@reddit | LocalLLaMA | View on Reddit | 3 comments

  • New AI Dungeon Models: Wayfarer 2 12B & Nova 70B

    Posted by NottKolby@reddit | LocalLLaMA | View on Reddit | 33 comments

  • AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more.

    Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 458 comments

  • Samantha ai for complete is control

    Posted by Musclenerd06@reddit | LocalLLaMA | View on Reddit | 0 comments

  • I made a "reasoning version" of K2 0905 by getting Qwen 3 235B to do the reasoning, then once it exits, I switched to model to K2 and let it continue, and it works great.

    Posted by Longjumping_Spot5843@reddit | LocalLLaMA | View on Reddit | 0 comments

  • A Cursor-like coding platform is launching an AMA with 2,000 free credits each, might be worth a look guys

    Posted by lucienbaba@reddit | LocalLLaMA | View on Reddit | 3 comments

  • Qwen 3 max

    Posted by LeatherRub7248@reddit | LocalLLaMA | View on Reddit | 63 comments

  • Kimi-K2-Instruct-0905 Released!

    Posted by Dr_Karminski@reddit | LocalLLaMA | View on Reddit | 175 comments

  • Advice a beginner please!

    Posted by SailAway1798@reddit | LocalLLaMA | View on Reddit | 32 comments

  • Where is theBloke?

    Posted by holistic-engine@reddit | LocalLLaMA | View on Reddit | 28 comments

  • Qwen released API of Qwen3-Max-Preview (Instruct)

    Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 8 comments

  • Testing World Knowledge; and What Reasoning Does To It (regarding airliners, specifically)

    Posted by airbus_a360_when@reddit | LocalLLaMA | View on Reddit | 22 comments

  • Current SOTA Text to Text LLM?

    Posted by 1GewinnerTwitch@reddit | LocalLLaMA | View on Reddit | 8 comments

  • List of open models released or updated this week on this sub, just in case you missed one.

    Posted by aifeed-fyi@reddit | LocalLLaMA | View on Reddit | 19 comments

  • Multiple GPUs and supplying power to the PCIe slots

    Posted by hainesk@reddit | LocalLLaMA | View on Reddit | 18 comments

  • Is there any way to make llm convert the english words in my xml file into their meaning in my target language?

    Posted by FatFigFresh@reddit | LocalLLaMA | View on Reddit | 1 comments

  • Folks any views on using LLMs like Gemma 3 12b 27b for Embeddings ?

    Posted by Leather-Departure-38@reddit | LocalLLaMA | View on Reddit | 8 comments

  • Is there any all-in-one app like LM Studio, but with the option of hosting a Web UI server?

    Posted by HRudy94@reddit | LocalLLaMA | View on Reddit | 57 comments

  • Best gpu setup for under $500 usd

    Posted by milesChristi16@reddit | LocalLLaMA | View on Reddit | 74 comments

  • Qwen3 latest and most powerful language model

    Posted by darkpigvirus@reddit | LocalLLaMA | View on Reddit | 6 comments

  • Which (1 or 2-story) frame to use for 7 GPU rig?

    Posted by EmilPi@reddit | LocalLLaMA | View on Reddit | 0 comments

  • LocalScore - Local LLM Benchmark

    Posted by sipjca@reddit | LocalLLaMA | View on Reddit | 18 comments

  • Th AI/LLM race is absolutely insane

    Posted by No-Underscore_s@reddit | LocalLLaMA | View on Reddit | 141 comments

  • Two RTX 6000 Pro Blackwell..what's it get you?

    Posted by SteveRD1@reddit | LocalLLaMA | View on Reddit | 23 comments

  • Struggling with OpenRouter sessions, tried something different

    Posted by Any-Marionberry4035@reddit | LocalLLaMA | View on Reddit | 1 comments

  • Best model for speech to text Transcription for including filler words ?

    Posted by Similar-Camp9685@reddit | LocalLLaMA | View on Reddit | 1 comments

  • Title: Is Anthropic’s new restriction really about national security, or just protecting market share?

    Posted by LuozhuZhang@reddit | LocalLLaMA | View on Reddit | 26 comments

  • Anyone here with an AMD AI Max+ 395 + 128GB setup running coding agents?

    Posted by Admirable_Reality281@reddit | LocalLLaMA | View on Reddit | 36 comments

  • Anyone else been using the new nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 model?

    Posted by kevin_1994@reddit | LocalLLaMA | View on Reddit | 28 comments

  • Seems new model qwen 3 max preview is already available on qwen chat

    Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 8 comments

  • PC for local LLM inference/GenAI development

    Posted by JMarinG@reddit | LocalLLaMA | View on Reddit | 4 comments

  • Looking to buy a 2nd laptop

    Posted by SilverRegion9394@reddit | LocalLLaMA | View on Reddit | 4 comments

  • What is the best inference model you have tried at 64gb VRAM and 128gb VRAM?

    Posted by seoulsrvr@reddit | LocalLLaMA | View on Reddit | 5 comments

  • Current best uncensored model?

    Posted by Accomplished-Feed568@reddit | LocalLLaMA | View on Reddit | 209 comments

  • VibeVoice quantized to 4 bit and 8 bit with some code to run it...

    Posted by teachersecret@reddit | LocalLLaMA | View on Reddit | 21 comments

  • What is the name of that tool??? [HELP]

    Posted by Vaguely_Smart_Cookie@reddit | LocalLLaMA | View on Reddit | 2 comments

  • Environments Hub walkthrough: Your Language Model needs better (open) environments to learn

    Posted by anakin_87@reddit | LocalLLaMA | View on Reddit | 0 comments

  • Looking for ressoruces and team for AGI

    Posted by LahmeriMohamed@reddit | LocalLLaMA | View on Reddit | 8 comments

  • I've made some fun demos using the new kimi-k2-0905

    Posted by Dr_Karminski@reddit | LocalLLaMA | View on Reddit | 21 comments

Next