Down_The_Rabbithole
Per-Layer Embeddings: A simple explanation of the magic behind the small Gemma 4 models
Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 68 comments
A new paper demonstrates that LLMs could "think" in latent space, effectively decoupling internal reasoning from visible context tokens. This breakthrough suggests that even smaller models can achieve remarkable performance without relying on extensive context windows.
Posted by tehbangere@reddit | LocalLLaMA | View on Reddit | 305 comments
Down_The_Rabbithole@reddit
Bullshit Benchmark - A benchmark for testing whether models identify and push back on nonsensical prompts instead of confidently answering them
Posted by bot_exe@reddit | LocalLLaMA | View on Reddit | 33 comments
Down_The_Rabbithole@reddit
Minimum viable LLM
Posted by Down_The_Rabbithole@reddit | LocalLLaMA | View on Reddit | 30 comments
Down_The_Rabbithole@reddit (OP)
Artificial Analysis: South Korea π°π· is now the clear #3 nation in AI β powered by the Korean National Sovereign AI Initiative there are now multiple Korean AI labs with near frontier intelligence.
Posted by self-fix@reddit | LocalLLaMA | View on Reddit | 59 comments
Down_The_Rabbithole@reddit
ASUS Rumored To Enter DRAM Market Next Year
Posted by Highwaytothebeach@reddit | LocalLLaMA | View on Reddit | 37 comments
Down_The_Rabbithole@reddit
Nvidia DGX Station GB300 784GB available now! 95,000 USD / 80,000 EUR
Posted by GPTshop@reddit | LocalLLaMA | View on Reddit | 331 comments
Down_The_Rabbithole@reddit
Nvidia DGX Station GB300 784GB available now! 95,000 USD / 80,000 EUR
Posted by GPTshop@reddit | LocalLLaMA | View on Reddit | 331 comments
Down_The_Rabbithole@reddit
WTF are these AI companies doing where they supposedly are the cause of the ram price spike?
Posted by Red_Redditor_Reddit@reddit | LocalLLaMA | View on Reddit | 430 comments
Down_The_Rabbithole@reddit
GigaChat3-702B-A36B-preview is now available on Hugging Face
Posted by Any-Ship9886@reddit | LocalLLaMA | View on Reddit | 89 comments
Down_The_Rabbithole@reddit
US Cloud Giants to Spend ~8.16Γ What China Does in 2025β27 β $1.7 Trillion vs $210 Billion, Will it translate to stronger US AI dominance?
Posted by abdouhlili@reddit | LocalLLaMA | View on Reddit | 169 comments
Down_The_Rabbithole@reddit
Meta chief AI scientist Yann LeCun plans to exit to launch startup, FT reports
Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 43 comments
Down_The_Rabbithole@reddit
Server DRAM prices surge up to 50% as AI-induced memory shortage hits hyperscaler supply β U.S. and Chinese customers only getting 70% order fulfillment
Posted by IonizedRay@reddit | LocalLLaMA | View on Reddit | 62 comments
Down_The_Rabbithole@reddit
Server DRAM prices surge up to 50% as AI-induced memory shortage hits hyperscaler supply β U.S. and Chinese customers only getting 70% order fulfillment
Posted by IonizedRay@reddit | LocalLLaMA | View on Reddit | 62 comments
Down_The_Rabbithole@reddit
Server DRAM prices surge up to 50% as AI-induced memory shortage hits hyperscaler supply β U.S. and Chinese customers only getting 70% order fulfillment
Posted by IonizedRay@reddit | LocalLLaMA | View on Reddit | 62 comments
Down_The_Rabbithole@reddit
If You Want to Understand Why Llama Models Flopped, Zuck is the Cause!
Posted by Iory1998@reddit | LocalLLaMA | View on Reddit | 213 comments
Down_The_Rabbithole@reddit
If You Want to Understand Why Llama Models Flopped, Zuck is the Cause!
Posted by Iory1998@reddit | LocalLLaMA | View on Reddit | 213 comments
Down_The_Rabbithole@reddit
If You Want to Understand Why Llama Models Flopped, Zuck is the Cause!
Posted by Iory1998@reddit | LocalLLaMA | View on Reddit | 213 comments
Down_The_Rabbithole@reddit
If You Want to Understand Why Llama Models Flopped, Zuck is the Cause!
Posted by Iory1998@reddit | LocalLLaMA | View on Reddit | 213 comments
Down_The_Rabbithole@reddit
Gemma 4
Posted by Brave-Hold-9389@reddit | LocalLLaMA | View on Reddit | 73 comments
Down_The_Rabbithole@reddit
Stanford just dropped 5.5hrs worth of lectures on foundational LLM knowledge
Posted by igorwarzocha@reddit | LocalLLaMA | View on Reddit | 74 comments
Down_The_Rabbithole@reddit
Good ol gpu heat
Posted by animal_hoarder@reddit | LocalLLaMA | View on Reddit | 38 comments
Down_The_Rabbithole@reddit
Intel Arc Pro B60 24GB professional GPU listed at $599, in stock and shipping
Posted by PhantomWolf83@reddit | LocalLLaMA | View on Reddit | 171 comments
Down_The_Rabbithole@reddit
Is this real? 14b coder.
Posted by Relative_Ad_9881@reddit | LocalLLaMA | View on Reddit | 43 comments
Down_The_Rabbithole@reddit
Is this real? 14b coder.
Posted by Relative_Ad_9881@reddit | LocalLLaMA | View on Reddit | 43 comments
Down_The_Rabbithole@reddit
4x 3090 local ai workstation
Posted by monoidconcat@reddit | LocalLLaMA | View on Reddit | 247 comments
Down_The_Rabbithole@reddit
PSA for Ollama Users: Your Context Length Might Be Lower Than You Think
Posted by gpt872323@reddit | LocalLLaMA | View on Reddit | 55 comments
Down_The_Rabbithole@reddit
Renting GPUs is hilariously cheap
Posted by -p-e-w-@reddit | LocalLLaMA | View on Reddit | 398 comments
Down_The_Rabbithole@reddit
Can 2 RTX 6000 Pros (2X98GB vram) rival Sonnet 4 or Opus 4?
Posted by devshore@reddit | LocalLLaMA | View on Reddit | 222 comments
Down_The_Rabbithole@reddit
Deepseek changes their API price again
Posted by Pro-editor-1105@reddit | LocalLLaMA | View on Reddit | 37 comments
Down_The_Rabbithole@reddit
When will low-cost Chinese GPUs hit the market?
Posted by noellarkin@reddit | LocalLLaMA | View on Reddit | 97 comments
Down_The_Rabbithole@reddit
There are at least 15 open source models I could find that can be run on a consumer GPU and which are better than Grok 2 (according to Artificial Analysis)
Posted by obvithrowaway34434@reddit | LocalLLaMA | View on Reddit | 117 comments
Down_The_Rabbithole@reddit
grok 2 weights
Posted by HatEducational9965@reddit | LocalLLaMA | View on Reddit | 201 comments
Down_The_Rabbithole@reddit
When will low-cost Chinese GPUs hit the market?
Posted by noellarkin@reddit | LocalLLaMA | View on Reddit | 97 comments
Down_The_Rabbithole@reddit
When will low-cost Chinese GPUs hit the market?
Posted by noellarkin@reddit | LocalLLaMA | View on Reddit | 97 comments
Down_The_Rabbithole@reddit
When will low-cost Chinese GPUs hit the market?
Posted by noellarkin@reddit | LocalLLaMA | View on Reddit | 97 comments
Down_The_Rabbithole@reddit
Wow anthropic and Google losing coding share bc of qwen 3 coder
Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 128 comments
Down_The_Rabbithole@reddit
Wow anthropic and Google losing coding share bc of qwen 3 coder
Posted by Independent-Wind4462@reddit | LocalLLaMA | View on Reddit | 128 comments
Down_The_Rabbithole@reddit
LocalLLaMA is the last sane place to discuss LLMs on this site, I swear
Posted by ForsookComparison@reddit | LocalLLaMA | View on Reddit | 213 comments
Down_The_Rabbithole@reddit
ollama
Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 340 comments
Down_The_Rabbithole@reddit
I'm disappointed with GPT-5
Posted by Dr_Karminski@reddit | LocalLLaMA | View on Reddit | 128 comments
Down_The_Rabbithole@reddit
Elon Musk says that xAI will make Grok 2 open source next week
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 214 comments
Down_The_Rabbithole@reddit
Elon Musk says that xAI will make Grok 2 open source next week
Posted by Nunki08@reddit | LocalLLaMA | View on Reddit | 214 comments
Down_The_Rabbithole@reddit
Gemini 2.5 Deep Think mode benchmarks!
Posted by Beautiful-Essay1945@reddit | LocalLLaMA | View on Reddit | 72 comments
Down_The_Rabbithole@reddit
4B models are consistently overlooked. Runs Locally and Crushes It. Reasoning for UI, Mobile, Software and Frontend design.
Posted by smirkishere@reddit | LocalLLaMA | View on Reddit | 80 comments
Down_The_Rabbithole@reddit
One yearβs benchmark progress: comparing Sonnet 3.5 with open weight 2025 non-thinking models
Posted by nomorebuttsplz@reddit | LocalLLaMA | View on Reddit | 36 comments
Down_The_Rabbithole@reddit
Introducing the world's most powerful model
Posted by eastwindtoday@reddit | LocalLLaMA | View on Reddit | 199 comments
Down_The_Rabbithole@reddit
Llama 4 Benchmarks
Posted by Ravencloud007@reddit | LocalLLaMA | View on Reddit | 139 comments
Down_The_Rabbithole@reddit
Is it worth spending so much time and money on small LLMs?
Posted by ML-Future@reddit | LocalLLaMA | View on Reddit | 79 comments
Down_The_Rabbithole@reddit
Are there any LLMs with less than 1m parameters?
Posted by UselessSoftware@reddit | LocalLLaMA | View on Reddit | 73 comments