isr_431
mistralai/Mistral-Small-24B-Base-2501 · Hugging Face
Posted by Dark_Fire_12@reddit | LocalLLaMA | View on Reddit | 87 comments
Mark Zuckerberg on Llama 4 Training Progress!
Posted by ybdave@reddit | LocalLLaMA | View on Reddit | 90 comments
isr_431@reddit
DeepSeek R1 takes second place on the multi-player benchmark for cooperation, negotiation, and deception.
Posted by zero0_one1@reddit | LocalLLaMA | View on Reddit | 41 comments
isr_431@reddit
Just canceled my OpenAI Plus subscription (for now). Been running DeepSeek-R1 14b locally on my home workstation. I'll probably renew it if OpenAI launches something worthy for Plus tier by then.
Posted by CarbonTail@reddit | LocalLLaMA | View on Reddit | 165 comments
isr_431@reddit
Major changes are coming this year. Buckle up.
Posted by estebansaa@reddit | LocalLLaMA | View on Reddit | 134 comments
isr_431@reddit
What is your method to find good NSFW models? preferably for role playing
Posted by Ok_Appointment2593@reddit | LocalLLaMA | View on Reddit | 188 comments
isr_431@reddit
UGI-Leaderboard Remake! New Political, Coding, and Intelligence benchmarks
Posted by DontPlanToEnd@reddit | LocalLLaMA | View on Reddit | 15 comments
isr_431@reddit
Phi-4 has been released
Posted by paf1138@reddit | LocalLLaMA | View on Reddit | 229 comments
isr_431@reddit
Phi-4 in insanely good at rephrasing the last message for multi-turn rag questions
Posted by LinkSea8324@reddit | LocalLLaMA | View on Reddit | 38 comments
isr_431@reddit
Xiaomi recruits key DeepSeek researcher to lead its AI lab.
Posted by sb5550@reddit | LocalLLaMA | View on Reddit | 18 comments
isr_431@reddit
Dolphin 3.0 !
Posted by Evening_Action6217@reddit | LocalLLaMA | View on Reddit | 54 comments
isr_431@reddit
I don't get it.
Posted by AlgorithmicKing@reddit | LocalLLaMA | View on Reddit | 111 comments
isr_431@reddit
Looks like deepseekv3 API is up
Posted by shing3232@reddit | LocalLLaMA | View on Reddit | 24 comments
isr_431@reddit
Has anyone tested phi4 yet? How does it perform?
Posted by LLMtwink@reddit | LocalLLaMA | View on Reddit | 26 comments
isr_431@reddit
Just installed my first local LLM (Llama3.2)
Posted by garrincha-zg@reddit | LocalLLaMA | View on Reddit | 13 comments
isr_431@reddit
TIL Llama 3.3 can do multiple tool calls and tool composition in a single shot
Posted by zra184@reddit | LocalLLaMA | View on Reddit | 21 comments
isr_431@reddit
Microsoft Phi-4 GGUF available. Download link in the post
Posted by matteogeniaccio@reddit | LocalLLaMA | View on Reddit | 132 comments
isr_431@reddit
Introducing Phi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning
Posted by metalman123@reddit | LocalLLaMA | View on Reddit | 211 comments
isr_431@reddit
Open models wishlist
Posted by hackerllama@reddit | LocalLLaMA | View on Reddit | 238 comments
isr_431@reddit
Ollama has merged in K/V cache quantisation support, halving the memory used by the context
Posted by sammcj@reddit | LocalLLaMA | View on Reddit | 139 comments
isr_431@reddit
Should I get a 14 inch M4 Max 128GB for 123B models?
Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 52 comments
isr_431@reddit
Since things are moving so quickly how do you stay up to date on best current tools and how to use them?
Posted by TryKey925@reddit | LocalLLaMA | View on Reddit | 48 comments
isr_431@reddit
Since things are moving so quickly how do you stay up to date on best current tools and how to use them?
Posted by TryKey925@reddit | LocalLLaMA | View on Reddit | 48 comments
isr_431@reddit
Most intelligent uncensored model under 48GB VRAM?
Posted by PMMEYOURSMIL3@reddit | LocalLLaMA | View on Reddit | 73 comments
isr_431@reddit
Most intelligent uncensored model under 48GB VRAM?
Posted by PMMEYOURSMIL3@reddit | LocalLLaMA | View on Reddit | 73 comments
isr_431@reddit
Most intelligent uncensored model under 48GB VRAM?
Posted by PMMEYOURSMIL3@reddit | LocalLLaMA | View on Reddit | 73 comments
isr_431@reddit
Closed source model size speculation
Posted by redjojovic@reddit | LocalLLaMA | View on Reddit | 22 comments
isr_431@reddit
Someone just created a pull request in llama.cpp for Qwen2VL support!
Posted by Many_SuchCases@reddit | LocalLLaMA | View on Reddit | 36 comments
isr_431@reddit
[Missed Connections] Find Me Very Strange or Unique Models!
Posted by amanda_cat@reddit | LocalLLaMA | View on Reddit | 12 comments
isr_431@reddit
Mistral AI releases (API-only for now it seems) Mistral Large 3 and Pixtral Large
Posted by Vivid_Dot_6405@reddit | LocalLLaMA | View on Reddit | 99 comments
isr_431@reddit
Building a Mini PC for aya-expanse-8b Inference - Recommendations Needed!
Posted by Whiplashorus@reddit | LocalLLaMA | View on Reddit | 35 comments
isr_431@reddit
Building a Mini PC for aya-expanse-8b Inference - Recommendations Needed!
Posted by Whiplashorus@reddit | LocalLLaMA | View on Reddit | 35 comments
isr_431@reddit
Qwen 2.5 7B Added to Livebench, Overtakes Mixtral 8x22B and Claude 3 Haiku
Posted by isr_431@reddit | LocalLLaMA | View on Reddit | 62 comments
isr_431@reddit (OP)
Qwen 2.5 7B Added to Livebench, Overtakes Mixtral 8x22B and Claude 3 Haiku
Posted by isr_431@reddit | LocalLLaMA | View on Reddit | 62 comments
isr_431@reddit (OP)
Your Experience with Small Language Models
Posted by numinouslymusing@reddit | LocalLLaMA | View on Reddit | 34 comments
isr_431@reddit
Why do we not have Loras like Civitai does for diffusion models?
Posted by FesseJerguson@reddit | LocalLLaMA | View on Reddit | 40 comments
isr_431@reddit
Best models under 8GB of VRAM?
Posted by HRudy94@reddit | LocalLLaMA | View on Reddit | 21 comments
isr_431@reddit
Thoughts on Ministral 8B?
Posted by Amgadoz@reddit | LocalLLaMA | View on Reddit | 35 comments
isr_431@reddit
Thoughts on Ministral 8B?
Posted by Amgadoz@reddit | LocalLLaMA | View on Reddit | 35 comments
isr_431@reddit
OpenCoder: open and reproducible code LLM family which matches the performance of Top-Tier Code LLM
Posted by asb@reddit | LocalLLaMA | View on Reddit | 21 comments
isr_431@reddit
what's the cheapest hardware I can run Llama 3.2 11b (image inference) on?
Posted by dirtyring@reddit | LocalLLaMA | View on Reddit | 7 comments
isr_431@reddit
LLM overkill is real: I analyzed 12 benchmarks to find the right-sized model for each use case 🤖
Posted by medi6@reddit | LocalLLaMA | View on Reddit | 80 comments
isr_431@reddit
So where’s Qwen2.5-Coder-32B?
Posted by Balance-@reddit | LocalLLaMA | View on Reddit | 27 comments
isr_431@reddit
Your best 3b model? Llama 3.2, kwen 2.5 or Phi 3.5?
Posted by noaibot@reddit | LocalLLaMA | View on Reddit | 12 comments
isr_431@reddit
Is there anything that beats Mistral-Nemo 12b in coding that's still smaller than a Llama 3.1 70b quant?
Posted by ForsookComparison@reddit | LocalLLaMA | View on Reddit | 30 comments
isr_431@reddit
Meta releases an open version of Google's NotebookLM
Posted by isr_431@reddit | LocalLLaMA | View on Reddit | 135 comments
isr_431@reddit (OP)
Cohere releases Aya Expanse multilingual AI model family
Posted by umarmnaq@reddit | LocalLLaMA | View on Reddit | 40 comments
isr_431@reddit
Best alternative to LM Studio?
Posted by PaytonAndHolyfield@reddit | LocalLLaMA | View on Reddit | 17 comments
isr_431@reddit
Best 3B model nowadays?
Posted by mr_house7@reddit | LocalLLaMA | View on Reddit | 38 comments
isr_431@reddit
Petition to auto-delete anything that mentions Matt Shumer, "Reflection", or any link to his Twitter or any affiliated Twitter accounts (Sahil, etc)
Posted by XMasterrrr@reddit | LocalLLaMA | View on Reddit | 46 comments