Xhehab_
OpenAI GPT OSS: 21B & 117B models (3.6B & 5.1B active)
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 8 comments
Qwen-Image — a 20B MMDiT model
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 24 comments
Xhehab_@reddit (OP)
Qwen3- Coder 👀
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 202 comments
Xhehab_@reddit (OP)
Qwen3- Coder 👀
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 202 comments
Xhehab_@reddit (OP)
Qwen3- Coder 👀
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 202 comments
Xhehab_@reddit (OP)
What's the smartest tiny LLM you've actually used?
Posted by Luston03@reddit | LocalLLaMA | View on Reddit | 128 comments
Xhehab_@reddit
DeepSeek R1 0528 Hits 71% (+14.5 pts from R1) on Aider Polyglot Coding Leaderboard
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 108 comments
Xhehab_@reddit (OP)
DeepSeek-R1-0528 Official Benchmarks Released!!!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 155 comments
Xhehab_@reddit (OP)
DeepSeek-R1-0528 Official Benchmarks Released!!!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 155 comments
Xhehab_@reddit (OP)
I think I found llama 4 - the "cybele" model on lmarena. It's very, very good and revealed it name ☺️
Posted by Salty-Garage7777@reddit | LocalLLaMA | View on Reddit | 60 comments
Xhehab_@reddit
Mistral’s new “Flash Answers”
Posted by According_to_Mission@reddit | LocalLLaMA | View on Reddit | 73 comments
Xhehab_@reddit
Llama 4 is going to be SOTA
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 254 comments
Xhehab_@reddit (OP)
Llama 4 is going to be SOTA
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 254 comments
Xhehab_@reddit (OP)
Llama 4 is going to be SOTA
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 254 comments
Xhehab_@reddit (OP)
Llama 4 is going to be SOTA
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 254 comments
Xhehab_@reddit (OP)
ROCM vs CUDA in September 2023?
Posted by tronathan@reddit | LocalLLaMA | View on Reddit | 5 comments
Xhehab_@reddit
KoboldCpp 1.79 - Now with Shared Multiplayer, Ollama API emulation, ComfyUI API emulation, and speculative decoding
Posted by HadesThrowaway@reddit | LocalLLaMA | View on Reddit | 94 comments
Xhehab_@reddit
Tülu 3 -- a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 42 comments
Xhehab_@reddit (OP)
Tülu 3 -- a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 42 comments
Xhehab_@reddit (OP)
Cohere releases Aya Expanse multilingual AI model family
Posted by umarmnaq@reddit | LocalLLaMA | View on Reddit | 40 comments
Xhehab_@reddit
IBM Granite 3.0 Models
Posted by AaronFeng47@reddit | LocalLLaMA | View on Reddit | 51 comments
Xhehab_@reddit
Is it possible to achieve very long (100,000+) token outputs?
Posted by CH1997H@reddit | LocalLLaMA | View on Reddit | 66 comments
Xhehab_@reddit
NVIDIA's latest model, Llama-3.1-Nemotron-70B is now available on HuggingChat!
Posted by SensitiveCranberry@reddit | LocalLLaMA | View on Reddit | 134 comments
Xhehab_@reddit
Benchmark Your LLM Against Korea’s Most Challenging Exam!
Posted by Working_Original9624@reddit | LocalLLaMA | View on Reddit | 30 comments
Xhehab_@reddit
Is it possible to run some simple LLM (e.g. llama2) using very low amounts of RAM (e.g. 16MB)?
Posted by galapag0@reddit | LocalLLaMA | View on Reddit | 28 comments
Xhehab_@reddit
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching [Best OS TTS Yet!]
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 73 comments
Xhehab_@reddit (OP)
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching [Best OS TTS Yet!]
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 73 comments
Xhehab_@reddit (OP)
Local LLama 3.2 on iPhone 13
Posted by upquarkspin@reddit | LocalLLaMA | View on Reddit | 78 comments
Xhehab_@reddit
Local LLama 3.2 on iPhone 13
Posted by upquarkspin@reddit | LocalLLaMA | View on Reddit | 78 comments
Xhehab_@reddit
OLMoE 7B is fast on low-end GPU and CPU
Posted by dsjlee@reddit | LocalLLaMA | View on Reddit | 28 comments
Xhehab_@reddit
Qwen2.5 7B chat GGUF quantization Evaluation results
Posted by AaronFeng47@reddit | LocalLLaMA | View on Reddit | 39 comments
Xhehab_@reddit
Best LLM to locally host and run
Posted by imedmactavish@reddit | LocalLLaMA | View on Reddit | 23 comments
Xhehab_@reddit
Qwen2-Vl-2B and Qwen2-VL-7B under Apache 2.0 license released!!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 5 comments
Xhehab_@reddit (OP)
Gemini 1.5 Flash 8b,
Posted by Optifnolinalgebdirec@reddit | LocalLLaMA | View on Reddit | 40 comments
Xhehab_@reddit
Phi 3.5 Finetuning 2x faster + Llamafied for more accuracy
Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 65 comments
Xhehab_@reddit
Mistral Nemo is really good... But ignores simple instructions?
Posted by Majestical-psyche@reddit | LocalLLaMA | View on Reddit | 24 comments
Xhehab_@reddit
Did Kyutai ever released their models as promised?
Posted by keepthepace@reddit | LocalLLaMA | View on Reddit | 3 comments
Xhehab_@reddit
Zamba2-2.7B > Outperforms Phi2 2.7B, Danube3 4B, and StableLM 3B
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 14 comments
Xhehab_@reddit (OP)
Zamba2-2.7B > Outperforms Phi2 2.7B, Danube3 4B, and StableLM 3B
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 14 comments
Xhehab_@reddit (OP)
Lllama 3 takes no.3 on Chatbot Arena; 70B no. 9
Posted by Amgadoz@reddit | LocalLLaMA | View on Reddit | 76 comments
Xhehab_@reddit
What’s the fastest, smallest, smartest LLM today? (3b or less)
Posted by triplepicklepants@reddit | LocalLLaMA | View on Reddit | 89 comments
Xhehab_@reddit
Android frontend for ollama/other apis
Posted by Omnic19@reddit | LocalLLaMA | View on Reddit | 6 comments
Xhehab_@reddit
What's next after llama3 failure?
Posted by FluffyMacho@reddit | LocalLLaMA | View on Reddit | 60 comments
Xhehab_@reddit
What's next after llama3 failure?
Posted by FluffyMacho@reddit | LocalLLaMA | View on Reddit | 60 comments
Xhehab_@reddit
Scale AI are introducing high quality arenas, with... - private datasets (=can't be gamed) - paid annotators for the rankings (=fairer and higher quality annotations)
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 34 comments
Xhehab_@reddit (OP)
Scale AI are introducing high quality arenas, with... - private datasets (=can't be gamed) - paid annotators for the rankings (=fairer and higher quality annotations)
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 34 comments
Xhehab_@reddit (OP)
gpt2-chatbot might be Phi-3 14B (medium)!! Dropping in a couple weeks with 7B (small) too!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 90 comments
Xhehab_@reddit (OP)
gpt2-chatbot might be Phi-3 14B (medium)!! Dropping in a couple weeks with 7B (small) too!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 90 comments
Xhehab_@reddit (OP)
gpt2-chatbot might be Phi-3 14B (medium)!! Dropping in a couple weeks with 7B (small) too!
Posted by Xhehab_@reddit | LocalLLaMA | View on Reddit | 90 comments