eliebakk
-
200+ pages of Hugging Face secrets on how to train an LLM
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 101 comments
-
how to train a tiny model (4B) to prove hard theorems
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 20 comments
-
What MoE model sizes and capabilities are currently missing in the open weight ecosystem?
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 47 comments
-
AMA with Hugging Face Science, the team behind SmolLM, SmolVLM, Fineweb and more.
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 472 comments
-
SmolLM3-3B training logs and intermediate checkpoints
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 23 comments
-
Full open source reproduction of R1 in progress ⏳
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 157 comments
-
SmolLM3: reasoning, long context and multilinguality in only 3B parameters
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 47 comments
-
350k samples to match distilled R1 on *all* benchmarks
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 7 comments
-
Qwen3 training recap 🐦🔥
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 5 comments
-
Gemma3 technical report detailed analysis 💎
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 15 comments
-
7B reasoning model outperforming Claude-3.7 Sonnet on IOI
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 28 comments
-
New Reasoning model (Reka Flash 3 - 21B)
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 27 comments
-
DCLM dataset but better for smol models
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 5 comments
-
Do mechanistic interpretability on your laptop with the new SmolLM2 checkpoints
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 3 comments
-
Training LLM on 1000s of GPUs made simple
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 33 comments
-
Claude Sonnet 3.7 soon
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 107 comments
-
Training on 1000s of GPUs made simple
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 0 comments
-
Deepseek R1 GRPO code open sourced 🤯
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 18 comments
-
First large scale open source math reasoning dataset with 800k R1 reasoning traces
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 8 comments
-
405B MiniMax MoE technical deepdive
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 10 comments
-
Deepseek R1 GRPO code open sourced 🤯
Posted by eliebakk@reddit | LocalLLaMA | View on Reddit | 0 comments