200+ pages of Hugging Face secrets on how to train an LLM
Posted by eliebakk@reddit | LocalLLaMA | 101 comments
Hey, it's Elie from the Hugging Face pre-training team! We're very excited to share our new blog (book?) that covers the full pipeline: pre-training, post-training, and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably :)
[https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook](https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook)
Hope y'all enjoy it, and don't hesitate to leave feedback on the community tab :)