I BUILT MY FIRST MODEL FROM SCRATCH

Posted by volious-ka@reddit | LocalLLaMA

Sup, I'm Crownelius. I made that popular Opus distill dataset.

TODAY YOU ARE INTRODUCED TO SHARD, a 40M-parameter malformed LLM.

Right now I'm working on a series of tiny LLMs, with the goal of running a coherent model for IoT tasks. I've researched atomic models, and while doing that I came across a project called Compact AI. Since joining them, I've learned a lot and even made my own model from scratch.

The model is available here: CompactAI-O (HF Organization)

About my model, named "Shard" (I call it Scamp).