Does anyone here remember EleutherAI with GPT-NeoX-20B? Or BigScience BLOOM 176B?

Posted by Mr_Moonsilver@reddit | LocalLLaMA | 18 comments

Those were the days... even before Llama and Mistral 7B, or the first DeepSeek-Coder (7B and 33B), or the WizardLM models with their 16k context windows... man, I feel like an OG even though this was only some 3 or 4 years ago. Things have come a long way. What were your favourites?