Meta/Llama

Posted by Additional_Test_758@reddit | LocalLLaMA | View on Reddit | 6 comments

It's been a while since Meta released a new model...

Reply to Post

6 Comments

[-]

Heralax_Tekran@reddit

+1. LLM researchers' brains can't be doing inference all the time.

[-]

I like how they release multiple sizes, but what if they put all their efforts for data and compute into a single model without distributing it among several? Like pick a reasonable size, 14b (like the new Qwen 2.5) or 22b (Mistral Small 2409), and just throw everything at it. I just want to see how much of a leap it would be for that single model size given meta's absurd amount of H100's.

[-]

Ill_Satisfaction_865@reddit

They might release it next Wednesday at Meta Connect 2024. They have a Llama section in their program: [https://www.meta.com/en-gb/connect/program](https://www.meta.com/en-gb/connect/program)

[-]

PoemPrestigious3834@reddit

I wish they made a MOE model just to see how good they can get it to perform

[-]

ArsNeph@reddit

They're totally not going to release a Multimodal Llama 3.5 at Meta Connect or anything 😏