MMLU: NeuralDaredevil 8B Abliterated vs Abliterated 70B Llama 3
Posted by My_Unbiased_Opinion@reddit | LocalLLaMA | 8 comments
I have been extremely impressed with NeuralDaredevil Llama 3 8B Abliterated. I'm running it at Q8, and apparently its MMLU is about 71.8.
I also tried running the abliterated 3.5 70B Llama 3. It is good, but I can only run it at IQ2_XXS on my 3090. Judging by the GitHub page on how quants affect the 70B, its MMLU ends up being around 72 as well.
My question is: has anyone put them through their paces and compared the two? I am not a programmer, so I haven't tested code, but I notice the 8B model acts more "human-like". It's hard to explain.
I can push the 8B model to a 16k+ context, but I can only get 4k context on the 70B model.
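For anyone wondering why the context budget differs so much on a 24 GB card, here is a rough back-of-the-envelope sketch. It is not a measurement of either model. The numbers in it are assumptions, not figures from this post: parameter counts of roughly 8.03B and 70.6B, about 8.5 bits per weight for Q8_0 and about 2.06 for IQ2_XXS, the Llama 3 attention shapes (32 or 80 layers, 8 KV heads, head size 128), and an fp16 KV cache. It also ignores compute buffers, mixed-precision tensors inside the file, and anything else already sitting in VRAM, so real usage will be higher.

```python
# Rough VRAM estimate: quantized weights plus an fp16 KV cache.
# All model figures below are assumptions for illustration, not measurements.

def weights_gb(params_billion, bits_per_weight):
    """Approximate size of the quantized weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def kv_cache_gb(n_layers, n_kv_heads, head_dim, context, bytes_per_value=2):
    """Approximate size of the K and V caches in GB for one sequence."""
    # Factor 2 covers both the key tensor and the value tensor per layer.
    per_token_bytes = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value
    return per_token_bytes * context / 1e9


# Hypothetical configurations mirroring the two setups described above.
SETUPS = {
    "8B @ Q8_0, 16k ctx": dict(params=8.03, bpw=8.5, layers=32, ctx=16384),
    "70B @ IQ2_XXS, 4k ctx": dict(params=70.6, bpw=2.06, layers=80, ctx=4096),
}


def estimate_total_gb(setup):
    """Weights plus KV cache, ignoring compute buffers and other overhead."""
    w = weights_gb(setup["params"], setup["bpw"])
    kv = kv_cache_gb(setup["layers"], 8, 128, setup["ctx"])
    return w + kv


if __name__ == "__main__":
    for name, setup in SETUPS.items():
        print(f"{name}: ~{estimate_total_gb(setup):.1f} GB before overhead")
```

Under these assumptions the 70B weights alone take most of the card, and each token of context costs 2.5 times as much cache as on the 8B (80 layers versus 32), which is consistent with the 70B running out of room much sooner.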
Any thoughts? I would like a NeuralDaredevil 70B model based on Llama 3.
8 Comments
Enfiznar@reddit
durden111111@reddit
matteogeniaccio@reddit
My_Unbiased_Opinion@reddit (OP)
matteogeniaccio@reddit
My_Unbiased_Opinion@reddit (OP)
matteogeniaccio@reddit
k4ch0w@reddit