Llama 4 - Scout: best quantization resource and comparison to Llama 3.3
Posted by silenceimpaired@reddit | LocalLLaMA | View on Reddit | 14 comments
The two primary resources I’ve seen to get for Scout (GGUF for us GPU poor), seems to be Unsloth and Bartowski… both of which seems to do something non-traditional compared to density models like Llama 70b 3.3. So which one is the best or am I missing one? At first blush Bartowski seems to perform better but then again my first attempt with Unsloth was a smaller quant… so I’m curious what others think.
Then for llama 3.3 vs scout it seems comparable with maybe llama 3.3 having better performance and scout definitely far faster at the same performance.
14 Comments
silenceimpaired@reddit (OP)
crantob@reddit
deathcom65@reddit
x0wl@reddit
silenceimpaired@reddit (OP)
frivolousfidget@reddit
silenceimpaired@reddit (OP)
frivolousfidget@reddit
x0wl@reddit
x0wl@reddit
silenceimpaired@reddit (OP)
x0wl@reddit
silenceimpaired@reddit (OP)
Bobcotelli@reddit