More quantization visualization types (repost)
Posted by copingmechanism@reddit | LocalLLaMA | View on Reddit | 51 comments
Inspired by this post from u/VoidAlchemy a few months back: [https://old.reddit.com/r/LocalLLaMA/comments/1opeu1w/visualizing\_quantization\_types/](https://old.reddit.com/r/LocalLLaMA/comments/1opeu1w/visualizing_quantization_types/)
Intrusive thoughts had me try to reproduce and extend the work to include more quantization types, with/without imatrix, and some PPL/KLD measurements to see what an "efficient" quantization looks like. MXFP4 really doesn't like to participate in this sort of experiment, I don't have much faith this is a very accurate representation of the quant but oh-well.
The (vibe) code for this is here [https://codeberg.org/mailhost/quant-jaunt](https://codeberg.org/mailhost/quant-jaunt) along with a sample of summary output (from lenna.bmp) and some specifications that might help keep the vibes on track.
\*reposted to respect Lenna's retirement
51 Comments
jhov94@reddit
netikas@reddit
ghulamalchik@reddit
Badger-Purple@reddit
llama-impersonator@reddit
stddealer@reddit
llama-impersonator@reddit
stddealer@reddit
llama-impersonator@reddit
stddealer@reddit
llama-impersonator@reddit
netikas@reddit
llama-impersonator@reddit
stddealer@reddit
netikas@reddit
Holiday_Purpose_3166@reddit
VoidAlchemy@reddit
jhov94@reddit
Holiday_Purpose_3166@reddit
robiinn@reddit
jhov94@reddit
llama-impersonator@reddit
netikas@reddit
llama-impersonator@reddit
FriskyFennecFox@reddit
Adventurous_Cat_1559@reddit
stddealer@reddit
Adventurous_Cat_1559@reddit
stddealer@reddit
croninsiglos@reddit
Firepal64@reddit
croninsiglos@reddit
Mickenfox@reddit
Eastern-Group-1993@reddit
No_Afternoon_4260@reddit
Midaychi@reddit
copingmechanism@reddit (OP)
Cubixmeister@reddit
LaFllamme@reddit
gradient8@reddit
mivog49274@reddit
TitwitMuffbiscuit@reddit
mivog49274@reddit
TitwitMuffbiscuit@reddit
siegevjorn@reddit
angelin1978@reddit
AbheekG@reddit
audioen@reddit
yensteel@reddit
ilintar@reddit
MizantropaMiskretulo@reddit