More quantization visualization types (repost)

Posted by copingmechanism@reddit | LocalLLaMA

Inspired by this post from u/VoidAlchemy a few months back: [https://old.reddit.com/r/LocalLLaMA/comments/1opeu1w/visualizing\_quantization\_types/](https://old.reddit.com/r/LocalLLaMA/comments/1opeu1w/visualizing_quantization_types/)

Intrusive thoughts had me try to reproduce and extend that work to cover more quantization types, with and without imatrix, plus some PPL/KLD measurements to see what an "efficient" quantization looks like. MXFP4 really doesn't like to participate in this sort of experiment, so I don't have much faith that this is a very accurate representation of the quant, but oh well.

The (vibe) code for this is here: [https://codeberg.org/mailhost/quant-jaunt](https://codeberg.org/mailhost/quant-jaunt), along with a sample of summary output (from lenna.bmp) and some specifications that might help keep the vibes on track.

\*reposted to respect Lenna's retirement
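For anyone unfamiliar with the PPL/KLD numbers mentioned above, here is a minimal sketch of how the two metrics are usually defined, computed from per-token logits. This is not the quant-jaunt code and not how any particular tool implements it: the function names (`log_softmax`, `perplexity`, `mean_kld`) are made up for illustration, and the logits are random toy data standing in for a reference model and its quantized copy.

```python
import numpy as np

def log_softmax(logits):
    # Numerically stable log-softmax over the last (vocabulary) axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))

def perplexity(logits, targets):
    # PPL = exp(mean negative log-likelihood of the target tokens).
    logp = log_softmax(logits)
    nll = -logp[np.arange(len(targets)), targets]
    return float(np.exp(nll.mean()))

def mean_kld(ref_logits, quant_logits):
    # Mean over token positions of KL(P_ref || P_quant).
    ref_logp = log_softmax(ref_logits)
    quant_logp = log_softmax(quant_logits)
    kld = (np.exp(ref_logp) * (ref_logp - quant_logp)).sum(axis=-1)
    return float(kld.mean())

# Toy stand-ins (assumption): 8 token positions, vocabulary of 16.
rng = np.random.default_rng(0)
ref = rng.normal(size=(8, 16))                         # "full precision" logits
quant = ref + rng.normal(scale=0.05, size=ref.shape)   # "quantized" logits
targets = ref.argmax(axis=-1)                          # pretend ground-truth tokens

print("PPL ref  :", perplexity(ref, targets))
print("PPL quant:", perplexity(quant, targets))
print("mean KLD :", mean_kld(ref, quant))
```

Lower is better for both. KLD is zero only when the quantized model's token distribution matches the reference exactly, which is why it tends to be a more direct measure of quantization damage than PPL alone.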