If you're using Nvidia's NVFP4 of Qwen3.5-397, try a different quant

Posted by Phaelon74@reddit | LocalLLaMA | View on Reddit | 63 comments

If the quant is working well for you, awesome. It's KLD is quite divergent, and that translates to real intelligence lost. The larger the model, the less this is visible, so if you don't see it, rocksauce. if you do, try Sehyo's NVFP4 or Quantrio's AWQ, which is very accurate.