125 tok/s for Qwen3.6 q4xl on 2x 4060ti is insane perf/dollar

Posted by Chuyito@reddit | LocalLLaMA | View on Reddit | 95 comments

Under $1000 for 32gb vram from 2023, and \~300 watts draw... and this thing is outperforming the latest pick-your-vendor $5k mini pcs from 2026.

So.. next question is can I make it squeeze 150 t/s with the same q4xl on cuda 13.3 this weekend. Anyone try it yet?