Can anyone share their qwen 2.5 setup for a 4090 please?

Posted by firemeaway@reddit | LocalLLaMA | View on Reddit | 35 comments

Hi folks,

Totally get there are multiple 4090 related questions but I’ve been struggling to setup qwen2.5 using the oobabooga text-generation webui.

Using the 32b model I get extremely slow responses even at 4bit quantisation.

Anyone willing to share their config that performs best?

Thanks 🙏