GPU advice for Qwen 3.5 27B / Gemma 4 31B (dense) — aiming for 64K ctx, 30+ t/s

Posted by Fit-Courage5400@reddit | LocalLLaMA | View on Reddit | 89 comments

Hey all,

Looking for some real-world advice on GPU choices for running the new dense models — mainly Qwen 3.5 27B and Gemma 4 31B.

What I’m targeting

From what I’ve seen, these dense models are way more demanding than MoE.

Why not MoE?

I’m already running MoE just fine on P40s:

So now I want to move to dense models for better quality / reasoning.

Budget

Options I’m considering

A. 2× 9070 XT (16GB)
B. 1× R9 9700 (32GB)
C. 2× 7900 XTX (24GB)
D. 1× RTX Pro 4000 (24GB)

N. 1× Intel Arc Pro B70 (32GB, maybe future option, but not now)

My current understanding (please correct me)

Concerns (based on what I’ve seen here)

What I want to know

If you’ve actually run Qwen3.5 27B / Gemma 4 31B (dense):

Bonus question

If you had \~$1800 today, would you:

Appreciate any real benchmarks / configs 🙏