RTX 5070 Ti (new) vs RTX 3090 / 3090 Ti (used) for LLM inference + clustering

Posted by FeiX7@reddit | LocalLLaMA | 16 comments

I'm thinking of getting one of them (or two, to run as a cluster).
I need it purely for LLM inference.
Both cost the same in my country.

The bigger the models I can fit and the faster I can run them, the better.
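To make "fit" concrete, here is a rough back-of-the-envelope sketch, not a benchmark. It assumes weight size is simply parameters times bits per weight, adds a flat 2 GB allowance for KV cache and runtime buffers (a made-up placeholder, since the real figure depends on context length and backend), and treats two cards as one pooled block of VRAM (which ignores per-GPU overhead). The VRAM sizes are the cards' stock 16 GB (5070 Ti) and 24 GB (3090 / 3090 Ti). The function names are just for illustration.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         vram_gb: float, overhead_gb: float = 2.0) -> bool:
    """True if weights plus an ASSUMED flat overhead fit in the given VRAM."""
    return weight_gb(params_billion, bits_per_weight) + overhead_gb <= vram_gb


# Stock VRAM per card in GB.
CARDS = {"RTX 5070 Ti": 16, "RTX 3090 / 3090 Ti": 24}

if __name__ == "__main__":
    for name, vram in CARDS.items():
        for count in (1, 2):
            for bits in (4, 8):
                # Assumption: VRAM of multiple cards pools perfectly.
                ok = fits(27, bits, vram * count)
                print(f"{count}x {name}, 27B @ {bits}-bit: "
                      f"{'fits' if ok else 'does not fit'}")
```

Real quantized files are usually somewhat larger than the nominal bit width suggests, and long contexts need more than 2 GB, so treat any "fits" result near the limit as doubtful.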

I'm leaning toward getting a 5070 Ti and adding a second one later, but if the 3090 offers better value per dollar, I'd rather pick that.
So please share your opinions on this.

(I'm currently on AMD. I run Qwen3.5 27B and it is SOOO slow, so I need faster inference.)