Advice needed on eGPU and Mini PC

Posted by Kulidc@reddit | LocalLLaMA | 21 comments

Hi all, I have come across a relatively niche problem and could not find many useful posts or guides about it.

I have a mini PC (Beelink SER8, 8745HS, 32GB DDR5-5600 SODIMM) that runs as a headless server hosting some routing services. I am wondering whether I could buy an external GPU dock and a new GPU, connect them through the USB4 port (about 40 Gb/s) or through OCuLink from the spare SSD slot (PCIe 4.0 x4, about 64 Gb/s), and have the machine also serve as a coding agent or small assistant.
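To put the two interfaces side by side, here is a rough back-of-the-envelope sketch. It only uses the nominal link rates quoted above plus the 128b/130b encoding of PCIe 4.0 (16 GT/s per lane); the 16 GB payload is a made-up figure for illustration, and real throughput will be lower, especially over USB4 where PCIe traffic is tunneled.

```python
# Nominal link rates from the post, in gigabits per second.
LINKS_GBIT = {"USB4": 40.0, "OCuLink (PCIe 4.0 x4)": 64.0}


def pcie_usable_gbit(gt_per_lane: float, lanes: int) -> float:
    """Line rate after 128b/130b encoding (used by PCIe 3.0 and later)."""
    return gt_per_lane * lanes * 128 / 130


def transfer_seconds(size_gb: float, link_gbit: float) -> float:
    """Best-case time to move size_gb gigabytes over a link_gbit link."""
    return size_gb * 8 / link_gbit


if __name__ == "__main__":
    payload_gb = 16.0  # hypothetical size of the model weights, in GB
    for name, rate in LINKS_GBIT.items():
        secs = transfer_seconds(payload_gb, rate)
        print(f"{name}: at least {secs:.1f} s to move {payload_gb:.0f} GB")
    usable = pcie_usable_gbit(16, 4)
    print(f"PCIe 4.0 x4 after encoding: {usable:.1f} Gb/s")
```

If the whole model sits in VRAM, this mostly affects load time rather than generation speed, since little data crosses the link per generated token.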

I would prefer 32GB of VRAM, such as the AI PRO R9700 (cheap but ROCm, which is a ) or the RTX Pro 4500, for serving Qwen 3.6 27B AWQ at 4 or 6 bit in vLLM.
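As a sanity check on the 32GB target, here is a minimal estimate of the weight footprint. It assumes a flat 27 billion parameters and ignores quantization scales, activations and the KV cache, so treat the numbers as a lower bound rather than what vLLM will actually allocate.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB (1 GB = 1e9 bytes).

    Ignores per-group scales and zero points, so the real
    footprint of a quantized checkpoint is somewhat larger.
    """
    return params_billion * bits_per_weight / 8


if __name__ == "__main__":
    vram_gb = 32  # target card capacity from the post
    for bits in (4, 6):
        weights = weight_gb(27, bits)
        headroom = vram_gb - weights
        print(f"{bits}-bit: ~{weights:.1f} GB weights, "
              f"~{headroom:.1f} GB left for KV cache and overhead")
```

Whatever is left after the weights is what the KV cache has to fit into, so the 6-bit option trades context length for quality.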

I will not consider MoE models like the Qwen 3.6 A35B-A3B with CPU offloading because of the connection interface, nor will I consider a 5090 because of its large size, heat output and high power draw (I do not want my house to burn down because of the connector).

Am I missing anything important here, apart from the interface and offloading?

Could anyone share a similar experience setting up an eGPU on Ubuntu?