[Benchmark] If you want a portable Strix Halo - here is my test of the Asus ProArt PX13 with Qwen3.5 & Gemma4

Posted by Willing-Toe1942@reddit | LocalLLaMA

I wanted a powerhouse on the go, and after some research and weighing the options I went for the Asus ProArt PX13 (GoPro edition), which is basically Strix Halo (AMD Ryzen AI Max+ 395) with 128 GB of RAM.

This little 13-inch laptop has an amazing form factor with an all-metal body, and it's basically the lightest and most portable thing you can get to run LLMs on the go.

So I immediately removed Windows, installed CachyOS, and started benchmarking in 3 power modes (selected from the GNOME control center). I couldn't wait to share the results with this amazing community :D

Here are the initial Qwen3.5 benchmarks, with noise level and measured temperature (nvtop and amdgpu_top).

[Image: PX13 ProArt]

## command run on llama-vulkan-radv toolbox

llama-bench -m Qwen3.5-35B-A3B-UD-IQ3_XXS.gguf -p 512,1024,2048,4096,8192,16384,32768 -t 512
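
To repeat the same sweep once per power mode, the command above could be wrapped in a small script. This is only a sketch, not how the original runs were done: it assumes the GNOME power-mode selector is backed by power-profiles-daemon, so that `powerprofilesctl set <profile>` works, and that both `powerprofilesctl` and `llama-bench` are reachable from the same shell (inside a toolbox that may not be the case). The profile names and the `sweep` helper are assumptions for illustration. It defaults to a dry run that only prints the commands.

```python
import shlex
import subprocess

MODEL = "Qwen3.5-35B-A3B-UD-IQ3_XXS.gguf"
PROMPT_SIZES = [512, 1024, 2048, 4096, 8192, 16384, 32768]
THREADS = 512  # value taken from the original command

# Assumption: standard power-profiles-daemon profile names.
PROFILES = ["performance", "balanced", "power-saver"]


def bench_command(model, prompt_sizes, threads):
    """Build the llama-bench argument list used in the post."""
    sizes = ",".join(str(n) for n in prompt_sizes)
    return ["llama-bench", "-m", model, "-p", sizes, "-t", str(threads)]


def sweep(profiles, dry_run=True):
    """Print (and optionally run) one benchmark per power profile."""
    commands = []
    for profile in profiles:
        # Switch the power profile, then run the benchmark under it.
        commands.append(["powerprofilesctl", "set", profile])
        commands.append(bench_command(MODEL, PROMPT_SIZES, THREADS))
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    sweep(PROFILES, dry_run=True)
```

Calling `sweep(PROFILES, dry_run=False)` would actually execute the commands; letting the laptop cool down between profiles is left to the reader.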

Application used for power/temperature monitoring: amdgpu_top

Noise measurement: taken with a mobile phone 30 cm away from the laptop (about the distance from your body to the laptop)

Gemma4 benchmarks are baking right now; I will add them here later.

| model | size | params | backend | ngl | threads | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: | ---: |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp512 | 1007.05 ± 11.05 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp1024 | 972.53 ± 6.84 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp2048 | 938.87 ± 3.66 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp4096 | 901.94 ± 5.16 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp8192 | 870.25 ± 2.89 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp16384 | 784.83 ± 2.00 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp32768 | 644.06 ± 5.39 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | tg128 | 69.00 ± 0.28 |

| model | size | params | backend | ngl | threads | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: | ---: |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp512 | 809.28 ± 14.25 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp1024 | 798.39 ± 4.99 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp2048 | 800.93 ± 2.92 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp4096 | 802.36 ± 4.62 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp8192 | 790.08 ± 4.04 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp16384 | 727.97 ± 2.63 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp32768 | 614.02 ± 1.22 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | tg128 | 68.67 ± 0.93 |

| model | size | params | backend | ngl | threads | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: | ---: |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp512 | 725.47 ± 21.19 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp1024 | 727.55 ± 8.75 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp2048 | 707.59 ± 8.67 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp4096 | 673.13 ± 10.74 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp8192 | 610.91 ± 16.36 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp16384 | 488.11 ± 9.62 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | pp32768 | 407.35 ± 12.66 |
| qwen35moe 35B.A3B IQ3_XXS - 3.0625 bpw | 12.17 GiB | 34.66 B | Vulkan | 99 | 512 | tg128 | 55.34 ± 0.13 |
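
To make the three runs easier to compare, here is a small sketch that expresses each run as a percentage of the first one and converts prompt-processing speed into prefill time. The t/s means are copied from the results above in the order they appear; the labels "run 1" to "run 3" and the helper names are placeholders introduced for illustration, since the post does not say which power mode each run corresponds to.

```python
# Mean t/s from the three llama-bench runs above, in the order they appear.
TESTS = ["pp512", "pp1024", "pp2048", "pp4096",
         "pp8192", "pp16384", "pp32768", "tg128"]
RUNS = {
    "run 1": [1007.05, 972.53, 938.87, 901.94, 870.25, 784.83, 644.06, 69.00],
    "run 2": [809.28, 798.39, 800.93, 802.36, 790.08, 727.97, 614.02, 68.67],
    "run 3": [725.47, 727.55, 707.59, 673.13, 610.91, 488.11, 407.35, 55.34],
}


def relative_to_first(runs):
    """Express every run as a percentage of the first run, test by test."""
    baseline = next(iter(runs.values()))
    return {
        name: [100.0 * value / base for value, base in zip(values, baseline)]
        for name, values in runs.items()
    }


def prefill_seconds(prompt_tokens, tokens_per_second):
    """Time to process a prompt of the given length at a given pp speed."""
    return prompt_tokens / tokens_per_second


if __name__ == "__main__":
    relative = relative_to_first(RUNS)
    print("test     " + "  ".join(f"{name:>8}" for name in RUNS))
    for i, test in enumerate(TESTS):
        cells = "  ".join(f"{relative[name][i]:7.1f}%" for name in RUNS)
        print(f"{test:<8} {cells}")
    # Prefill time for a 32768-token prompt in each run (index 6 is pp32768).
    for name, values in RUNS.items():
        print(f"{name}: {prefill_seconds(32768, values[6]):.1f} s for pp32768")
```

By this arithmetic the third run reaches about 72% of the first run's pp512 speed and about 80% of its tg128 speed, and a 32768-token prompt takes roughly 51 s in the first run versus roughly 80 s in the third.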