LM Studio + Snapdragon Laptops = Bad experience

Posted by Andrew_C0@reddit | LocalLLaMA | View on Reddit | 16 comments

Hello. I've been running into this issue recently that I'm unable to debug or fix whatsoever.

Using the latest version of LM Studio (0.3.30) on my Snapdragon Laptop (a Slim 7X - the 32GB RAM version), I get pretty great experience first time I run LM Studio. I tried recently Qwen3 1.7B model just to test it out, and I get around 50 tokens/s, which is great.

However, that only works the first time the model is loaded. Afterwards, if I want to eject the model and use another one (let's say, Qwen3 4B), I get somewhat arount 0.02 tokens/s. I just don't get why. If I want to reload the same 1.7B model, I get the same token performance.

What I've noticed is that rebooting the laptop and loading the model again, it fixes the issue (in regards to whatever model I load first, including Qwen3 Coder 30B), but as soon as I eject and load another model, until I reboot, the tokens/s is always under 1 t/s.

I haven't altered any settings, so I just downloaded the model, loaded it in, and that's it.

I had the same experience using a Surface Laptop 7 in the past, with an older version of LM Studio, but after some updates, it was somehow fixed.

Any help is greatly appreciated to fix this.