Qwen3.6-27B-GGUF:UD-Q8_K_XL and llama.cpp issue (DGX SPARK)

Posted by DOOMISHERE@reddit | LocalLLaMA

Hey all,

I'm having a crisis that I just can't figure out...

I've used Qwen3.6-27B-GGUF:UD-Q8_K_XL ever since it came out (on a DGX SPARK), and it worked like magic with decent performance (\~50 t/s). I update the SPARK and llama.cpp on a daily basis. 3 days ago something happened, and now I'm getting \~8 t/s...

I tried EVERYTHING...

Hard power cycling (disconnecting the power block, everything...)

A factory reset on the DGX SPARK

Going back to older versions of llama.cpp

Nothing worked...

Banging my head against the wall didn't help either...

Any idea what could have gone wrong?

I have 2 DGX SPARKs and this happens on both of them...

I'm just lost 😞