Qwen3.6-27B-UD-Q6_K_XL.gguf sometimes gets stuck in a loop

Posted by Kirys79@reddit | LocalLLaMA | View on Reddit | 14 comments

Hi all

I'm running Qwen3.6-27B-UD-Q6_K_XL.gguf using llama swap and llama-server with these parameters (actually stolen for some posts on this subreddit.)

        llama-server \
            -m /models/Qwen3.6-27B/Qwen3.6-27B-UD-Q6_K_XL.gguf \
            --mmproj /models/Qwen3.6-27B/mmproj-F32.gguf \
            --n-gpu-layers 99 \
            --no-mmproj-offload \
            --port ${PORT} \
            --temp 0.6 \
            --top-p 0.95 \
            --top-k 20 \
            --min-p 0.0 \
            --presence-penalty 0.0 \
            --repeat-penalty 1.0 \
            -n 32768 \
            -np 1 \
            --no-context-shift \
            --no-webui \
            --timeout 600 \
            --flash-attn on \
            --cache-type-k q8_0 \
            --cache-type-v q8_0 \
            --chat-template-kwargs '{"preserve_thinking": true}' \
            --spec-type ngram-mod --spec-ngram-size-n 24 --draft-min 48 --draft-max 64 \
            --ctx-size ${default_ctx} \
            --batch-size 4096 \
            --ubatch-size 2048

"default_ctx": 131072

The funny part is the thinking sometimes get stuck in a loop.

I mainly use aider to check my code and not to write anything usually just asking to check my code for bugs or security issues.

I asked checking a bash script today and it get in this loop:

Potential Improvements/Notes:

• dialog dimensions: 22 76 16 might be tight for some terminals, but functional. 
• Minor: echo "Invalid choice"; has an unnecessary semicolon.                                                                                                                                  
I will structure the response clearly and briefly. I will reply in English. I will not edit the files. I will act as an expert code analyst. I will answer the question. I will check the code. I will provide      
feedback. I will be concise. I will be accurate. I will be helpful. I will be professional. I will be clear. I will be structured. I will be complete. I will be correct. I will be consistent. I will be coherent.
I will be concise. I will be precise. I will be accurate. I will be reliable. I will be trustworthy. I will be credible. I will be authoritative. I will be expert. I will be professional. I will be helpful. I    
will be useful. I will be valuable. I will be beneficial. I will be advantageous. I will be profitable. I will be worthwhile. I will be significant. I will be important. I will be relevant. I will be pertinent.  
I will be applicable. I will be appropriate. I will be suitable. I will be fitting. I will be proper. I will be correct. I will be right. I will be accurate. I will be precise. I will be exact. I will be         
specific. I will be detailed. I will be thorough. I will be comprehensive. I will be complete. I will be exhaustive. I will be extensive. I will be wide-ranging. I will be broad. I will be general. I will be     
universal. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will   
be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will   
be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I     
will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be             
international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international. I will be global. I will be worldwide. I will be international.

and it goes on unless I hit ctrl+c.

Do you have see any mistake into my llama-server settings that may be the cause?

Any of you do have the same issue?

Thanks

K.