Has anyone here successfully extended Qwen3.5 or 3.6 context length paste 260k?
Posted by fragment_me@reddit | LocalLLaMA | View on Reddit | 11 comments
I've read about YARN, but I'm I'm not familiar with it. And this doesn't seem to work for me, the cap is still 260k.
llama-server \
-m /path/to/qwen3.5-35B.gguf \
-c 400000 \
--rope-scaling yarn \
--rope-scale 1.526 \
--yarn-orig-ctx 262144
11 Comments
FoxiPanda@reddit
fragment_me@reddit (OP)
FoxiPanda@reddit
fragment_me@reddit (OP)
FoxiPanda@reddit
fragment_me@reddit (OP)
FoxiPanda@reddit
Farmadupe@reddit
cafonez@reddit
Ok_Warning2146@reddit
putrasherni@reddit