đ Qwen3-30B-A3B-2507 and Qwen3-235B-A22B-2507 now support ultra-long contextâup to 1 million tokens!
Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 72 comments
đ Qwen3-30B-A3B-2507 and Qwen3-235B-A22B-2507 now support ultra-long contextâup to 1 million tokens!
đ§ Powered by:
⢠Dual Chunk Attention (DCA) â A length extrapolation method that splits long sequences into manageable chunks while preserving global coherence.
⢠MInference â Sparse attention that cuts overhead by focusing on key token interactions
đĄ These innovations boost both generation quality and inference speed, delivering up to 3Ă faster performance on near-1M token sequences.
â
Fully compatible with vLLM and SGLang for efficient deployment.
đ See the update model cards for how to enable this feature.
https://huggingface.co/Qwen/Qwen3-235B-A22B-Instruct-2507
https://huggingface.co/Qwen/Qwen3-235B-A22B-Thinking-2507
https://huggingface.co/Qwen/Qwen3-30B-A3B-Instruct-2507
https://huggingface.co/Qwen/Qwen3-30B-A3B-Thinking-2507
https://modelscope.cn/models/Qwen/Qwen3-235B-A22B-Instruct-2507
https://modelscope.cn/models/Qwen/Qwen3-235B-A22B-Thinking-2507
https://modelscope.cn/models/Qwen/Qwen3-30B-A3B-Instruct-2507
https://modelscope.cn/models/Qwen/Qwen3-30B-A3B-Thinking-2507
72 Comments
QuackerEnte@reddit
QuackerEnte@reddit
Green-Ad-3964@reddit
koflerdavid@reddit
ayylmaonade@reddit
madaradess007@reddit
SandboChang@reddit
LinkSea8324@reddit
intellidumb@reddit
SandboChang@reddit
kapitanfind-us@reddit
SandboChang@reddit
kapitanfind-us@reddit
SandboChang@reddit
kapitanfind-us@reddit
phazei@reddit
kapitanfind-us@reddit
phazei@reddit
phazei@reddit
SandboChang@reddit
mister2d@reddit
vibjelo@reddit
das_war_ein_Befehl@reddit
DorphinPack@reddit
SandboChang@reddit
Divergence1900@reddit
HilLiedTroopsDied@reddit
hainesk@reddit
HilLiedTroopsDied@reddit
Sad_Cardiologist_835@reddit
evilbarron2@reddit
Voxandr@reddit
johnabbe@reddit
AbyssianOne@reddit
johnabbe@reddit
AbyssianOne@reddit
johnabbe@reddit
One-Employment3759@reddit
lucasruedaok@reddit
superkickstart@reddit
gnorrisan@reddit
AbyssianOne@reddit
MrWeirdoFace@reddit
cristoper@reddit
Bakoro@reddit
MrWeirdoFace@reddit
JLeonsarmiento@reddit
silenceimpaired@reddit
waszumteufel@reddit
Current-Rabbit-620@reddit
cristoper@reddit
Current-Rabbit-620@reddit
101m4n@reddit
ChainOfThot@reddit
Kitchen-Year-8434@reddit
ayylmaonade@reddit
renrutal@reddit
Far_Buyer_7281@reddit
LinkSea8324@reddit
vibjelo@reddit
LinkSea8324@reddit
DistanceSolar1449@reddit
PermanentLiminality@reddit
wooden-guy@reddit
ThinkExtension2328@reddit
wooden-guy@reddit
Own-Potential-2308@reddit
BoJackHorseMan53@reddit
Valhall22@reddit
z1xto@reddit
-p-e-w-@reddit
Chromix_@reddit