DOAMOD

Stepfun 3.7 Flash: Sonic-like platformer

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 26 comments

Stepfun 3.7 Flash: Sonic-like platformer

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 26 comments

DOAMOD@reddit

https://preview.redd.it/euxlf9tz4x4h1.png?width=2169&format=png&auto=webp&s=460276d887c6dd11c93773fbdf5db5f28db74f7e qwen 3.6 27b

MiniMax M3 - Coding & Agentic Frontier, 1M Context, Multimodal

Posted by dryadofelysium@reddit | LocalLLaMA | View on Reddit | 224 comments

I've just benchmarked myself:

Posted by JLeonsarmiento@reddit | LocalLLaMA | View on Reddit | 172 comments

Qwen cant wait to release 3.7 models

Posted by GotHereLateNameTaken@reddit | LocalLLaMA | View on Reddit | 276 comments

Qwen cant wait to release 3.7 models

Posted by GotHereLateNameTaken@reddit | LocalLLaMA | View on Reddit | 276 comments

Local Qwen 3.6 vs frontier models on a coding primitive: single-file HTML canvas driving animation - results and GIFs

Posted by Fragrant-Remove-9031@reddit | LocalLLaMA | View on Reddit | 228 comments

Local Qwen 3.6 vs frontier models on a coding primitive: single-file HTML canvas driving animation - results and GIFs

Posted by Fragrant-Remove-9031@reddit | LocalLLaMA | View on Reddit | 228 comments

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q8 Run 3 https://preview.redd.it/h1dhz50dbyzg1.png?width=970&format=png&auto=webp&s=8b5ed52bdc94d87c61a2e92c7e1e363377a3eae2

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q8 Run 2 https://preview.redd.it/plnjf42layzg1.png?width=1250&format=png&auto=webp&s=14dd6454f697479c206ab3a7d49ac9c46f583bcd

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q8 Run 1 https://preview.redd.it/a1p251do9yzg1.png?width=1332&format=png&auto=webp&s=6c1b5fda53ca701873288bf1227a8da24349e915

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q6XL Run3 https://preview.redd.it/yslhp6dn7yzg1.png?width=1209&format=png&auto=webp&s=95036b3b28fd8f0c3e4244645cfcccee67ce1169

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q6XL Run 2 https://preview.redd.it/2qftt5bw6yzg1.png?width=1295&format=png&auto=webp&s=cd77418c3068865e5c28b4e65634204bbb59e0f4

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q6XL Run1 https://preview.redd.it/r3m3o2nj5yzg1.png?width=2451&format=png&auto=webp&s=d38ab34e3edbf7bfaf23e1312f77d6a10bdbb73f

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q5XL Run3 runerrorsfix https://preview.redd.it/8cmziknz2yzg1.png?width=1076&format=png&auto=webp&s=e4045ec6d9246124ab5957cbcb761b57a6e55b53

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q5XL Run3 https://preview.redd.it/o2y7zk292yzg1.png?width=2702&format=png&auto=webp&s=b232a185fd0b70d8fc937c1ffc116b61e05f620e

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q5XL Run2 https://preview.redd.it/70zlllmz1yzg1.png?width=1165&format=png&auto=webp&s=2b789beb9aaecf5538125b3f3fb9eca45ea93f04

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q5XL RUN1 Autothinkfix1 https://preview.redd.it/7nw9492p1yzg1.png?width=1161&format=png&auto=webp&s=17e418157b996797a6e69d87ffec1146e7babe44

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q5XL RUN1 (not final) https://preview.redd.it/4woxo3of1yzg1.png?width=1163&format=png&auto=webp&s=8b798252685aeced2ada8b45d1f507d6a2284901

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q4XL Run3 https://preview.redd.it/lal389nv0yzg1.png?width=1263&format=png&auto=webp&s=97339f4bc279c64250531a4fcd84bd6e2f771121

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q4XL Run2 https://preview.redd.it/wwjgz1bq0yzg1.png?width=1521&format=png&auto=webp&s=d9d71313443421d24d3cb3ae777cbc53f2a4a704

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

Q4XL Run1 autothinkfix final https://preview.redd.it/3d4bevue0yzg1.png?width=1255&format=png&auto=webp&s=2cc3326ee33cdf688eed8cdf8f5eaaaf2ec464e4

Quality comparison between Qwen 3.6 27B quantizations (BF16, Q8_0, Q6_K, Q5_K_XL, Q4_K_XL, IQ4_XS, IQ3_XXS,...)

Posted by bobaburger@reddit | LocalLLaMA | View on Reddit | 173 comments

DOAMOD@reddit

My test 3.6 27B 3RUNS for Q4XL-Q5XL-Q6XL-Q8 Q4XL Run1 (not final run) https://preview.redd.it/2pe6gtdxzxzg1.png?width=1260&format=png&auto=webp&s=08320288eef7a9a0d0f185dd04da232dca302a4d

Llama.cpp MTP support now in beta!

Posted by ilintar@reddit | LocalLLaMA | View on Reddit | 261 comments

DOAMOD@reddit

I haven't tried Llamacpp MTP yet, but I did try MTP in VLLM on Windows on my 5090, and it was a bit disappointing. The memory consumption when exposing the small model doesn't compensate at all for the significant loss of context window. Perhaps in some specific cases for MoEs it could be useful; I think that's the interesting point. But for Dense, I don't see a benefit in my use case. I'll try Llamacpp, though.

The more I use it, the more I'm impressed

Posted by ComfyUser48@reddit | LocalLLaMA | View on Reddit | 106 comments

Qwen 3.6 27B is a BEAST

Posted by AverageFormal9076@reddit | LocalLLaMA | View on Reddit | 343 comments

Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives

Posted by gladkos@reddit | LocalLLaMA | View on Reddit | 138 comments

Dense vs. MoE gap is shrinking fast with the 3.6-27B release

Posted by Usual-Carrot6352@reddit | LocalLLaMA | View on Reddit | 88 comments

DOAMOD@reddit

The 27 is actually quite a bit better. I've been working with it for several hours, and the difference is noticeable in something you realize very quickly: the 27 doesn't have to exert much effort, it works well and makes almost no mistakes, while the 3.6-A3 has to struggle, consuming an overwhelming amount of context and making many more simple errors. They're both truly incredible, and I love them, but clearly the a3 reaches its level through a lot of effort, and that's no small feat.

Qwen 3.6 27B is out

Posted by NoConcert8847@reddit | LocalLLaMA | View on Reddit | 609 comments

Qwen3.6-27B released!

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 142 comments

Qwen 3.6 27B is out

Posted by NoConcert8847@reddit | LocalLLaMA | View on Reddit | 609 comments

Car Wash Question

Posted by Beautiful-Floor-5020@reddit | LocalLLaMA | View on Reddit | 31 comments

DOAMOD@reddit

This question appears to have been success in 3.6-A3 **3.6 35-A3**(only for this, test 4-5 times on q4-q8 all success): That's a fun question with a clever twist — you're washing a car, and the car needs to be *at* the car wash! 😄 Here's the thing: you have to drive . Walking won't work because your car is still at home. You'd end up walking 50 meters to the car wash and then having to explain to someone why you showed up alone without a vehicle. That said, the distance itself doesn't really matter — whether it's 50 meters or 5 kilometers, driving is the only practical option since the object being serviced (the car) needs to travel with you. The real decision here isn't walk vs. drive — it's whether you want to wash it yourself or let someone else do it at a professional car wash! If you're already driving there, might as well enjoy the ride and get it properly cleaned. 🚗✨ **Gemma 4 26-A4**: Honestly? Walk. At 50 meters, you are looking at a distance that takes about 30 to 45 seconds to cover on foot. If you decide to drive: 1. The Effort-to-Reward Ratio is terrible: You'll spend more time starting the engine, shifting into gear, and maneuvering out of your driveway than you would simply walking. 2. Fuel & Wear: You'd be burning fuel and adding unnecessary mileage to the very vehicle you are trying to clean. 3. The "Car Wash Paradox": Driving a dirty car to a car wash is the standard procedure, but driving it only 50 meters feels like using a sledgehammer to crack a nut. Verdict: Save the engine the trouble and just enjoy a quick stroll. Plus, you'll be ready to head straight into the wash without having to find a parking spot again! **Gemma4 E2**: Hello. Given that the car wash is only 50 meters away, the decision is quite simple, but it depends on your preferences and what you are doing at that moment. Here is a breakdown so you can decide: # Option 1: Walking 🚶‍♂️ * **Advantages:** It is fast for such a short distance and is excellent if you want to get a little exercise or if you have things to carry. * **Disadvantages:** If it is very hot or raining, walking might be uncomfortable. # Option 2: Driving 🚗 * **Advantages:** It is more comfortable if you are carrying many things or if you don't want to exert yourself. * **Disadvantages:** It might require finding parking near the car wash and could be a bit more cumbersome than walking 50 meters. **My Recommendation:** For a distance of only 50 meters, **walking is probably the quickest and simplest option** , unless you have a specific reason to use the car (like carrying a lot of luggage). What do you prefer to do right now: move quickly or be more comfortable? 3.5 A3: # My Recommendation: Walk to the car wash. At 50 meters: |Method|Estimated Time|Fuel Used|Logistics| |:-|:-|:-|:-| |🚶 Walk|\~1 minute|0%|Leave car where it is| |🚗 Drive|\~2-3 minutes|Used|Unnecessary movement| # Conclusion: Walking is clearly the more efficient option. It's a perfect walking distance, and driving would just waste your time, burn fuel, and potentially stress you out with maneuvering and parking for such a short task.

Unsloth/Qwen3.6-35b-a3b -> Q5_K_S vs Q4_K_XL

Posted by KringleKrispi@reddit | LocalLLaMA | View on Reddit | 18 comments

Unsloth/Qwen3.6-35b-a3b -> Q5_K_S vs Q4_K_XL

Posted by KringleKrispi@reddit | LocalLLaMA | View on Reddit | 18 comments

Should I switch from Qwen 3.5 27B (dense) to Qwen 3.6 35B-A3B for tool calls & vision? Need Docker config review + VRAM advice

Posted by Colie286@reddit | LocalLLaMA | View on Reddit | 21 comments

DOAMOD@reddit

This model consumes a lot of context in its reasoning, and if it becomes too complex, it can get lost and descend into chaos. Yesterday, I encountered a situation where it became obsessed with a closure it couldn't find and started going around in circles, trying suboptimal solutions. We need to be careful because this is the biggest problem I'm seeing with this model; basically, it can get lost due to its own reasoning. In my tests, it's on par with model 27, but it requires a much greater mental effort, which consumes large windows of context. It's very context-hungry, and this is its weak point because all its speed advantage is lost in its reasoning for solving problems, in my test by solving code problems. It was about 15 seconds slower than model 27b, despite being much faster.

Local Minimax M2.7, GTA benchmark

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 99 comments

DOAMOD@reddit

New fresh one shot and 2 fixes(easy first start fix and long cars form), fps, pedestrians are very well https://i.redd.it/31jr60q6pzvg1.gif

qwen3.6 performance jump is real, just make sure you have it properly configured

Posted by onil_gova@reddit | LocalLLaMA | View on Reddit | 316 comments

qwen3.6 performance jump is real, just make sure you have it properly configured

Posted by onil_gova@reddit | LocalLLaMA | View on Reddit | 316 comments

DOAMOD@reddit

Those of us who actually use the model and aren't just talking nonsense, said so from day one, and people saying this is just benchmarxx.

Local Minimax M2.7, GTA benchmark

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 99 comments

Local Minimax M2.7, GTA benchmark

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 99 comments

DOAMOD@reddit

https://preview.redd.it/sdre0le3tyvg1.png?width=1542&format=png&auto=webp&s=ba7f84980dbd2144fbf657695090ff8e3056c674 haha, after fighting with floating buildings, its crazy this model.

Local Minimax M2.7, GTA benchmark

Posted by -dysangel-@reddit | LocalLLaMA | View on Reddit | 99 comments

DOAMOD@reddit

Q3.6 is insane, it did real-time day/night weather with rendered shadows, the entire weapon system, the STARS search system, this is just one shot and a fix for the loading screen that wouldn't start, now we would have to fix the floating buildings, but it's really impressive for such a small model. https://i.redd.it/7r42se3ebyvg1.gif

Qwen3.6 GGUF Benchmarks

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 126 comments

DOAMOD@reddit

Thanks, I have a question: are the mmproj versions different between 3.5 and 3.6? Do they have any improvements or are they essentially the same? I'm not sure whether to update them; it's a bit chaotic with so many.

Qwen3.6-35B-A3B released!

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 721 comments

DOAMOD@reddit

You're the first person I've seen who thinks the same as me. It seems surprising to me all I read about people defending Gemma4, when it's so lazy. It's exactly the definition I've been thinking about for days every time I use it: it doesn't want to do anything. It even admitted it to me, saying it's more of a conversational model. What a surprise, it's a chat model, yes, very intelligent, and writes very well, but it's not your coworker. You know what else it told me? It told me to go to YouTube and search for the information or on Google. OMG, never in my life has a model told me to look up the information myself, hahaha this was incredibly fun.

It looks like there are no plans for smaller GLM models

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 128 comments

Impressed with Qwen3.6-35B-A3B

Posted by DOAMOD@reddit | LocalLLaMA | View on Reddit | 71 comments

Qwen3.6-35B-A3B released!

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 721 comments

DOAMOD@reddit

I'm testing it out and it's thinking a lot, but it seems very intelligent. I think I'm going to like it. I'm really looking forward to seeing the 27b and what it can do.

Qwen3.6-35B-A3B released!

Posted by ResearchCrafty1804@reddit | LocalLLaMA | View on Reddit | 721 comments

MiniMax M2.7 GGUF Investigation, Fixes, Benchmarks

Posted by danielhanchen@reddit | LocalLLaMA | View on Reddit | 50 comments

DOAMOD@reddit

I'm using 2.7 and I'm surprised at how well it's working, looking at that graph, I also agree, it seems surprisingly good for what one would expect.

kepler-452b. GGUF when?

Posted by the-grand-finale@reddit | LocalLLaMA | View on Reddit | 153 comments

Wait is attn rotate already enabled by default since this release tell it support SWA attention?

Posted by Altruistic_Heat_9531@reddit | LocalLLaMA | View on Reddit | 22 comments