Qwen 3.5B is so impressive, it found multiple bugs claude opus 4.7 couldnt

Posted by ArugulaAnnual1765@reddit | LocalLLaMA | View on Reddit | 17 comments

Just wanted to start off with how absolutely blown away i am by this new model. I am running the bartowski/Qwen_Qwen3.6-35B-A3B-GGUF IQ4_XS quant on my 5090 with the full 256k context.

I am damn impressed! I had asked it a very broad question, to just look for any bugs or issues.
With that huge context window, I noticed it dumping entire relevant files into its context , which it could easily handle, it filled up to 150k\~ tokens before dumping its plan, which I am seriously cool with (I like to transfer the plan to a new convo and reset that window anyway)
It was able to find multiple bugs which violated the guidelines set in rules/claude.md

Running on my 5090, it was blazing at around 180 tps - my eyes were wide as I saw the machine work in front of me, it was truly glorious

In contrast, I tasked slowpus 4.7 to the same task. After taking literally 10x longer and using my entire 5hr usage window, It didnt even find half of the legitimate bugs that my local setup found.
I noticed that claude was MUCH more careful about loading up the context, performing a ton of greps and text searches, sure its much more efficient for anthropics servers, but it will never beat half of the codebase being loaded straight into context lmao

Overall, the past 6 months has fealt like flying on top of a rocket - it was so useless months ago, now its super smart and insanely fast, my mind it literally blown rn