Qwen3.5-35B-A3B-Claude-4.6-Opus-Uncensored-KL-UD-V2-GGUF + Bonus scripts

Posted by EvilEnginer@reddit | LocalLLaMA | View on Reddit | 5 comments

Hello everyone. Finally I fixed Qwen3.5 35B A3B model after finetuning on Claude Opus datasets and merging with HauhauCS uncensored version. I fixed: attention, Dense FFN, MoE routed experts, and MoE shared expert tensors in model.

Here link: https://huggingface.co/LuffyTheFox/Qwen3.5-35B-A3B-Claude-4.6-Opus-Uncensored-KL-UD-V2-GGUF . Please read launch instructions on page for best experience.

I merged: samuelcardillo model with HauhauCS model, and applied my fixes.
Merging has been done via this script: https://pastebin.com/eB6zB4DU

Model programming features has been tested via following prompts:

Write an Arkanoid game using HTML5 and Javascript. The game should be controlled with a mouse and include generated sounds and effects. The game should have beautiful design with neon bricks and sounds.
Add bonus system. Change background to space.

I got this result: https://pastebin.com/P29JEnPA

Bonus script: Universal Dynamic quantization workflow for Google Colab Free (CPU).

Quantization has been done via this script for UD Q4_K_XL quant: https://pastebin.com/5Ba6qs7L

My idea:

Read the exact per-tensor quantization types used in: Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf (Unsloth) quant.
Save them into a unsloth_ud_profile.json here link: https://pastebin.com/qYrFYadc
Delete Unsloth reference quant to save disk.
Quantize your finetuned GGUF (Q8_0/BF16) -> Q4_K_XL using that JSON profile.

Enjoy \^_\^. Please upvote / share if you like the model and my scripts.

[-]

high_funtioning_mess@reddit

Can you run some benchmarks here or even create your own tests on this free website and share the benchmark results so that we can compare?

https://benchmark.braintwin.ai

EvilEnginer@reddit (OP)

Don't know how to do it :(. I never did benchmarks in my pratice. Kinda new in LLM and all this stuff.
But I would be really happy to see, if someone from community can do this.

Ok, let me see if I can create some uncensored benchmarks and share it here.

Nice :). I'm also very interested to know how this model berforms in benchmarks.

Frankly saying I was surprised how well ChatGPT 5.4 on https://arena.ai/text/direct can debug code written by Claude Opus. Claude Opus really bad in debugging and breaks scripts very often.