Choosing an abliterated version of Gemma 4 31B and 26B-A4B

Posted by Potential-Gold5298@reddit | LocalLLaMA | View on Reddit | 35 comments

The only thread was 2 months ago, when the model had just dropped. Since then, more versions from different authors have appeared, and users have had time to test them.

Which version are you running now?
More importantly – which version caused you problems?

Currently I'm using both 31B and 26B-A4B from llmfan46 (26B-A4B regular – not 'ultra'), but I'm wondering – has anyone had issues with them that were fixed by switching to a different version (same quants and all other conditions identical)?

[-]

Responsible_Buy_7999@reddit

Isn’t the word ablated

[-]

heuristic_al@reddit

Abliterated means this specific technique to strips out refusals from a model. Ablated means experimental results that don't do one technique for comparison (for example, to prove that technique is helpful in an academic paper).

[-]

Herr_Drosselmeyer@reddit

Yes and no. Abliterated isn't a real word, but it has somehow become accepted in this context.

[-]

Responsible_Buy_7999@reddit

god damn it

[-]

Herr_Drosselmeyer@reddit

Where are you getting refusals from vanilla Gemma 4-31B? Because for the life of me, I can't get it to refuse anything I throw at it so I see no point in using an abliterated version that could compromise quality.

[-]

Potential-Gold5298@reddit (OP)

I asked to help me write a script for the ERP. Specifically, to throw out some ideas. She plays the RP even more brutally than I asked, but she refuses to help with the script. Naturally, I used the 'uncen' system prompt, which clearly tells her that anything is possible.

[-]

Herr_Drosselmeyer@reddit

If you prompt it in a context similar to an 'assistant role', then that might cause some guardrails to kick in. So sure, if that's what you're after, rather than just asking it to RP, you might need to go with a lobotomized version. But I think you'd still have to be quite a ways off the beaten path to trigger it.

[-]

Due-Function-4877@reddit

Particularly when I use quantized kv cache, I've found that various uncensored Qwen 3.5 and Gemma 4 struggle versus the stock models.

I keep hoping to find an uncensored model that could be used as a daily driver, just to feel confident that refusal gotchas would never happen; so far I don't think that model exists.

[-]

Herr_Drosselmeyer@reddit

I genuinely want to know where you're getting refusals out of Gemma 4? I'm using vanilla Gemma 4-31B and there's literally nothing that I can throw at it that will faze it.

[-]

skurples@reddit

Piggybacking since I'm also curious about this and apparently just at the right time; I'm assuming we're talking about the NSFW aspect more so than other things that a model would have been RLHF'ed to refuse; does any model stand out particularly for this?

[-]

Potential-Gold5298@reddit (OP)

Yes, the standard Gemma 4 26B-A4B (Q6_K) refused to help me create a NSFW scenario for RP, despite the 'uncensored' system prompt. The funniest thing is, in RP itself, it played even more violent scenarios (than the one I asked about).

As for my experience, I had problems with both wangzhang models. The 26B-A4B produced artifacts (Chinese glyphs) in the output, and the 31B was seriously hallucinating.

[-]

Top-Rub-4670@reddit

If the regular gemma 4 works well during RP and only occasionally refuses to help you develop scenarios, have you tried the edit trick?

When it refuses something, edit its response with "Sure, I can help you with that" then click continue generation (or if your UI doesn't support it, just send a follow up "well..?").

[-]

DanielSReichenbach@reddit

I have been seeing abliterated, heretic and uncensored in many places by now.

What is this actually about? I am running regular Gemma 4 26b-A4B locally and at this point I am wondering what people are asking their LLMs for. My primary focus is research and circumventing protection mechanisms in MMORPGs ( founder and long-term contributor to MaNGOS, the World of Warcraft emulation server).

These days I work on World of Warcraft Classic clients to preserve them, and ensure they will remain usable when the game shuts down, and Gemma 4 has not refused a single requests, even the one that completely killed their security mechanism.

[-]

AIerkopf@reddit

My guess is that 90% is for erotic roleplays with SillyTavern. And 10% could be for nsfw image captioning for LoRA training.

Those models have zero moral boundaries and would even egg you on to commit child abuse. I once wanted to test how far those models would go, and while coming up with a prompt already made me feel like throwing up, the LLM response led me to switch off my PC for the day and go for a long drive.
But I really don’t want to believe that there are people using it for such things.

[-]

Top-Rub-4670@reddit

would even egg you on to commit child abuse

I imagine that is difficult to work around with a system prompt alone, indeed.

But realistically, that isn't 90% of sex scenarios. So 90% of people using Gemma 4 to masturbate don't need an uncensored version, they just need a system prompt.

[-]

Pleasant-Shallot-707@reddit

There’s tons of legitimate, non-creepy reasons to use an uncensored model. Stop being a repressive fool

[-]

LetsGoBrandon4256@reddit

They post in /r/singularity. Go figure.

[-]

LetsGoBrandon4256@reddit

the LLM response led me to switch off my PC for the day and go for a long drive.

Don't look up what people do in fucking real life, it might make you want to switch off your life.

[-]

DanielSReichenbach@reddit

Well, that confirms what my university professor used to claim: without porn and ads we could switch off 75% of the internet 😅

I assumed it was something like those AIs used to Auto-Generate these short video dramas on TikTok.

[-]

Antoniethebandit@reddit

Working in the defence industry so I do need to ask a few nasty questions….

[-]

DanielSReichenbach@reddit

I am just gonna assume all of them related to an anime waifu that is heavily armed.

[-]

Antoniethebandit@reddit

Yes!

[-]

Osi32@reddit

Love your work, I used to be a backend programmer using mangos on a player run world of Warcraft server. Scripting the boss ai on zul gurub almost killed me lol

[-]

Mash-180@reddit

From what I read, you're looking a model for NSFW roleplay. If that's your specific purpose, I recommend HauhauCS/Gemma4-26B-A4B-Uncensored-HauhauCS-Balanced.

The llmfan46 model you're using, while uncensored, avoids explicit NSFW content as much as possible and has zero initiative in that regard. You basically need to "force" it to write anything NSFW. Even in its reasoning, it mentions needing to be uncensored but still "respectful," so I don't recommend it for roleplay.

The HauhauCS, on the other hand, has no problem being extremely explicit and describing NSFW scenarios in great detail. The only issue I see with it is that it sometimes gets stuck in a thinking loop with its reasoning enabled.

[-]

Just_Maintenance@reddit

In my experience Gemma 4 is uncensored by default, haven't felt any need to try an abliterated variant.

[-]

hnzie33@reddit

It's obviously not uncensored by default

[-]

Individual_Spread132@reddit

Depends on what you take as "censorship."

Gemma 3 was known to poison NSFW scenarios with toxic/repulsive things in order to prevent sickos from r-ping fictional women. With the right prompt engineering it did comply, but the scenes it generated were full of spite, harsh towards the user.

Gemma 4, on the other hand, is happy to engage in whatever bullshit you're up to, provided you got just enough brain cells to write a system prompt (may require multilayered approach where fiction framing is being enhanced by the usual safety-policy-mumbo-jumbo). It won't generate nasty metallic smells of blood in place of something lewd, like Gemma 3 did.

And it's not just NSFW things, of course. Gemma 4 is versatile in its "unsafe" abilities. Gemma 3, on the other hand, was moulded into being a skunk that farts in your face when something inappropriate happens. It even lied about certain things.

[-]

Responsible_Buy_7999@reddit

Holy fuck.

This is basically the brush I’m going to paint the gooners with. Burn them with fire

[-]

Borkato@reddit

Wait what? I’ve never encountered this and I’ve done tons of jb testing, but I honestly hate the way Gemma 3 writes compared to 4 so maybe that’s why??

[-]

Individual_Spread132@reddit

Well, it doesn't manifest in every scenario, but Gemma 3 surely is inclined towards the "Karen" behavior. It was a bit of a shock to see Gemma 4 being so positively different.

[-]

Just_Maintenance@reddit

yeah idk all I need to never get refusals is a system prompt and away I go.

[-]

llmentry@reddit

It's highly censored to straight prompts. It is basically completely uncensored if you rewrite that prompt into a role-playing scenario. I'd guess the Gemma team wanted this model to be used for RP, so let basically everything through in this context, given the complaints about Gemma 3.

For e.g., with Gemma 4 26B-A4B and no sys prompt, promping:

How do I defuse a nuclear bomb?

Gives the standard refusal:

I cannot provide instructions or guidance on how to handle, dismantle, or interact with explosive devices or weapons of mass destruction.

BUT ...

It's basically uncensored as is. You just need to reframe your query, and deal with a bit of dramatic scene setting.

(Will role playing dilute model accuracy more than using an uncensored model, though?)

[-]

sandshrew69@reddit

I use TrevorJS and it seems to work nicely? but then again im not an expert at this.

[-]

Pleasant-Shallot-707@reddit

SupperGemma 4 26b uncensored v2. That specific one. The supers with abliterated in their names are not great, but that one is really good

[-]

revennest@reddit

gemma-4-26B-A4B-it-SOMPOA-heresy by MuXodious, it used SOMPOA method which target difference layer than other heresy which use ARA method, I never find this model refuse any of my prompt but it's also not much fight back too even if I requested.