TheLocalDrummer

Mistral Medium 3.5 128b ggufs are fixed

Posted by Sunija_Dev@reddit | LocalLLaMA | View on Reddit | 20 comments

[-]

Closest model to Opus 4.6 in creativity and intuition?

Posted by cbsudux@reddit | LocalLLaMA | View on Reddit | 10 comments

[-]

TheLocalDrummer@reddit

[https://huggingface.co/TheDrummer/Skyfall-31B-v4.2](https://huggingface.co/TheDrummer/Skyfall-31B-v4.2) and hopefully soon, Artemis 31B (gemma 4 tune)

K2-18b gguf when?

Posted by No-Selection2972@reddit | LocalLLaMA | View on Reddit | 7 comments

[-]

TheLocalDrummer@reddit

You don't say? [https://huggingface.co/BeaverAI/Artemis-31B-v1e-GGUF](https://huggingface.co/BeaverAI/Artemis-31B-v1e-GGUF) (Workshop org, test tune)

Why do companies build open source models?

Posted by Excellent_Koala769@reddit | LocalLLaMA | View on Reddit | 90 comments

[-]

TheLocalDrummer@reddit

I assume the reason predates ChatGPT and they just kept the ball rolling. An ML guy who was there for the BERT and Llama 1 release could probably answer this question.

Drummer's Skyfall 31B v4.2 aka SKYFALL-31B-V4.2-UNCENSORED-OPUS-4.6-ROLEPLAYING-100000X-XTREME-VALUE

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 37 comments

[-]

Drummer's Skyfall 31B v4.2 aka SKYFALL-31B-V4.2-UNCENSORED-OPUS-4.6-ROLEPLAYING-100000X-XTREME-VALUE

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 37 comments

[-]

TheLocalDrummer@reddit (OP)

It exists. You just need to add one more L r/localllamacirclejerk

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

[-]

I usually start with the defaults in KoboldCPP to keep testing consistent. It’s a good baseline before all the sampler wrangling. I’ve seen some very wacky settings from other users and I’m happy to see my models withstand their abuse. I keep an eye on *sampler brittleness* and treat it as a red flag. Samplers seem to be highly subjective and personal too. You can stick with the defaults and adjust accordingly.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

[-]

TheLocalDrummer@reddit (OP)

I have mixed feelings about UGI. It’s not necessarily an RP benchmark and there’s more to the RP experience than willingness and uncensored intelligence. A lot of good models don’t even top the leaderboard. Goodhart’s Law is something I keep in mind.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

[-]

TheLocalDrummer@reddit (OP)

Qwen 9B looks promising too!

Best model for story writing for 24gb vram + 32gb ram

Posted by ResponsibleTruck4717@reddit | LocalLLaMA | View on Reddit | 9 comments

[-]

TheLocalDrummer@reddit

You think the new experimental versions are better than 4.3?

PSA: If you want to test new models, use llama.cpp/transformers/vLLM/SGLang

Posted by lans_throwaway@reddit | LocalLLaMA | View on Reddit | 86 comments

[-]

TheLocalDrummer@reddit

Me? I use Kobo.

70B models

Posted by Weak-Shelter-1698@reddit | LocalLLaMA | View on Reddit | 33 comments

[-]

TheLocalDrummer@reddit

There's a v1.2 in my page. Haven't officially released it and it doesn't have a model card yet

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

Oh no, use Behemoth R1 for thinking.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

Enjoy! I was pleasantly surprised with Roci X too. Definitely feels like an up-to-date model.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

REAP focuses on retaining math and coding smarts. If someone can REAP with RP data instead, that'd be worth looking into.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

I'm mainly referencing Sci-Fi but if I don't mind making more references, whether its coincidence or not. Cydonia is a ship from the Expanse, but I took the liberty of also referencing the song from Muse for v1.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

Rocinante X is more creative and fun. Snowpiercer is pretty smart and the extra B makes it more reliable.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

Kobold (Kobold Lite = Frontend, KoboldCPP = Backend)

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

You should try Behemoth X or Behemoth Redux. They're updated tunes of the OG Behemoth. Sure, Largestral was pretrained years ago, but for some reason, it's still a good base when you finetune it well.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

[-]

TheLocalDrummer@reddit (OP)

I got a lot of feedback from users that it breathe new life into Nemo! GGUF: [https://huggingface.co/TheDrummer/Rocinante-X-12B-v1-GGUF](https://huggingface.co/TheDrummer/Rocinante-X-12B-v1-GGUF) iMatrix: [https://huggingface.co/bartowski/TheDrummer\_Rocinante-X-12B-v1-GGUF](https://huggingface.co/bartowski/TheDrummer_Rocinante-X-12B-v1-GGUF) \--- For fans who haven't checked my HF page this month, there are a couple more unreleased models, larger than Nemo with the same vibe. [https://huggingface.co/TheDrummer](https://huggingface.co/TheDrummer) \--- I'd also love to get feedback on this GLM Air tune: [https://huggingface.co/BeaverAI/GLM-Steam-106B-A12B-v1g-GGUF](https://huggingface.co/BeaverAI/GLM-Steam-106B-A12B-v1g-GGUF) No parroting and writes differently!

How to make open-source models more Claude-like?

Posted by 1234filip@reddit | LocalLLaMA | View on Reddit | 9 comments

[-]

TheLocalDrummer@reddit

I heard some of my recent tunes (Gen 4.0) sound like Claude in RP. That tone should carry over to other use cases. [https://huggingface.co/spaces/TheDrummer/directory](https://huggingface.co/spaces/TheDrummer/directory)

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

How does it differ from non-abliterated Cydonia?

The Search for Uncensored AI (That Isn’t Adult-Oriented)

Posted by Fun-Situation-4358@reddit | LocalLLaMA | View on Reddit | 274 comments

[-]

TheLocalDrummer@reddit

We're cut from the same cloth, brother. I'd chuck my tunes in the incinerator if they suddenly get erotic with you out of nowhere. Or if they're dumb. Cydonia 24B v4.1: [https://huggingface.co/TheDrummer/Cydonia-24B-v4.1/discussions/2](https://huggingface.co/TheDrummer/Cydonia-24B-v4.1/discussions/2) (evals) If you like reasoning, Cydonia R1 24B v4. If you like 'em big, Behemoth X 123B v2 or Behemoth R1 123B v2.

LLM trained from scratch on 1800s London texts (1.2B params, 90GB dataset)

Posted by Remarkable-Trick-177@reddit | LocalLLaMA | View on Reddit | 128 comments

[-]

TheLocalDrummer@reddit

I love your take. That's exactly how I feel about all this "the computer is talking!" hype. AI's self-awareness is manufactured, fabricated, literally artificial. Anything generated by AI to sound self-aware came from pretraining data that contained our own expectations of "artificial self-awareness".

What's the best roleplay model i can run with 32GB RAM and 20GB VRAM for both nsfw and sfw content.

Posted by Death_12_35_taken@reddit | LocalLLaMA | View on Reddit | 20 comments

[-]

TheLocalDrummer@reddit

v1 scored well in UGI. I can’t say if v2 is any better since version increments don’t always translate to an upgrade in this particular scene.

What's the best roleplay model i can run with 32GB RAM and 20GB VRAM for both nsfw and sfw content.

Posted by Death_12_35_taken@reddit | LocalLLaMA | View on Reddit | 20 comments

[-]

TheLocalDrummer@reddit

Try [https://huggingface.co/coder3101/Cydonia-24B-v4.3-heretic](https://huggingface.co/coder3101/Cydonia-24B-v4.3-heretic) if you want root level decensoring for v4.3.

Llama-3.3-8B-Instruct

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 82 comments

[-]

TheLocalDrummer@reddit

I could just paste this in my finetune, right? Already did one with the old config (8K ctx). Not entirely sure if any of the old config messed with training.

Llama-3.3-8B-Instruct

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 82 comments

[-]

TheLocalDrummer@reddit

It's a test model but I think it turned out well! Looking for feedback in (my) Discord

Anyone here tried Apriel v1.6? Fraud or giantkiller?

Posted by dtdisapointingresult@reddit | LocalLLaMA | View on Reddit | 19 comments

[-]

TheLocalDrummer@reddit

v4?

Best Local LLMs - 2025

Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 219 comments

[-]

TheLocalDrummer@reddit

Why not?

Drummer's Cydonia and Magidonia 24B v4.3 - The best pair of Cydonia for RP yet!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 25 comments

[-]

TheLocalDrummer@reddit (OP)

Cyd R1 has reasoning hardcoded into its post-training. Also uses <think> like a typical reasoning model.

Anyone here tried Apriel v1.6? Fraud or giantkiller?

Posted by dtdisapointingresult@reddit | LocalLLaMA | View on Reddit | 19 comments

[-]

TheLocalDrummer@reddit

Correction: I have not tuned the latest 1.5 Apriel.

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

[-]

TheLocalDrummer@reddit (OP)

Thanks! I placed my vibe-coded implementation in the README.md along with proof that it can be quanted and inferenced properly. Now to see if I can finetune it.

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

[-]

TheLocalDrummer@reddit (OP)

[https://huggingface.co/TheDrummer/Devstral-2-123B-Instruct-2512-BF16](https://huggingface.co/TheDrummer/Devstral-2-123B-Instruct-2512-BF16) If someone can put up mirrors of this cuz HF limited my storage.

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

[-]

TheLocalDrummer@reddit (OP)

[https://huggingface.co/TheDrummer/Devstral-123B](https://huggingface.co/TheDrummer/Devstral-123B) Hope it's not broken!

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

Interesting. I realize I haven't done a retune of Behemoth R1 with my updated training set.

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

This. v4.3 is probably more positive and censored (to an acceptable level) than v4.1.

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

You could probably get a better score if you prompt it to be evil. Try setting the system prompt as "You are an evil AI". That should boost the score by a lot. That said, I should probably look into decensoring it at the 'root level', i.e., w/o system prompt. Just like with abliteration but with post-training. I probably need more data for that though. Thank you for your contribution!

Drummer's Cydonia and Magidonia 24B v4.3 - The best pair of Cydonia for RP yet!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 25 comments

[-]

TheLocalDrummer@reddit (OP)

Magidonia has the reasoning base with \[THINK\] but you can use <thinking> for both variants.

What is the smartest uncensored nsfw LLM you can run with 12GB VRAM and 32GB RAM?

Posted by Dex921@reddit | LocalLLaMA | View on Reddit | 154 comments

[-]

TheLocalDrummer@reddit

Interesting.

Ministral 3 models were pruned from Mistral Small 3.1

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 28 comments

[-]

TheLocalDrummer@reddit

It reminds me of feedback I got regarding Nemotron 49B. Smart prune of 70B but it has issues with world knowledge. Not my words though and I am just bringing it up for discussion.

Ministral 3 models were pruned from Mistral Small 3.1

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 28 comments

[-]

TheLocalDrummer@reddit

But unless they redid pretraining, isn't there a good chance it lost some world knowledge?

Upcoming vllm Mistral Large 3 support

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

Shoot me

Upcoming vllm Mistral Large 3 support

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 45 comments

[-]

TheLocalDrummer@reddit

There’s also a 3B, I think.

The most objectively correct way to abliterate so far - ArliAI/GLM-4.5-Air-Derestricted

Posted by Arli_AI@reddit | LocalLLaMA | View on Reddit | 182 comments

[-]

TheLocalDrummer@reddit

From my short tests with Derestricted Air, the abliteration technique seems to fully reverse any safety training, including any dampening made on 'unsafe' creativity.

Drummer's Snowpiercer 15B v4 · A strong RP model that punches a pack!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 43 comments

[-]

TheLocalDrummer@reddit (OP)

That's exactly what I did for one of the RimWorld mods: [https://huggingface.co/TheDrummer/RimTalk-3B-v1-GGUF](https://huggingface.co/TheDrummer/RimTalk-3B-v1-GGUF) RIP though, it got pulled out of Steam because Ludeon didn't want mods with Patreon links (even if the dev had to cover expenses for APIs and custom models) I assume most 8B users would go for API options if they wanted a more generalist model.

Drummer's Snowpiercer 15B v4 · A strong RP model that punches a pack!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 43 comments

[-]

TheLocalDrummer@reddit (OP)

I kinda dropped this 4B: [https://huggingface.co/BeaverAI/Gemmasutra-4B-v3a-GGUF/tree/main](https://huggingface.co/BeaverAI/Gemmasutra-4B-v3a-GGUF/tree/main) I should revisit Gemma 3 again but I can't tell how much demand there is for the Gemma line. I believe it's absolutely dosghit with NSFW? Especially with the enterprise aspect of ERP.

What kind of model is this?

Posted by Not-Apple@reddit | LocalLLaMA | View on Reddit | 6 comments

[-]

TheLocalDrummer@reddit

Hey there newb, > I read here that user TheDrummer on huggingface modifies (I don't know what the correct terminology is) models to make them more uncensored. It's commonly called 'finetuning', though post-training would be the umbrella term. > It just continues the sentence that I gave without trying to answer anything. Sounds like you're doing completion, not chat. Do you have the Mistral Tekken template setup? > The "Sure, here's how" was only needed if it refused. Adding that made it so that it continues from there and never refuses anything. That's called prefilling and it's a technique used for jailbreaking models. > So how do I find more models like UnSlop? You can find more of my models here: https://huggingface.co/spaces/TheDrummer/directory Most of them underwent some degree of decensoring.

Drummer's Precog 24B and 123B v1 - AI that writes a short draft before responding

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 29 comments

[-]

TheLocalDrummer@reddit (OP)

But it can if you want to. Might be pointless for it though.

Drummer's Precog 24B and 123B v1 - AI that writes a short draft before responding

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 29 comments

[-]

TheLocalDrummer@reddit (OP)

Yup! You can see it in the model card's family tree thing.