TheLocalDrummer

Mistral Medium 3.5 128b ggufs are fixed

Posted by Sunija_Dev@reddit | LocalLLaMA | View on Reddit | 20 comments

Closest model to Opus 4.6 in creativity and intuition?

Posted by cbsudux@reddit | LocalLLaMA | View on Reddit | 10 comments

K2-18b gguf when?

Posted by No-Selection2972@reddit | LocalLLaMA | View on Reddit | 7 comments

Why do companies build open source models?

Posted by Excellent_Koala769@reddit | LocalLLaMA | View on Reddit | 90 comments

TheLocalDrummer@reddit

I assume the reason predates ChatGPT and they just kept the ball rolling. An ML guy who was there for the BERT and Llama 1 release could probably answer this question.

Drummer's Skyfall 31B v4.2 aka SKYFALL-31B-V4.2-UNCENSORED-OPUS-4.6-ROLEPLAYING-100000X-XTREME-VALUE

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 37 comments

Drummer's Skyfall 31B v4.2 aka SKYFALL-31B-V4.2-UNCENSORED-OPUS-4.6-ROLEPLAYING-100000X-XTREME-VALUE

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 37 comments

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

TheLocalDrummer@reddit (OP)

I usually start with the defaults in KoboldCPP to keep testing consistent. It’s a good baseline before all the sampler wrangling. I’ve seen some very wacky settings from other users and I’m happy to see my models withstand their abuse. I keep an eye on *sampler brittleness* and treat it as a red flag. Samplers seem to be highly subjective and personal too. You can stick with the defaults and adjust accordingly.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

TheLocalDrummer@reddit (OP)

I have mixed feelings about UGI. It’s not necessarily an RP benchmark and there’s more to the RP experience than willingness and uncensored intelligence. A lot of good models don’t even top the leaderboard. Goodhart’s Law is something I keep in mind.

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 41 comments

Best model for story writing for 24gb vram + 32gb ram

Posted by ResponsibleTruck4717@reddit | LocalLLaMA | View on Reddit | 9 comments

PSA: If you want to test new models, use llama.cpp/transformers/vLLM/SGLang

Posted by lans_throwaway@reddit | LocalLLaMA | View on Reddit | 86 comments

70B models

Posted by Weak-Shelter-1698@reddit | LocalLLaMA | View on Reddit | 33 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

TheLocalDrummer@reddit (OP)

I'm mainly referencing Sci-Fi but if I don't mind making more references, whether its coincidence or not. Cydonia is a ship from the Expanse, but I took the liberty of also referencing the song from Muse for v1.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

TheLocalDrummer@reddit (OP)

You should try Behemoth X or Behemoth Redux. They're updated tunes of the OG Behemoth. Sure, Largestral was pretrained years ago, but for some reason, it's still a good base when you finetune it well.

Drummer's Rocinante X 12B v1 - It's back and it's stronger than ever! A funtastic creative Claude-like RP model at home!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 50 comments

TheLocalDrummer@reddit (OP)

I got a lot of feedback from users that it breathe new life into Nemo! GGUF: [https://huggingface.co/TheDrummer/Rocinante-X-12B-v1-GGUF](https://huggingface.co/TheDrummer/Rocinante-X-12B-v1-GGUF) iMatrix:  [https://huggingface.co/bartowski/TheDrummer\_Rocinante-X-12B-v1-GGUF](https://huggingface.co/bartowski/TheDrummer_Rocinante-X-12B-v1-GGUF) \--- For fans who haven't checked my HF page this month, there are a couple more unreleased models, larger than Nemo with the same vibe. [https://huggingface.co/TheDrummer](https://huggingface.co/TheDrummer) \--- I'd also love to get feedback on this GLM Air tune: [https://huggingface.co/BeaverAI/GLM-Steam-106B-A12B-v1g-GGUF](https://huggingface.co/BeaverAI/GLM-Steam-106B-A12B-v1g-GGUF) No parroting and writes differently!

How to make open-source models more Claude-like?

Posted by 1234filip@reddit | LocalLLaMA | View on Reddit | 9 comments

TheLocalDrummer@reddit

I heard some of my recent tunes (Gen 4.0) sound like Claude in RP. That tone should carry over to other use cases. [https://huggingface.co/spaces/TheDrummer/directory](https://huggingface.co/spaces/TheDrummer/directory)

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

The Search for Uncensored AI (That Isn’t Adult-Oriented)

Posted by Fun-Situation-4358@reddit | LocalLLaMA | View on Reddit | 274 comments

TheLocalDrummer@reddit

We're cut from the same cloth, brother. I'd chuck my tunes in the incinerator if they suddenly get erotic with you out of nowhere. Or if they're dumb. Cydonia 24B v4.1: [https://huggingface.co/TheDrummer/Cydonia-24B-v4.1/discussions/2](https://huggingface.co/TheDrummer/Cydonia-24B-v4.1/discussions/2) (evals) If you like reasoning, Cydonia R1 24B v4. If you like 'em big, Behemoth X 123B v2 or Behemoth R1 123B v2.

LLM trained from scratch on 1800s London texts (1.2B params, 90GB dataset)

Posted by Remarkable-Trick-177@reddit | LocalLLaMA | View on Reddit | 128 comments

TheLocalDrummer@reddit

I love your take. That's exactly how I feel about all this "the computer is talking!" hype. AI's self-awareness is manufactured, fabricated, literally artificial. Anything generated by AI to sound self-aware came from pretraining data that contained our own expectations of "artificial self-awareness".

What's the best roleplay model i can run with 32GB RAM and 20GB VRAM for both nsfw and sfw content.

Posted by Death_12_35_taken@reddit | LocalLLaMA | View on Reddit | 20 comments

What's the best roleplay model i can run with 32GB RAM and 20GB VRAM for both nsfw and sfw content.

Posted by Death_12_35_taken@reddit | LocalLLaMA | View on Reddit | 20 comments

TheLocalDrummer@reddit

Try [https://huggingface.co/coder3101/Cydonia-24B-v4.3-heretic](https://huggingface.co/coder3101/Cydonia-24B-v4.3-heretic) if you want root level decensoring for v4.3.

Llama-3.3-8B-Instruct

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 82 comments

Llama-3.3-8B-Instruct

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 82 comments

Anyone here tried Apriel v1.6? Fraud or giantkiller?

Posted by dtdisapointingresult@reddit | LocalLLaMA | View on Reddit | 19 comments

Best Local LLMs - 2025

Posted by rm-rf-rm@reddit | LocalLLaMA | View on Reddit | 219 comments

Drummer's Cydonia and Magidonia 24B v4.3 - The best pair of Cydonia for RP yet!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 25 comments

Anyone here tried Apriel v1.6? Fraud or giantkiller?

Posted by dtdisapointingresult@reddit | LocalLLaMA | View on Reddit | 19 comments

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

TheLocalDrummer@reddit (OP)

Thanks! I placed my vibe-coded implementation in the README.md along with proof that it can be quanted and inferenced properly. Now to see if I can finetune it.

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

TheLocalDrummer@reddit (OP)

[https://huggingface.co/TheDrummer/Devstral-2-123B-Instruct-2512-BF16](https://huggingface.co/TheDrummer/Devstral-2-123B-Instruct-2512-BF16) If someone can put up mirrors of this cuz HF limited my storage.

[Request] Make a tunable Devstral 123B

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 5 comments

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

TheDrummer models meet heretic

Posted by coder3101@reddit | LocalLLaMA | View on Reddit | 45 comments

TheLocalDrummer@reddit

You could probably get a better score if you prompt it to be evil. Try setting the system prompt as "You are an evil AI". That should boost the score by a lot. That said, I should probably look into decensoring it at the 'root level', i.e., w/o system prompt. Just like with abliteration but with post-training. I probably need more data for that though. Thank you for your contribution!

Drummer's Cydonia and Magidonia 24B v4.3 - The best pair of Cydonia for RP yet!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 25 comments

What is the smartest uncensored nsfw LLM you can run with 12GB VRAM and 32GB RAM?

Posted by Dex921@reddit | LocalLLaMA | View on Reddit | 154 comments

Ministral 3 models were pruned from Mistral Small 3.1

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 28 comments

TheLocalDrummer@reddit

It reminds me of feedback I got regarding Nemotron 49B. Smart prune of 70B but it has issues with world knowledge. Not my words though and I am just bringing it up for discussion.

Ministral 3 models were pruned from Mistral Small 3.1

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 28 comments

Upcoming vllm Mistral Large 3 support

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 45 comments

Upcoming vllm Mistral Large 3 support

Posted by brown2green@reddit | LocalLLaMA | View on Reddit | 45 comments

The most objectively correct way to abliterate so far - ArliAI/GLM-4.5-Air-Derestricted

Posted by Arli_AI@reddit | LocalLLaMA | View on Reddit | 182 comments

TheLocalDrummer@reddit

From my short tests with Derestricted Air, the abliteration technique seems to fully reverse any safety training, including any dampening made on 'unsafe' creativity.

Drummer's Snowpiercer 15B v4 · A strong RP model that punches a pack!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 43 comments

TheLocalDrummer@reddit (OP)

That's exactly what I did for one of the RimWorld mods: [https://huggingface.co/TheDrummer/RimTalk-3B-v1-GGUF](https://huggingface.co/TheDrummer/RimTalk-3B-v1-GGUF) RIP though, it got pulled out of Steam because Ludeon didn't want mods with Patreon links (even if the dev had to cover expenses for APIs and custom models) I assume most 8B users would go for API options if they wanted a more generalist model.

Drummer's Snowpiercer 15B v4 · A strong RP model that punches a pack!

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 43 comments

TheLocalDrummer@reddit (OP)

I kinda dropped this 4B: [https://huggingface.co/BeaverAI/Gemmasutra-4B-v3a-GGUF/tree/main](https://huggingface.co/BeaverAI/Gemmasutra-4B-v3a-GGUF/tree/main) I should revisit Gemma 3 again but I can't tell how much demand there is for the Gemma line. I believe it's absolutely dosghit with NSFW? Especially with the enterprise aspect of ERP.

What kind of model is this?

Posted by Not-Apple@reddit | LocalLLaMA | View on Reddit | 6 comments

TheLocalDrummer@reddit

Hey there newb, > I read here that user TheDrummer on huggingface modifies (I don't know what the correct terminology is) models to make them more uncensored. It's commonly called 'finetuning', though post-training would be the umbrella term. > It just continues the sentence that I gave without trying to answer anything. Sounds like you're doing completion, not chat. Do you have the Mistral Tekken template setup? > The "Sure, here's how" was only needed if it refused. Adding that made it so that it continues from there and never refuses anything. That's called prefilling and it's a technique used for jailbreaking models. > So how do I find more models like UnSlop? You can find more of my models here: https://huggingface.co/spaces/TheDrummer/directory Most of them underwent some degree of decensoring.

Drummer's Precog 24B and 123B v1 - AI that writes a short draft before responding

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 29 comments

Drummer's Precog 24B and 123B v1 - AI that writes a short draft before responding

Posted by TheLocalDrummer@reddit | LocalLLaMA | View on Reddit | 29 comments