Gryphe/Pantheon-Reasoning-27B · Hugging Face

Posted by jacek2023@reddit | LocalLLaMA | View on Reddit | 18 comments

from Gryphe:

An experiment in bringing reasoning capability to the Pantheon roleplay series in the form of an uncensored dense Qwen 3.6 27B. This specific model can be thought of as a successor to both the Pantheon series and the one-time Codex release since I used such a large variety of data this time around.

Yet another theory being tested this time around: take the data that Pantheon is built on, pair it with full thinking traces, and let the model reason its way through character work — weighing tone, planning narrative beats, considering how a character would actually respond before committing to a line. Whether that meaningfully improves roleplay quality over a non-reasoning model is a question you'll hopefully be able to help me answer.

GGUF quants are available here.

Model details

Base model is llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved, and from what I can tell this worked out very, very nicely in regards to refusal reduction and writing capabilities.

I considered Gemma 4 31B but that model has been an absolute pain to train. Something something special snowflake architectures. (grumble, grumble)

All training sources include full reasoning traces, with thinking active across every assistant turn:

The model was trained with preserve_thinking: true, so thinking tags remain active across all assistant turns in multi-turn conversations, not just the first.