A Qwen finetune that feels VERY human
Posted by Sicarius_The_First@reddit | LocalLLaMA | 73 comments
Hello guys,
So TL;DR, I was asked by multiple people to make an Assistant\_Pepe\_32B version, but the best base model contender was Qwen3-32B, a model that is very hard to tune on anything other than STEM.
The concept of Assistant\_Pepe is an assistant without a typical 'assistant brain', infused with a negativity bias to reduce sycophancy. Previous discussions can be found [here](https://www.reddit.com/r/LocalLLaMA/comments/1qppjo4/assistant_pepe_8b_1m_context_zero_slop/) and [here](https://www.reddit.com/r/LocalLLaMA/comments/1qsrscu/can_4chan_data_really_improve_a_model_turns_out/).
I don't wanna bore you too much with a wall of text, because the above discussions truly did a great job, and great ideas and hypotheses were raised there.
I'll conclude with this: this is probably one of the more "human" models out there, which by itself is quite interesting, because it's a Qwen underneath.
More details in the model card:
[https://huggingface.co/SicariusSicariiStuff/Assistant\_Pepe\_32B](https://huggingface.co/SicariusSicariiStuff/Assistant_Pepe_32B)