gpt2-chatbot RLHF (Tone)

Posted by Crazyscientist1024@reddit | LocalLLaMA | View on Reddit | 0 comments

I have recently been testing gpt2-chatbot a lot in the arena since it got back. I let it do lots of creative writing tasks, and imo, I rank it on-par with Opus. gpt2-chatbot has successfully got rid of the annoying GPT-4 nerd tone and gotten a more human / natural one. If GPT-5 releases with a RLHF of Opus and like 10x the performance, Anthropic is basically destroyed. If it still keeps the GPT-4 nerd tone, I would still pay for Opus lots of the time.