Back2Game_8888
-
Is it worth fine-tuning on a 'yes' or 'no' problem with GRPO?
Posted by Back2Game_8888@reddit | LocalLLaMA | View on Reddit | 6 comments
-
GRPO for answering 'yes' or 'no' problems
Posted by Back2Game_8888@reddit | LocalLLaMA | View on Reddit | 1 comment