Stanford Researchers Released AgentFlow: Flow-GRPO algorithm. Outperforming 200B GPT-4o with a 7B model! Explore the code & try the demo

Posted by balianone@reddit | LocalLLaMA | View on Reddit | 82 comments