Are people testing ensembles of small size reasoning LLM agents (assuming different models) and do they perform well on the same / shared task?

Posted by Mental-At-ThirtyFive@reddit | LocalLLaMA | View on Reddit | 1 comments

I am assuming this is a reasonable step in world of multi-agents, orchestrations and harnesses - is there any references to this type of work being done