Has anyone seen AI agents working in production at scale?

Posted by madredditscientist@reddit | LocalLLaMA | View on Reddit | 68 comments

It doesn't matter whether you're using Swarm, LangChain, or any other AI agent orchestration framework if the underlying issue is that AI agents are too slow, too expensive, and too unreliable. I wrote about AI agent hype vs. reality a while ago, and I don't think the situation has changed yet.

By combining tightly constrained LLMs, good evaluation data, human-in-the-loop oversight, and traditional engineering methods, we can achieve reliably good results for automating medium-complex tasks.
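To make that combination concrete, here is a rough sketch of one way it could look in code. Everything in it is an assumption for illustration: the action names, the JSON shape, the 0.8 confidence threshold, and the `fake_model` stub (standing in for a real LLM call) are all hypothetical, not taken from any particular framework.

```python
import json
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical closed set of actions the model may choose from (the "tight constraint").
ALLOWED_ACTIONS = {"fill_form", "scrape_page", "enter_record"}


@dataclass
class Decision:
    status: str            # "auto" or "needs_human"
    action: Optional[str]  # the proposed action, if one could be parsed
    reason: str


def constrained_step(task: str, call_model: Callable[[str], str],
                     min_confidence: float = 0.8) -> Decision:
    """Run one model call and accept it only if plain-code checks pass."""
    raw = call_model(task)
    try:
        parsed = json.loads(raw)
    except json.JSONDecodeError:
        return Decision("needs_human", None, "output was not valid JSON")
    if not isinstance(parsed, dict):
        return Decision("needs_human", None, "output was not a JSON object")

    action = parsed.get("action")
    confidence = parsed.get("confidence")
    if not isinstance(action, str) or action not in ALLOWED_ACTIONS:
        return Decision("needs_human", None, "action not in allowed set")
    if not isinstance(confidence, (int, float)) or confidence < min_confidence:
        # Human-in-the-loop: low-confidence results are escalated, not executed.
        return Decision("needs_human", action, "confidence below threshold")
    return Decision("auto", action, "passed all checks")


def evaluate(cases, call_model):
    """Score labeled (task, expected_action) pairs: how much is automated, and how well."""
    auto = correct = 0
    for task, expected in cases:
        decision = constrained_step(task, call_model)
        if decision.status == "auto":
            auto += 1
            correct += decision.action == expected
    return {
        "auto_rate": auto / len(cases),
        "auto_accuracy": correct / auto if auto else 0.0,
    }


def fake_model(task: str) -> str:
    """Stand-in for a real LLM call; returns canned responses for the demo."""
    canned = {
        "submit signup form": '{"action": "fill_form", "confidence": 0.95}',
        "book my vacation": '{"action": "book_trip", "confidence": 0.99}',
        "copy invoice rows": '{"action": "enter_record", "confidence": 0.4}',
    }
    return canned.get(task, "not json")


if __name__ == "__main__":
    for task in ("submit signup form", "book my vacation", "copy invoice rows"):
        print(task, "->", constrained_step(task, fake_model))
```

The point of the design is that the model only proposes, and ordinary code decides: anything outside the allowed set, unparseable, or low-confidence goes to a person instead of being executed, and `evaluate` gives you a number to track against labeled data before trusting the automated path.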

Will AI agents automate tedious repetitive work, such as web scraping, form filling, and data entry? Yes, absolutely.

Will AI agents autonomously book your vacation without your intervention? Unlikely, at least in the near future.

What are your real-world use cases and experiences?