Microsoft stealth releases both “Magentic-One”: An Open Source Generalist Multi-Agent System for Solving Complex tasks, and AutogenBench
Posted by Porespellar@reddit | LocalLLaMA | View on Reddit | 28 comments
Had no idea these were even being developed. Found both while searching for news on Autogen Studio. The Magentic-One project looks fascinating. Seems to build on top of Autgen. It seems to add quite a lot of capabilities. Didn’t see any other posts regarding these two releases yet so I thought I would post.
foldl-li@reddit
Interesting. But, is GraphRAG widely adopted or not?
psilent@reddit
“More worryingly, in a handful of cases — and until prompted otherwise — the agents occasionally attempted to recruit other humans for help (e.g., by posting to social media, emailing textbook authors, or, in one case, drafting a freedom of information request to a government entity).”
There you go, just ask on social media how to log in to a server
Porespellar@reddit (OP)
That’s friggin hilarious!! It thinks it’s people. I can see why they waited until post-election to release this and pretty much released it without any fanfare.
cyan2k@reddit
?? What are you talking about.... I'm playing with it since a couple of weeks. The branch is three months old
Real_Pareak@reddit
>you guys are hallucinating like mini phi 3.5 in a two bit quant
That's the most LLM-nerdy insult I have ever heard, lol
wavinghandco@reddit
"November 4, 2024"
Porespellar@reddit (OP)
Yeah, that’s when the article was written. A day before the election, but all the mail in voting had already occurred and I don’t know that they actually posted the blog entry until today. Guess I could check the wayback machine. Regardless, this was just kind of put out there without a whole lot of press. The fact that I’m the first to post it here after it’s supposedly been out for two days should tell you all you need to know.
JohnnyLovesData@reddit
Relevant XKCD ? Zealous Autoconfig
inconspiciousdude@reddit
There really is one for everything :/
I can see it becoming a bible of sorts in a post-apocalyptic world.
Jazzlike_Tooth929@reddit
mind blowing
throwawayPzaFm@reddit
That's... kinda awesome.
arjunainfinity@reddit
Nice, here’s an opensource multi-agent studio with telephone features as well https://github.com/NidumAI-Inc/agent-studio
Alexian_Theory@reddit
I’ve played with it for a while last week, I found it by chance looking for something similar to the Websurfer agent for the new core 0.4 dev release. the approach to web browsing is interesting. It takes snapshots of the headless browser it is running, passes the image to a vision enabled LLM and then decides how to further proceed to finish the task.
Enough-Meringue4745@reddit
It's the only feasible way given how bloated html is
FaceDeer@reddit
And also possibly to bypass Cloudflare and other such anti-bot mechanisms.
NarrowTea3631@reddit
headless browsers are generally very easy to detect, takes a lot of work to do serious automated stuff with em
Morganross@reddit
is that the worst possible example they could give?
an example is something that a human can relate to, not a fantasy figment of imagination.
Porespellar@reddit (OP)
Only downside it is currently only supporting OpenAI models and not local. How hard is it to make it work with Ollama? Can someone fork it and do this or something?
gentlecucumber@reddit
If it works with OpenAI then it works with local models. Use vLLM instead of Ollama.
Alexian_Theory@reddit
as mentioned the WebSurfer agent requires a multimodal LLM. So there is the problem really, still no multimodal for ollama AFAIK, still waiting on llama 3.2 11b to work, according to some previous posts it should be fun
Alexian_Theory@reddit
lol the timing. Ollama llama3.2 with vision dropped today.
Incompetent_Magician@reddit
It doesn't support Ollama but it does work with Ollama. I'm on MacOS and I use Podman.
_Erilaz@reddit
If it supports ClosedAI API, that isn't an issue at all.
ithkuil@reddit
The diagram makes it look like they defined a new agent for each tool call. Sorry but that doesn't make sense for this example. It's a toy example but that's oversimplified and that makes it confusing as to why they are doing these things.
My framework can do task this with one agent that has all of those types of commands enabled. You also don't need an orchestrator for this example. What you need an orchestrator for is when there is a ton of output and complexity for some of the subtasks that you don't want to burden the other tasks with. I just don't see that much complexity and output in this example.
Shir_man@reddit
How is Magnetic-one different from Autogen?
Enough-Meringue4745@reddit
I believe it IS autogen but its custom agents
HiddenSecretAccount@reddit
my brain exploded, thanks
mythicinfinity@reddit
open source!