TheaterFire

Putting together all the AI-powered web search software we know of

Posted by Felladrin@reddit | LocalLLaMA | View on Reddit | 72 comments

Putting together all the AI-powered web search software we know of

Reply to Post

72 Comments

Felladrin@reddit (OP)

Started listing [here](https://github.com/felladrin/awesome-ai-web-search) all the AI-powered web search software I was aware of. Besides being useful for users looking for alternatives to existing software, having a timeline helps to see how the space evolves. Please join the effort by adding any other software you know of. You can do so by [editing the readme file](https://github.com/felladrin/awesome-ai-web-search/blob/main/readme.md), [opening an issue](https://github.com/felladrin/awesome-ai-web-search/issues), or commenting directly in this post.
View on Reddit #40257072

FesseJerguson@reddit

Nice someone please have Claude write a swarm agent like system that utilizes them all and consolidates it into once concise report
View on Reddit #40257865

sickleRunner@reddit

you can check this DeepSeek R1 api which has access to the internet [https://deadlock.up.nadles.com/checkout/xMzXXxEGYKP988XQi727eg](https://deadlock.up.nadles.com/checkout/xMzXXxEGYKP988XQi727eg)
View on Reddit #47677745

FesseJerguson@reddit

Oh and I wanna be able to use ollama for most of the agents but put it all together with Claude
View on Reddit #40257931

visionsmemories@reddit

problem is this seems really good on paper, but as for actual applications - there isnt an immediate advantage you get from it; so you just decide not to make it yourself and so do almost everyone else
View on Reddit #40272102

adrenoceptor@reddit

Poe (iOS and OS X and Web) has a “Web-Search” official bot
View on Reddit #40325343

Felladrin@reddit (OP)

Nice! Will add it to the list!
View on Reddit #40339314

Affectionate-Hat-536@reddit

Very useful, I am in process of writing a blog for options around this. 🙏
View on Reddit #40270457

muxxington@reddit

I will read it. Link?
View on Reddit #40276067

Affectionate-Hat-536@reddit

Not completed yet, will surely share once published.
View on Reddit #40327831

Away_Art850@reddit

Super excited to see what you've written!
View on Reddit #40331332

KTibow@reddit

Does Exa count?
View on Reddit #40285874

Felladrin@reddit (OP)

It does. When you [search there](https://exa.ai/search) and click "Show more info" in any search result, it generates a summary, relating your query to the link, using an LLM.
View on Reddit #40296537

KTibow@reddit

Nvm I thought you were missing Exa at first, turns out you had it from the start and I just missed it
View on Reddit #40298872

Enough-Meringue4745@reddit

lists are cool but largely useless
View on Reddit #40295747

Shir_man@reddit

Btw, has anyone seen a framework or agent that can read a CSV file, web search for information based on each table value (including calling external APIs), and then write the search results in a specific format?
View on Reddit #40259722

Chemical_Ad1778@reddit

Working on a small side project that does a variation of this if you can share your specific use case via DM I might be able to help.
View on Reddit #42732844

SnailsArentReal@reddit

You could use dify.ai to do that. It's an open source tool for building genAI powered workflows.
View on Reddit #40285379

Shir_man@reddit

Thank you, I will check it out
View on Reddit #40287875

Affectionate-Hat-536@reddit

Check phidata agent framework saw tools covering most of your ask.
View on Reddit #40275919

Felladrin@reddit (OP)

Good question! None currently in the list seems to be capable of that, but I remember I saw someone sharing here on LocalLlama a formula for Google Spreadsheet that allows querying an LLM for each line of the imported CSV file. This could be a starting point for researching.
View on Reddit #40275426

nightkall@reddit

Here are some more: \- [https://monica.so](https://monica.so) \- [https://search.brave.com](https://search.brave.com) \- [https://kagi.com/fastgpt](https://kagi.com/fastgpt)
View on Reddit #40706564

Felladrin@reddit (OP)

Great additions! I just noticed you've already opened a PR to add them! Will look into it now. See you there!
View on Reddit #40718399

Lost_midia@reddit

Can I run an llma model on an orange pi win A64?
View on Reddit #40279915

Fusseldieb@reddit

Maybe extremely small ones like 1B or whatnot, but they're mostly "useless", unless it's something extremely straightforward or finetuned.
View on Reddit #40325199

Lost_midia@reddit

I thought about making a RAG with some Java documentation so it would be specific to solving problems in Java. Would it work? There are 512Mb of RAM
View on Reddit #40388995

Fusseldieb@reddit

I think it needs some real-world knowledge too, so it can "understand" what you say.
View on Reddit #40396982

Lost_midia@reddit

Yes. Thanks :)
View on Reddit #40401834

TheRealMasonMac@reddit

There is Kagi
View on Reddit #40261427

Everlier@reddit

Came here to mention it as well. Kagi Assistant is the one most useful sub I have.
View on Reddit #40270068

AIposting@reddit

I love Kagi, feels like web searches from a decade ago (in a good way). Shame you have to buy API credits separately if you wanted to hook up your own agent to a local LLM, but I guess a little webscraping would solve that easily enough.
View on Reddit #40281564

TheRealMasonMac@reddit

[https://kagifeedback.org/d/1624-free-api-allotment-for-subscribers](https://kagifeedback.org/d/1624-free-api-allotment-for-subscribers) \> Mostly because any sort of automated use would probably propel the costs for us to the skies, and we are already on razor thin margins. So this is why we ask users to pay for additional scripted usage via the API.
View on Reddit #40314961

Everlier@reddit

I'm not sure how you guys have any margins at all with no limits at the higher plans. I'm sure I've spent more your money on sonnet 3.5 alone than I paid you, even including prior to when assistant was introduced.
View on Reddit #40396799

TheRealMasonMac@reddit

I'm not an employee, idk. But I know they had to send some emails to high-use users about it politely asking them to reduce their usage.
View on Reddit #40401253

AIposting@reddit

Very understandable, thanks for clarifying. It's incredible how much you've been able to accomplish so far, I'd be very sad if I had to go back to Google products if anything happened to Kagi and Proton.
View on Reddit #40378907

Felladrin@reddit (OP)

Thank you both! I’ve just found [the official post about it](https://blog.kagi.com/announcing-assistant). Will at it to the list on the next update today.
View on Reddit #40275609

Revolutionary_War984@reddit

+1 👌🏼
View on Reddit #40370653

flashmoregash@reddit

https://thegigabrain.com/ GigaBrain scans billions of discussions on reddit and other online communities to find the most useful posts and comments for you
View on Reddit #40273860

MoffKalast@reddit

> "Get real answers. From real people." That sounds suspiciously like it's actually fake answers from fake people.
View on Reddit #40346782

Affectionate-Hat-536@reddit

This is good one! I’m
View on Reddit #40306812

Felladrin@reddit (OP)

Looks great! Will add it on the next update today. Thanks for sharing!
View on Reddit #40275883

saintshing@reddit

Reddit should just acquire them.
View on Reddit #40275418

WesternTall3929@reddit

Oh man, it’s going down, this is exactly the type of data I need
View on Reddit #40339527

abellimz@reddit

[you.com](https://you.com)
View on Reddit #40339096

Felladrin@reddit (OP)

Thanks! This one is already on the list! \[[Reference](https://github.com/felladrin/awesome-ai-web-search?tab=readme-ov-file#closed-source)\]
View on Reddit #40339206

jrhizor@reddit

Any particular high performers out of all these options?
View on Reddit #40268082

Square-Intention465@reddit

try pravah
View on Reddit #40337767

CrzyFlky@reddit

Among closed source: perplexity, exa, and gigabrain; and if u can pay, its Kagi.
View on Reddit #40284700

Enough-Meringue4745@reddit

localllama
View on Reddit #40295782

CrzyFlky@reddit

If you open the site, you will see lists of both. someone else can benchmark open models. \- humble GPU poor guy
View on Reddit #40332318

trenchgun@reddit

Does any of them offer a feature where you just get the best result, filtered by the LLM?
View on Reddit #40280539

Felladrin@reddit (OP)

Hey, u/trenchgun! You asked me about it before, but [my answer](https://www.reddit.com/r/LocalLLaMA/comments/1g9d9jr/comment/lu9d2v4/) is still the same, unfortunately.
View on Reddit #40297419

trenchgun@reddit

See here: https://x.com/VictorTaelin/status/1844198211130691766 But yeah: > - not deployed yet, probably won't (expensive af) https://x.com/VictorTaelin/status/1844174273948025176
View on Reddit #40302167

Felladrin@reddit (OP)

Great finding! Looks like a project from u/SrPeixinho. Maybe he could consider selling the project?
View on Reddit #40304396

trenchgun@reddit

I think the issue is that it is prohibitevely expensive.
View on Reddit #40305429

trenchgun@reddit

Ah I did not realize you are you. But this result is very interesting.
View on Reddit #40299986

Enti9@reddit

You.com
View on Reddit #40282537

Felladrin@reddit (OP)

Thanks! This one is already on the list! \[[Reference](https://github.com/felladrin/awesome-ai-web-search?tab=readme-ov-file#closed-source)\]
View on Reddit #40297022

Fleshybum@reddit

So which ones can be run locally?
View on Reddit #40290930

JungianJester@reddit

Thanks for your research work. I have been using Perplexica for a few months, prior to that it was searXNG inside open webui which is adequate for most needs. Anyway there are about a dozen of programs newer than Perplexica, unfortunately there does not appear to be an easy Docker install for most which means people who rely on a Docker Compose method will likely bypass programs which can't easily be containerized.
View on Reddit #40285162

saintshing@reddit

Getliner, felo are pretty good. Getliner: you can see clear breakdown of the query into subqueries, filter by time, exclude individual sources, get summary of each source, use scholarly sources only, etc Felo: similar to getliner, has less filters but has a nice mindmap function There is also webpilot. More basic. But I like how it gives a short summary of the answer and then goes in depth to elaborate on each key point.
View on Reddit #40276200

Felladrin@reddit (OP)

Thank you! I've just gathered some info about them and will add all three to the list soon!
View on Reddit #40278763

GreedyWorking1499@reddit

Do you have any plans to add things like benchmarks?
View on Reddit #40267108

Felladrin@reddit (OP)

Unfortunately, I don’t plan to do it. Web searching is a very personal experience. I can only recommend users to visit and read about each tool listed there, then, if there’s any particular feature they want on their current web searching platform, that they request it to the developers. This will indirectly make the web-searching space better, as one tool influences the other.
View on Reddit #40276610

muxxington@reddit

Thanks. I will work through this list. One question: Does one of these programs offer an API which can then be used with tools e.g. from Open-WebUI?
View on Reddit #40274117

Felladrin@reddit (OP)

Not that I know of. But I also don’t think it’s necessary, as Open WebUI already supports connecting search engines to the chat, including SearXNG, which is the metasearch engine most used by the open source tools listed there. Was there any specific feature you found in one of them that is not available in Open WebUI?
View on Reddit #40276181

deadlydogfart@reddit

Phind is a good one
View on Reddit #40271302

Felladrin@reddit (OP)

Oh yes! Phind! Well remembered! Will add it on the next update today. Thanks!
View on Reddit #40275762

Dalong_pub@reddit

Groquelle
View on Reddit #40258516

Felladrin@reddit (OP)

Could you share a link to it?
View on Reddit #40275292

ComprehensiveQuail77@reddit

Is it better than perplexity?
View on Reddit #40272128

visionsmemories@reddit

great, now benchmark them
View on Reddit #40271936