Hardcore RAG & AI Search resources
Posted by LilDemonApparel@reddit | LocalLLaMA | View on Reddit | 10 comments
Hi everyone,
I’m starting to onboard Enterprise clients and I need to move past the basic tutorials.
I’m posting here because most other subs feel a bit too high-level or news-focused, whereas I know this community is focused on actual engineering.
I need deep-dive resources on production-grade RAG. I'm looking for:
SOTA Papers (ArXiv links welcome)
Advanced Architectures
Engineering Blogs regarding evaluation and scale.
Any must-read links, communities or repos you recommend?
Thanks.
SkyFeistyLlama8@reddit
Try the /rag sub. Don't bother with SOTA, look at the solution accelerator tutorials that Microsoft, Anthropic etc. have.
You have to be extremely careful if you're onboarding enterprise clients and you don't know what you're doing. It's easy to over-promise on a RAG solution and then fall flat on your face.
Data ingest is about 80% of the hard stuff like converting PDFs or images to chunked vector data. The remaining 20% that will turn your hair gray is getting good retrievals across multi-turn conversations.
Good luck kid, you're gonna need it.
NderituPi@reddit
Honestly, stop looking for magic architectural patterns. If you're doing Enterprise RAG, your bottleneck is 100% data ingestion.
Most enterprise PDFs are garbage. Tables are broken, headers are missing, and OCR is hit-or-miss. Before you even look at GraphRAG or HyDE, build a solid ingestion pipeline.
NobleKale@reddit
Stop talking so much damn sense, u/NderituPi, they've clearly already taken a lot of people's money and need a magic bullet immediately.
Like, probably Monday.
NobleKale@reddit
Jesus fucking christ.
Step 1: admit that you're in way too deep
Step 2: admit RAG isn't the magic solution everyone was jerking themselves off about six months ago
Step 3: prepare three envelopes.
holchansg@reddit
check cognee repo.
texasdude11@reddit
Link please?
No_Scheme14@reddit
You can look into these resources:
https://apxml.com/courses/getting-started-rag
https://apxml.com/courses/optimizing-rag-for-production
Mertescielny46@reddit
The Context Engineers Discord: https://discord.gg/Ntd73e2F cracked folks out there!
doxdom@reddit
yep it's a good one!!
macbook86000@reddit
Blogs from zeroentropy (heavy on rerankers tho)