Hacker News

RAG Is Laughably Simple

by josvdwest on 8/20/2024, 11:05:19 PM with 2 comments

by PaulHoule on 8/20/2024, 11:28:02 PM
It’s not too different from you yourself searching for a few documents and reading them before formulating an answer.
by curious_curios on 8/20/2024, 11:47:15 PM
You’re glossing over a lot of details here, which is where most of the pain is.
That means properly chunking the data, handling non-standard text formatting in source documents, dealing with source documents that haven’t even been OCR’d, keeping disparate indexes available per client, minimizing hallucinations even with proper context data, and more.
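
To make the chunking point a bit more concrete, here is a minimal sketch of one common baseline: fixed-size word windows with overlap. The function name `chunk_words` and the `size`/`overlap` defaults are made up for illustration and are not from any particular library. Real pipelines usually also need to respect sentence, heading, or table boundaries, which is part of where the pain comes from.

```python
def chunk_words(text: str, size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into word windows of `size` words, each sharing
    `overlap` words with the previous window (hypothetical helper)."""
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("need size > 0 and 0 <= overlap < size")

    words = text.split()      # naive whitespace tokenization (an assumption)
    step = size - overlap     # how far each new window advances
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break             # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    sample = " ".join(str(i) for i in range(10))
    for chunk in chunk_words(sample, size=4, overlap=1):
        print(chunk)
```

The overlap is there so a sentence cut at a window boundary still appears whole in at least one chunk; picking `size` and `overlap` is a tuning problem this sketch does not try to solve.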