Ask HN: How to Build RAG?

3 points | by punkpeye 13 hours ago ago

2 comments

chewz 13 hours ago ago
RAG is so yesterday.
Upload entire PDF directly[1] to API, don't convert PDF to markdown, don't vectorise.. Put that in API cache [2] and keep asking questions.
Chunking and vector search gives mediocre results [3]. Same with full-time search. Difficult to calibrate when structure of PDF is volatile.
[1] - https://docs.anthropic.com/en/docs/build-with-claude/pdf-sup...
[2] - https://docs.anthropic.com/en/docs/build-with-claude/prompt-...
[3] - This works but for well formated PDFs where you chunk intelligently and extract reasonable metadata.
[-]
- punkpeye 13 hours ago ago
  I am trying to build this independent of APIs provided by specific providers (such as cache), so this isn't an option.