Will RAG Support PDF Files in the Future?

sam · November 13, 2024, 12:11am

JSON is just text so we already support it.

It is an inefficient representation for LLMs given the large amount of duplication within the format, so it will waste a few tokens, but overall it will work. I would recommend running a script over it and reformatting it to improve RAG performance.

It is very hard to do this automatically because JSON can be deeply nested, and picking a good domain-specific text representation depends heavily on the domain.
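
As a rough illustration of the kind of reformatting script meant here, the sketch below flattens nested JSON into one `path: value` line per leaf, which drops the braces, quotes and commas. The function names `flatten` and `json_to_text` are made up for this example and are not part of Discourse AI; a representation tuned to your own data would likely do better.

```python
import json


def flatten(value, prefix=""):
    """Yield (path, leaf) pairs for every scalar in a nested JSON value."""
    if isinstance(value, dict):
        for key, child in value.items():
            # Join nested object keys with dots: product.name
            yield from flatten(child, f"{prefix}.{key}" if prefix else str(key))
    elif isinstance(value, list):
        for index, child in enumerate(value):
            # Mark array positions with brackets: tags[0]
            yield from flatten(child, f"{prefix}[{index}]")
    else:
        yield prefix, value


def json_to_text(raw):
    """Convert a JSON string into plain 'path: value' lines (hypothetical helper)."""
    data = json.loads(raw)
    return "\n".join(f"{path}: {leaf}" for path, leaf in flatten(data))


if __name__ == "__main__":
    sample = '{"product": {"name": "Widget", "tags": ["a", "b"]}, "price": 9.5}'
    print(json_to_text(sample))
```

This is generic on purpose. For real data you would probably pick which fields matter and write them out as short sentences or one record per line, which is the domain-specific part that is hard to automate.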

