Prompt injection for long-context LLMs as an alternative to RAG?

sam · May 23, 2024, 2:58am

Yeah we have truncation logic that depends on the amount of tokens the llm allows, we set the threshold quite high for gemini 1.5 models (at 800k)

It should work, but every interaction can be very expensive.

Overall I have found that limiting context help models stay more focused but long term (2-5 years out) … rag may be pointless and we will just have so many tokens and focus that it does not matter.

Topic		Replies	Views
Engineering a persona to lean on chat history Support ai	8	233	August 11, 2025
Why is my AI forum helper struggling to answer questions? Support ai , ai-bot	4	406	October 15, 2025
Another added context for AI Bot Support ai-bot , ai	1	76	July 4, 2025
RAG capacities of discourse-ai Support ai	7	418	September 19, 2024
Discourse AI Persona, upload support Announcements ai-bot , ai	21	1711	September 11, 2025

Prompt injection for long-context LLMs as an alternative to RAG?

Related topics