针对长上下文LLMs的提示注入作为RAG的替代方案？

sam · 2024 年5 月 23 日 02:58

是的，我们有依赖于 LLM 允许的令牌数量的截断逻辑，我们将 Gemini 1.5 模型的阈值设置得很高（800k）。

应该可行，但每次交互的成本可能非常高。

总的来说，我发现限制上下文有助于模型保持更专注，但从长远来看（2-5 年后）……检索增强生成（RAG）可能变得毫无意义，因为我们将拥有如此多的令牌和焦点，以至于它不再重要。

话题		回复	浏览量
Engineering a persona to lean on chat history Support ai	8	233	2025 年8 月 11 日
Why is my AI forum helper struggling to answer questions? Support ai , ai-bot	4	406	2025 年10 月 15 日
Another added context for AI Bot Support ai-bot , ai	1	76	2025 年7 月 4 日
RAG capacities of discourse-ai Support ai	7	418	2024 年9 月 19 日
Discourse AI Persona, upload support Announcements ai-bot , ai	21	1711	2025 年9 月 11 日