Embeddings warning `input must have less than 8192 tokens` with Discourse AI

If you self-host that same model, it can accept inputs of up to 32k tokens; that is what we run on our own hosting these days.

If that’s out of the question, you need to configure the embeddings model to cap inputs at the maximum your provider allows. With that in place, our AI Bot RAG will split uploaded files into chunks, and Related Topics / Search will embed only the first 8192 tokens of each topic.
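To make the two behaviors concrete, here is a minimal sketch of the difference between chunking (RAG over uploaded files) and truncation (Related Topics / Search). The whitespace-free "token" list and the helper names are illustrative assumptions, not Discourse AI internals; real providers count tokens with their own tokenizer.

```python
# Illustrative sketch only: tokens are just list items here, and the
# function names are hypothetical, not part of Discourse AI's code.

MAX_TOKENS = 8192  # the provider limit from the warning


def chunk_tokens(tokens, limit=MAX_TOKENS):
    """Split a token list into consecutive chunks of at most `limit`
    tokens each, roughly what RAG does with an uploaded file."""
    return [tokens[i:i + limit] for i in range(0, len(tokens), limit)]


def truncate_tokens(tokens, limit=MAX_TOKENS):
    """Keep only the first `limit` tokens, roughly what
    Related Topics / Search does with each topic."""
    return tokens[:limit]


tokens = ["tok"] * 20000              # pretend a long document
chunks = chunk_tokens(tokens)
print(len(chunks))                    # 3 chunks: 8192 + 8192 + 3616
print(len(truncate_tokens(tokens)))   # 8192
```

The practical difference: chunking embeds the whole file across several requests, while truncation simply drops everything past the limit for a single embedding.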