AI exceeds LLM token thresholds randomly and unpredictably

Falco · May 6, 2026, 6:21pm

Are you confusing request tokens with response tokens?

413 means that your request was too large, not your requested response.

To handle that, adjust the Context window setting in the LLM configuration. I’d warn, though, that 8k tokens is far too small nowadays. It will work for some features, but it’s not a setup we exercise much now that LLMs handle context windows of 1M tokens. I can run a 256k context window on my desktop PC using a model that is much better than the one you are using.
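To make the request-versus-response distinction concrete, here is a rough Python sketch of a pre-flight check that budgets the request against a context window while reserving room for the reply. Everything in it is an assumption for illustration: the function names are hypothetical, this is not how Discourse AI counts tokens, and the 4-characters-per-token ratio is only a crude heuristic for English text. A real tokenizer will give different numbers.

```python
# Assumption: roughly 4 characters per token for English text.
# This is a heuristic, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Crude estimate of how many tokens a piece of text uses."""
    if not text:
        return 0
    return max(1, len(text) // CHARS_PER_TOKEN)


def fits_context(prompt: str, context_window: int, max_response_tokens: int) -> bool:
    """True if the estimated request plus the reserved response fits the window."""
    return estimate_tokens(prompt) + max_response_tokens <= context_window


def truncate_to_budget(prompt: str, context_window: int, max_response_tokens: int) -> str:
    """Cut the prompt so the request leaves room for the response."""
    budget_tokens = context_window - max_response_tokens
    if budget_tokens <= 0:
        raise ValueError("response reservation leaves no room for the request")
    return prompt[: budget_tokens * CHARS_PER_TOKEN]


if __name__ == "__main__":
    long_prompt = "a" * 40_000  # about 10k tokens under the heuristic above
    # Too big for an 8k window even before the response is counted.
    print(fits_context(long_prompt, context_window=8_000, max_response_tokens=1_000))
    trimmed = truncate_to_budget(long_prompt, 8_000, 1_000)
    print(fits_context(trimmed, context_window=8_000, max_response_tokens=1_000))
```

The point of the sketch is that lowering the response limit does nothing for an oversized request: the request side has to shrink, or the context window has to grow.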
