דיסקורס AI - דף הגדרות של מודל שפה גדול (LLM)

Discourse · 2 באוגוסט,‏ 2024,‏ 9:28pm

This guide covers the LLM settings page which is part of the Discourse AI plugin.

Required user level: Administrator

The dedicated settings page is designed to have everything related to Large Language Models (LLMs) used for Discourse AI features in one place.

Depending on the Discourse AI feature enabled, an LLM might be needed. Please check each Discourse AI feature to know if an LLM is a pre-requisite.

Features

Add new models, with prepopulated information
Add custom models not mentioned
Configure LLM settings
Allow specific LLM use for AI Bot
- See the AI Bot username
Enable vision support (model dependent)
Configure allowed attachment types
Set up per-group usage quotas
Track input/output token costs
Test
Save settings

Adding LLM connections

Go to Admin → Plugins → AI
Go to the LLMs tab
Add a new connection, pick your model
Add in the API key (depending on the model, you might have more fields to input manually) and save
(Optional) Test your connection to make sure it’s working

Supported LLMs

You can always add a custom option if you don’t see your model listed. Supported models are continually added. Pre-configured models are templates — you can always achieve the same result using “Manual configuration”.

Anthropic

Claude Opus 4.6
Claude Sonnet 4.6
Claude Haiku 4.5

Google

Gemini 3 Pro
Gemini 3 Flash

OpenAI

GPT-5.4
GPT-5 Mini
GPT-5 Nano

Open Router

DeepSeek V3.2
Moonshot Kimi K2.5
xAI Grok 4 Fast
MiniMax M2.5
Z-AI GLM-5
… and many many more

Additionally, hosted customers can use the CDCK Hosted Small LLM pre-configured in the settings page. This is an open-weights LLM hosted by Discourse, ready for use to power AI features.

Configurations fields

You will only see the fields relevant to your selected LLM provider. Please double-check any of the pre-populated fields with the appropriate provider, such as Model name

Core fields:

Display name — the friendly name shown in dropdowns
Model name — the model identifier sent to the API (e.g. claude-sonnet-4-6, gpt-5.2)
Provider — the service hosting the model (e.g. Anthropic, OpenAI, Google, AWS Bedrock, Azure, Open Router, etc.)
URL — the API endpoint URL (not shown for AWS Bedrock)
API Key — configured via the AI Secrets system
Tokenizer
Max prompt tokens — controls prompt trimming to prevent oversized requests
Max output tokens
Input cost / Output cost — cost per million tokens, used for usage tracking
Cached input cost / Cache write cost — for providers that support prompt caching
Vision enabled — enables image understanding (model dependent)
Allowed attachment types — file types the model can process

Provider-specific fields (shown dynamically based on selected provider):

AWS Bedrock: Access Key ID, Role ARN, Region, reasoning/thinking options, Prompt caching
Anthropic: reasoning options, Prompt caching
OpenAI: Organization ID, Reasoning effort, Service tier
Google: Enable thinking, Thinking level
Open Router: Provider order, Provider quantizations

Quotas (available after initial save):

Per-group usage quotas can be configured with max tokens, max usages, and duration

Technical FAQ

What is tokenizer?

The tokenizer translates strings into tokens, which is what a model uses to understand the input.

What number should I use for Max prompt tokens ?

A good rule of thumb is 50% of the model context window, which is the sum of how many tokens you send and how many tokens they generate. If the prompt gets too big, the request will fail. That number is used to trim the prompt and prevent that from happening

Caveats

Sometimes you may not see the model you wanted to use listed. While you can add them manually, we will support popular models as they come out.

Last edited by @sam 2026-03-24T04:55:48Z

Check document
Perform check on document:

qianping_chen · 30 בספטמבר,‏ 2024,‏ 5:16pm

It’s too difficult, I don’t know how to do it at all. I hope to update specific tutorials on various AIs, such as Google login settings.

sam · 1 באוקטובר,‏ 2024,‏ 5:40am

We improved the UI a lot in the past week, can you try it out again?

hameedacpa · 24 בפברואר,‏ 2025,‏ 5:08pm

When Gemini 2.0 will be supported ?

sam · 24 בפברואר,‏ 2025,‏ 9:58pm

נתמך כבר זמן מה.

Joe_F · 11 במרץ,‏ 2025,‏ 1:21pm

נראה שיש לי בעיה שבה אני לא יכול לבחור LLM למרות שיש לי את ה-CKDC מא hosted מאורגן..

האם זה נורמלי?

sam · 12 במרץ,‏ 2025,‏ 12:17am

A lot to unwrap here, which llm are you trying to choose for what?

The CDCK LLMs are only available for very specific features, to see which you need to head to /admin/whats-new on your instance and click “only show experimental features”, you will need to enable them to unlock the CDCK LLM on specific features.

Any LLM you define outside of CDCK LLMs is available to all features.

AquaL1te · 12 במרץ,‏ 2025,‏ 9:13am

Is there also a topic that provides a general rundown of the best cost/quality balance? Or even which LLM can be used for free for a small community and basic functionality? I can dive into the details and play around. But I’m a bit short in terms of time.

For example, I only care about spam detection and a profanity filter. I had this for free, but those plugins are deprecated or soon to be. It would be nice if I can retain this functionality without having to pay for an LLM.

Saif · 12 במרץ,‏ 2025,‏ 7:20pm

We do have this topic, that might be what you are looking for.

AquaL1te · 25 במרץ,‏ 2025,‏ 9:36am

Done! It was indeed pretty easy. But maybe for a non techie it may still be a bit hard to setup. For example, the model name was automatically set in the settings, but wasn’t the correct one. Luckily I recognized the model name in a curl example for Claude on the API page and then it worked

Estimated costs are maybe 30 euro cents per month for spam control (I don’t have a huge forum). So that’s manageable! I’ve set a limit of 5 euros in the API console, just in case.

Saif · 25 במרץ,‏ 2025,‏ 4:16pm

Which one did you pick for Claude? What was the incorrect name shown, and what did you correct it to?

AquaL1te · 26 במרץ,‏ 2025,‏ 9:31am

אני משתמש ב-Claude 3.5, מזהה המודל כברירת מחדל הוא claude-3-5-haiku, אבל הייתי צריך לשנות אותו ל-claude-3-5-haiku-20241022, אחרת קיבלתי שגיאה.

Saif · 26 במרץ,‏ 2025,‏ 3:49pm

Good to note, yeah sometimes there might be a disconnect. The auto-populated info should act as guidance, which tends to work most of the time, but does fall short in certain cases such as yours (given all the different models and provider configs)

I have updated the OP of this guide

jrgong · 11 באפריל,‏ 2025,‏ 11:20am

This model is not listed on 3.4.2 - are those pre-configs only available on 3.5 and I have to add them manually?

Edit: Also what option do I choose for “Tokenizer” when using Grok 3 models?

Falco · 11 באפריל,‏ 2025,‏ 5:15pm

Pre-configs are simply templates, you can get the same end result by using the “Manual configuration”.

I’ve found that the Gemini tokenizer is pretty close the the Grok one, so try that.

CraigW · 24 ביולי,‏ 2025,‏ 10:52pm

האם יש דרך להשתמש ב-IBM WatsonX דרך ניהול התצורה הנוכחי, או שזה ידרוש עבודת פיתוח נוספת מצוות ה-Discourse?

Falco · 24 ביולי,‏ 2025,‏ 11:15pm

האם IBM WatsonX חושף API תואם OpenAI במקרה?

CraigW · 25 ביולי,‏ 2025,‏ 6:22pm

Great question. A quick poke around the docs didn’t tell me much, but the fact that this repository exists suggests that it is not directly compatible: GitHub - aseelert/watsonx-openai-api: Watsonx Openai compatible API · GitHub

AntiMetaman · 5 בספטמבר,‏ 2025,‏ 8:03pm

אילו מהמודלים הללו של שפה גדולים (LLMs) ניתנים לשימוש בחינם למניעת דואר זבל?

עריכה: לא משנה, אני משתמש ב-Gemini Flash 2.5

pfaffman · 6 בספטמבר,‏ 2025,‏ 7:51pm

I always wonder too. This seems like the best answer to that question.

But also, there is this in the OP from the Spam config topic. I think it’s just a little hard to find in all of the information that’s there.

נושא		תגובות	צפיות
Simplified Large Language Model (LLM) configurations for Discourse AI Announcements ai	1	354	9 באוגוסט,‏ 2024
What LLM to use for Discourse AI? Site Management how-to , ai	0	879	23 בינואר,‏ 2025
Feature request: improve Discourse AI LLM setup (model discovery) and add AI config import/export Feature ai	0	82	26 בינואר,‏ 2026
Configure API Keys for Anthropic Integrations how-to , ai	0	1555	3 באוקטובר,‏ 2023
Can't choose default LLM model Support ai	2	166	16 בנובמבר,‏ 2025