Are the AI models trained on English?

Am I right to assume that the model was trained & tested on English language data only?

2 Likes

There are around 20 different models involved in Discourse AI so far, but yes most models are English only. With the exceptions being the Toxicity module that ships with a multilingual module, and the composer Helper module is powered by OpenAI/Anthropic which are multilingual AFAIK.

Also worth saying that I did a case study and found quite a few models with potential for the french language and I’m keen on creating language specific versions of each modules provided there are good open source models available.

6 Likes

I can confirm the AI helper working like a charm on spanish self-hosted instance.

8 Likes