Hope Watched words adds support for non-English characters

Noble_Fish · February 14, 2026, 3:31pm

This is a useful moderation tool, but it has poor support for non-English characters, and the presence of non-English characters can even affect the detection of English and numbers. Here, taking the Simplified Chinese word “测试” (Test) as an example, the Watched words list contains three elements: “测试”, “Test”, and “123”. In the test below, none of the three examples triggered Watched words.

I searched within the site and found another similar issue about Censored words: Censored words do not respect word boundaries in non-latin alphabet. It seems that this is a common problem across the entire watch word matching system?

zogstrip · February 16, 2026, 2:45pm

Thanks for the report, this will be fixed by

https://github.com/discourse/discourse/pull/37844

Topic		Replies	Views
Russian characters in Watched Words list are failing to be properly identified Bug watched-words	1	550	February 10, 2021
Watched words: in Persian, content is affected without containing the word Support	6	777	May 9, 2019
Test Watched Words is Broken Bug watched-words	2	528	June 9, 2023
Accented characters cause false postives in Watched Words Bug watched-words	2	475	May 18, 2023
Censored words do not respect word boundaries in non-latin alphabet Bug	8	1553	November 29, 2018

Hope Watched words adds support for non-English characters

Related topics