AI flagging too sensitive

Shauny · March 4, 2024, 11:20pm

I can’t find any specific information about this.

When auto-flagging toxic posts (something I want, because I want my community to be respectful and safe), it is being far too sensitive: it flagged a post where someone called a TV episode “silly”.

It was set at 80. I’ve just increased it to 90. But there’s no documentation on what the maximum is (I imagine that’s 100). What precisely do I need to set this to so that it doesn’t produce super simple false positives but still protects my community?

Thanks!

Falco · March 4, 2024, 11:54pm

Hey @Shauny, we have received reports that the current toxicity module provider in the Discourse AI plugin is indeed too sensitive for most communities.

To address that issue, we just made it possible to use the new large language models as classifiers for flagging content, which will let community managers tune the sensitivity of the flagging to a level appropriate for their communities.

We don’t have a guide for this brand new feature yet, as we are working on it, but I will ping you here when we have one to share.

Saif · March 31, 2024, 10:48am

We just published a guide on detecting spam. The prompts could be customized with Enable AI Bot help to look for sensitive content!
