Prompting users to avoid harmful language

We experimented with the Google Perspective API Plugin, but ran into similar issues to those described in the article you linked: it struggled with more nuanced language, and would sometimes flag non-offensive language as offensive… and these false positives can be offensive in their own right!
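For anyone curious what that kind of integration looks like, here's a minimal sketch of scoring a draft reply with the Perspective API and warning above a toxicity threshold. The endpoint and `TOXICITY` attribute are the real Perspective API; the `0.8` threshold is an assumption for illustration, not a value from the plugin, and it's exactly this kind of fixed cutoff that produces the false positives described above:

```python
import requests

API_KEY = "YOUR_API_KEY"  # Perspective API key (placeholder)
URL = f"https://commentanalyzer.googleapis.com/v1alpha1/comments:analyze?key={API_KEY}"

def toxicity_score(text: str) -> float:
    """Ask Perspective for a TOXICITY probability between 0.0 and 1.0."""
    body = {
        "comment": {"text": text},
        "languages": ["en"],
        "requestedAttributes": {"TOXICITY": {}},
    }
    resp = requests.post(URL, json=body, timeout=10)
    resp.raise_for_status()
    return resp.json()["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

# Assumed threshold for this sketch. Sarcasm, friendly banter, and
# reclaimed language can all score high here, triggering false positives.
THRESHOLD = 0.8

if toxicity_score("your draft reply here") > THRESHOLD:
    print("Heads up: this reply may come across as offensive.")
```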

However, Twitter’s early tests ran into some problems. It found its systems and algorithms sometimes struggled to understand the nuance present in many conversations. For example, they couldn’t always differentiate between offensive replies and sarcasm or, sometimes, even friendly banter. They also struggled to account for situations in which language is being reclaimed by underrepresented communities and used in non-harmful ways.
