Blocking email addresses using profanity filter?

downey · Février 17, 2016, 10:42

Continuing the discussion from Inappropriate / Obscenity / Profanity Language Filter:

So the profanity filter works well … not that we see it used often.

But is there a way to add regex/patterns so it could block people from putting an email address in a post?

codinghorror · Février 18, 2016, 12:37

That would be extremely dangerous, though.

downey · Février 18, 2016, 1:31

“Extremely dangerous” is a strong phrase. Can you say more about why you feel that way?

codinghorror · Février 18, 2016, 1:44

Etc etc etc

Mittineague · Février 18, 2016, 1:50

Regex is like a language unto itself. Even a lot of seasoned programmers have trouble with it.

Using it requires not only understanding every possible variation you want to match, but also every possible variation you want to not match.

A lot of people do fairy well with the first, but fail with the second.

For example, using (.)* matches everything, anything, and nothing.
I see it used way too often as a “short cut” to get things to match, but unfortunately it often results in matching what it shouldn’t.

I guess if it were under the “developers only” section it might be enough to scare off Admins that shouldn’t mess with it. But human nature being what it is, give out loaded guns and it’s only a matter of time before someone shoots themselves in their foot.

And as for a valid email regex, it is notoriously difficult to craft a fool-proof one. Many come close and are “good enough” but without additional processing there will likely be problems at some point.

downey · Février 18, 2016, 1:51

Fair enough, but the filter already exists in Discourse, even if you aren’t using it personally. Also, keep in mind (in response to your blog post) that the Discourse filter doesn’t replace strings, it masks them with squares.

Regexes are inherently difficult, so I’m not necessarily proposing that you ask everyday users to use them as the mainstream use case. The current system works fine for most cases, but there’s no way to surefire way prevent people from posting most common email addresses. (I am not interested in the debate on the “perfect” email regex.)

Meanwhile, I’m simply blocking some of the most common domain names like @gmail.com, @yahoo.com, etc.

I am not interested in letting perfect get in the way of good here. Just trying to prevent the most common occurrences.

Mittineague · Février 18, 2016, 1:54

Actually, last I knew it replaces the characters with the box decimal value

eg. blocking “@gmail.com”, “someone@gmail.com” would look like

someone■■■■■■■■■■

and the source would be

someone&#9632;&#9632;&#9632;&#9632;&#9632;&#9632;&#9632;&#9632;&#9632;&#9632;

downey · Février 18, 2016, 1:56

That is what I meant when I finished the sentence with:

Forgive my error of specificity. What I meant was that it doesn’t replace it with other letters to change the word, as described in the blog post above.

outofthebox · Septembre 24, 2019, 12:11

Bonjour,

Je souhaite donc empêcher les utilisateurs de partager leur adresse e-mail dans les discussions publiques de notre communauté, afin de protéger leur vie privée (peut-être que certaines personnes, malgré nos meilleurs efforts, ne réalisent pas que les discussions sont publiques ?).

Cette approche est-elle appropriée ? Ou présente-t-elle des risques importants que je n’aurais pas pris en compte ?

*@*.com
*@*.org
*@*.net
*@*.edu
*@*.info
*@*.biz

codinghorror · Septembre 24, 2019, 12:33

Toutes les expressions régulières comportent un risque élevé ; plus elles sont larges, plus le risque est grand. Celles-ci sont… assez risquées.

outofthebox · Septembre 24, 2019, 2:25

Mon espoir naïf était que le « @ » et l’inclusion des noms de domaine de premier niveau permettent de restreindre la recherche aux adresses e-mail uniquement. N’y a-t-il aucun moyen de cibler ces éléments ?

codinghorror · Septembre 24, 2019, 2:40

Quelque chose comme *.?@gmail.com serait nettement plus sûr. Idéalement, je dirais uniquement des caractères de mots, pas d’astérisque (tous les caractères).

Sujet		Réponses	Vues
Blocking email address with regular expression not working Support	3	627	Avril 16, 2021
Is there any way I can automatically censor email addresses on posts? Support	18	2630	Avril 25, 2023
Blocking spammer.<random>.domain registration attempts Support	11	347	Octobre 8, 2025
Suggestion: Wildcard Block Email Address Feature	32	4693	Décembre 7, 2021
Protect forum against email harvesters bots Support	2	975	Mai 17, 2017

Blocking email addresses using profanity filter?

Sujets connexes