Spam account scanner script

tannerfilip · 18 Septiembre, 2019 20:11

I’ll start off by saying, I’m not a great programmer. This is the first “real” tool I’ve written that’s (potentially) useful to people other than me. I’d love any feedback/criticism you have.

I’ve written a Python script that scans through the list of suspect and/or silenced users and lets you delete them if necessary. I ran it over on Mozilla’s Discourse and deleted a few dozen accounts - this was only after I deleted close to a hundred by hand.

There are a few things that seem pretty hacky, especially lines 174 to 191. As I said, I’d appreciate any feedback you might have, and would be happy to answer any questions!

codinghorror · 19 Septiembre, 2019 00:05

Very cool! One thing you’ll want to do is be sure Akismet is enabled, as we recently (within the last 2-3 months) added a feature where the Akismet plugin will scan new user accounts for spammy stuff and flag them for you thanks to @Roman

Yes, completely human spam account signups – accounts that never post once, just set up an account with profile info and walk away forever – is indeed still a problem. The below is even after Akismet checking:

But bear in mind user profiles aren’t indexed at all, and new user profiles have seriously suppressed info… and our Akismet change helps tremendously.

Having a cleanup tool is still needed though!

tannerfilip · 19 Septiembre, 2019 01:29

I didn’t know that! I’ll have to talk to @LeoMcA to see if we want to enable that.

riking · 20 Abril, 2020 18:04

Suspect users are now being sent to the Review Queue, which removed the suspect users list this script was using. As they’re being pushed to manual review, is this needed now?

Mine_Zcash · 26 Agosto, 2025 15:50

¿Ha habido algún progreso en esto?

Nuestra comunidad está experimentando varias inscripciones de cuentas de spam/bots por día que tienen 0 publicaciones leídas, 0 temas vistos, menos de 1 minuto de tiempo de lectura. Sería bueno tener una función de eliminación automática para todas las cuentas con ciertos parámetros seleccionados.

Además, ¿existe la opción de un plugin Captcha o similar para ayudar a filtrar bots?

Canapin · 27 Agosto, 2025 13:52

Si esas cuentas no tienen actividad, son inofensivas. Son invisibles para otros usuarios (incluida una lista pública de usuarios). Y los perfiles de usuario, independientemente de su nivel de confianza, están prohibidos en robots.txt y no son visibles en los motores de búsqueda.

Además, las cuentas inactivas se limpian periódicamente, consulta la configuración Limpiar usuarios inactivos después de días (“Número de días antes de que se elimine un usuario inactivo (nivel de confianza 0 sin publicaciones). Para deshabilitar la limpieza, establécelo en 0”).

Es activado por el trabajo SideKiq CleanUpInactiveUsers.

Jagster · 27 Agosto, 2025 14:54

Eso no prohíbe nada. robots.txt es solo una sugerencia educada, que al mismo tiempo apunta en la dirección correcta.

Mine_Zcash · 27 Agosto, 2025 18:51

Puede que sea inofensivo, pero en el pasado los spammers han utilizado estas cuentas para “envejecer” sus perfiles antes de activarlos, sabiendo que vigilamos las cuentas nuevas. Luego, de repente, una cuenta de hace 3 meses intenta enlazar con cualquier intento de spam o phishing de usuarios.

Personalmente, me gustaría tener mejores herramientas para evitar que se conviertan en un problema en lugar de esperar. También sería útil si tuviéramos herramientas más sólidas para evitar que los bots se registren en primer lugar.

Canapin · 27 Agosto, 2025 19:14

Claro, todavía puede ser un problema a veces. Experimento mucho spam, pero hasta ahora no he visto que cuentas de spam publiquen de repente después de mucho tiempo.

Si publicaran spam, de todos modos serían rápidamente marcadas por otros usuarios.

Y aún puedes reducir drásticamente la duración después de la cual se elimina una cuenta inactiva.

Tema		Respuestas	Vistas
Need assistance with massive amounts of spam Support	14	1740	10 Junio 2023
Dealing with unwanted (and probably spam) accounts via SSO? Feature wordpress , sso , discourseconnect	36	8919	16 Octubre 2022
Lots of Spam New User Registrations? Support	43	5730	30 Abril 2024
Expanding the 'suspect user' filter Feature	5	745	3 Diciembre 2019
Strategies for filtering spam / AI user accounts? Community Building spam	5	315	9 Abril 2025

Spam account scanner script

Temas relacionados