בוט ספאם של AI טוען שהוא לא ספאם, אך יומן הסריקה מצביע על כך שהוא כן ספאם

J-Ha_Hasegawa · 20 באוגוסט,‏ 2025,‏ 12:20am

I’ve enabled the Discourse AI spam handling on our forum. I’ve set up Claude Sonnet 4 with an API key and selected the Spam detector persona.

I did a test post that is clearly spam. Nothing subtle about it.

It was not blocked and was posted immediately.

When I gave the post URL to the spam bot using the test feature, the result says Not spam, but in the Scan log it says: SPAM - This is a clear promotional advertisement…

My expectation would be that the result would be SPAM, matching the Scan log declaration of SPAM. And that this would then queue up the post for review by admins and moderators, for example.

Might anyone be able to share what I’m missing? I’m no expert – so am open to any guidance!

Thank you!

Roman · 20 באוגוסט,‏ 2025,‏ 12:34am

מהי רמת האמון של המשתמש שפרסם? ה-AI Spam ידלג על פוסטים ממשתמשי TL2 ומעלה.

J-Ha_Hasegawa · 20 באוגוסט,‏ 2025,‏ 12:48am

תודה על תשובתך!

המשתמש שבאמצעותו פרסמתי הוא משתמש חדש ברמת אמון

יש לך מחשבות מדוע הפוסט עבר?

אני מעריך את עזרתך!

Roman · 20 באוגוסט,‏ 2025,‏ 5:09pm

This will fix both the test and the post not getting flagged:

The Spam detector Persona system prompt was confusing Claude models. The change makes the expected response format instructions more explicit.

J-Ha_Hasegawa · 22 באוגוסט,‏ 2025,‏ 1:33am

Ah, fantastic! The test feature is working as expected.

I am wondering if you might be able to help with why the AI Spam feature is still not blocking a spammy post from being immediately posted? I sent the post to the AI Spam test and it is flagging it as spam - but it was posted.

Am I missing a connecting piece perhaps? Thank you so much for your help with this!

Jagster · 22 באוגוסט,‏ 2025,‏ 5:05am

האם אתה מנהל מערכת, או TL בכיר יותר? אם כן, אז אולי תנסה להשתמש במשתמש בדיקה של TL נמוך יותר.

Roman · 22 באוגוסט,‏ 2025,‏ 1:25pm

אנו מדלגים על פוסט כאשר:

רמת האמון של הכותב גבוהה מ-TL1.
הפוסט שייך לנושא של הודעה פרטית.
הכותב הוא בוט.
הכותב הוא צוות (מנהל/אדמין).
הכותב כבר כתב יותר מ-3 פוסטים בנושאים רגילים (לא הודעות פרטיות).
הפוסט כבר נסרק 3 פעמים או יותר.

אם הבדיקה עובדת, אני בטוח שזה חייב להיות בגלל אחת מהסיבות הנ"ל.

J-Ha_Hasegawa · 22 באוגוסט,‏ 2025,‏ 3:29pm

Ahhh yes! Thank you for your patient and helpful replies!

I posted with my admin user instead of my trust level 0 user.

It’s working! I love the way the discourse_ai_spam user shows up as the user who flagged and unlisted the post.

Thank you again for your quick and generous help with this!

נושא		תגובות	צפיות
Discourse AI - Spam detection Site Management moderation , spam , how-to , ai	32	3519	10 במרץ,‏ 2026
AI powered Spam detection Announcements spam , ai	11	997	11 בינואר,‏ 2025
Are you experiencing AI based spam? Community Building ai	23	1938	19 בינואר,‏ 2025
Discourse AI spam detection "Scan log" is frequently truncated Bug ai	2	111	22 בדצמבר,‏ 2025
Setting up spam detection in your community Site Management ai , how-to , moderation , automation	11	1896	30 בינואר,‏ 2025

בוט ספאם של AI טוען שהוא לא ספאם, אך יומן הסריקה מצביע על כך שהוא כן ספאם

נושאים קשורים