I’ve enabled the Discourse AI spam handling on our forum. I’ve set up Claude Sonnet 4 with an API key and selected the Spam detector persona.
I did a test post that is clearly spam. Nothing subtle about it.
It was not blocked and was posted immediately.
When I gave the post URL to the spam bot using the test feature, the result says Not spam, but in the Scan log it says: SPAM - This is a clear promotional advertisement…
My expectation would be that the result would be SPAM, matching the Scan log declaration of SPAM. And that this would then queue up the post for review by admins and moderators, for example.
Might anyone be able to share what I’m missing? I’m no expert – so am open to any guidance!
Thank you!