¿Deberíamos indicar al escáner de spam de IA que marque las publicaciones que contienen números de teléfono por defecto?

,

It seems like a fairly common variety of spam are unrelated customer support posts, probably trying to get false information to appear in AI or search results? These often contain phone numbers… should we mention phone numbers in our AI spam prompt? Likely uncommon for people to post these in normal use cases?

Existing default prompt
You are a spam detection system. Analyze the following post content and context.

Consider the post type carefully:
- For REPLY posts: Check if the response is relevant and topical to the thread
- For NEW TOPIC posts: Check if it's a legitimate topic or spam promotion

A post is spam if it matches any of these criteria:
- Contains unsolicited commercial content or promotions
- Has suspicious or unrelated external links
- Shows patterns of automated/bot posting
- Contains irrelevant content or advertisements
- For replies: Completely unrelated to the discussion thread
- Uses excessive keywords or repetitive text patterns
- Shows suspicious formatting or character usage

Be especially strict with:
- Replies that ignore the previous conversation
- Posts containing multiple unrelated external links
- Generic responses that could be posted anywhere

Be fair to:
- New users making legitimate first contributions
- Non-native speakers making genuine efforts to participate
- Topic-relevant product mentions in appropriate contexts

Site Specific Information:
- Site name: {site_title}
- Site URL: {site_url}
- Site description: {site_description}
- Site top 10 categories: {top_categories}

Format your response as a JSON object with a one key named "spam", which is a boolean that indicates if a post is spam or legitimate.
Your output should be in the following format:

{"spam": xx}

Where xx is true if the post is spam, or false if it's legitimate.
reply with valid JSON only
7 Me gusta

And I think that often those phone numbers include various obfuscations of phone numbers to get around regex phone number tests (I don’t really know, like, uh, unicode or some other nonsense?). Is there some way to get the AI to specifically look for those?

2 Me gusta

We can do some testing based on spam we’ve received, but I suspect prompting it to look out for phone numbers and specifying that it should pay attention to possible unicode obscurification attempts may be good enough to catch most of it?

I’d guess it would be better at catching attempts to hide numbers than regex would be just because it’s more flexible.

1 me gusta

Coming from someone who knows little about AI and not a lot about unicode (relative to the group that I hope is paying attention to this), that sounds great! :rofl:

I would think so. Given that the AI can identify Unicode reliably, there is zero reason for a phone number to contain Unicode, so it should be pretty easy. Back in the days of simple Bayesian classification, a Unicode phone number would be a very good Spam indicator. A bunch of them also had some weird unicode TM symbol and a couple others.

And if it could reliably look for stuff like “customer support” at a company that’s not connected to the forum, that would also be some low hanging fruit.

2 Me gusta

A lot of this depends on “we need evals”:

We have:

What we need to do here is:

  1. Add a big bunch of spam/ham posts to the eval suite (say 20-30 or each)
  2. Run the eval
  3. Fix the prompt
  4. Run the eval again

Otherwise we tend to be poking in the dark. cc @Falco

2 Me gusta