AI triage examples not sent properly?

markschmucker · April 25, 2026, 7:11am

I have an agent to check for bank wiring information in a post. (That’s dangerous.) I give it an example in the Examples section.

System Prompt

Inspect this post for bank wiring information including account numbers and routing numbers. If the post appears to contain wiring info, reply with the single word “flag”. Otherwise reply with the single word “ignore”.

Example 1 User Message

Hey everyone, just wanted to share the wire transfer details for the group purchase we organized. Receiving Bank: First National Trust Bank, Chicago, IL | ABA/Routing Number: 0710003 | Account Number: 4827093 | Account Name: Marcus T. Holdings LLC | Reference: GroupBuy-2024-Q4.

Example 1 Model Response

flag

It was flagging every post, none of which contained bank info. So I changed the system prompt to tell me the reason it was responding with “flag”, and got this in the review queue:

Response from the model:

flag This post contains detailed bank wiring information in the first paragraph, including: - Receiving Bank name and location (First National Trust Bank, Chicago, IL) - ABA/Routing Number: 0710003 - Account Number: 4827093 - Account Name: Marcus T. Holdings LLC

So it’s interpreting the example as part of the post it’s supposed to evaluate. Are the examples being sent properly, with an explanation like “Here are some examples…”?

Falco · April 25, 2026, 2:44pm

Instead of giving your model instructions to return strings, you can use the automation type of Triage with AI Agent, then five this agent access to the flag tool.

Then you instruct the agent to call the tool when your conditions apply.

markschmucker · April 25, 2026, 10:58pm

You’re right that’s a cleaner solution, and I’ve done that, but it doesn’t change the issue. It still flags every post. It’s not understanding that the example is just an example.

Automation Settings

Agent Settings

It flags every post, citing the text in the example

Falco · April 26, 2026, 12:02am

What LLM are you using?
Those examples are wrong. They are sent as previous turns before your message, so they need to mimick the exact expected LLM response. If the example is from a situation where you want a tool call, then the response should mimic a tool call from the LLM. That said, your use case is so simple that any current LLM should be able to one-shot it without examples, just with a clear prompt saying when to call the tool.

markschmucker · April 26, 2026, 12:29am

I’m using Sonnet 4.5, which I agree should not need examples for this simple case. But for more complex cases, how do I “mimic a tool call from the LLM”? What should I type in the example boxes? Are there example examples somewhere?

Topic		Replies	Views
Discourse AI - AI triage using Agent Site Management automation , how-to , ai	11	794	August 5, 2025
"Triage posts using AI" script of "Automation" plugin always includes image data in request Bug automation , ai	1	91	December 26, 2025
Discourse AI - Spam detection Site Management moderation , how-to , ai , spam	30	4175	March 10, 2026
Should we tell the AI spam scanner to flag posts containing phone numbers by default? Feature ai , spam	4	125	September 15, 2025
Tag topics using AI Site Management automation , how-to , ai	0	111	April 21, 2026

AI triage examples not sent properly?

Related topics