That is why LLMs break prompts into system and user, so there is differentiation between safe and unsafe entries.
But yes, that is a possibility, specially among smaller and older models.
That is why LLMs break prompts into system and user, so there is differentiation between safe and unsafe entries.
But yes, that is a possibility, specially among smaller and older models.