I think it’s hard to recommend specific models because there’s so much variation from site to site, even within a given task, and budgets differ too… but as far as this goes:
Is this still the case today? I’m able to use 4o for spam on a test site, but if you’re still having issues, send us a message and we’ll take a closer look!