Reducing backscatter in email interface?

icculus · 27.Март.2017 15:36:31

This is a bit of a braindump, under the assumption there isn’t a “this already works if you click this checkbox” solution out there.

I’m experimenting with Discourse and its Mailing List Mode (which are both pretty great!), and I was thinking about some minor improvements with email backscatter.

I’m using the mail-receiver Docker container that Discourse offers, not POP3 polling. What I’m going to describe here only applies to handling one’s own incoming SMTP traffic.

The gist is this: Discourse is smart enough to reject incoming mail if the user isn’t allowed to post somewhere, or if the user wrote to a bogus email address. It sends a polite bounce email, but this has unintended consequences:

Spammers hitting the mail server will generate bounces, either for bogus addresses or real people that didn’t actually try to interact with Discourse, so you can get a flurry of backscatter, which is bad.
Legit email from legit users that made legit mistakes will get a legit bounce, and GMail will quietly put it in the user’s spam folder (at least, it did for me).

A better solution would be to reject mail during the SMTP transaction, when reasonable to do so. Making that work will:

Reduce unnecessary email bounce traffic due to joe-job spammers, etc.
Prevent risk of damage to a domain’s spam reputation.
Give users a more visible and immediate notice that something went wrong (email providers will tend to put a THIS DIDN’T SEND email in the user’s inbox).

Even if we just use the mail-receiver Docker container that Discourse offers, this still needs some support from the Discourse instance to make this happen. I couldn’t find any API to do this, but it would be useful if we could query with the system API key for something like this:

Can a@b.c send mail to x@y.z?

And this returns a boolean (yes or no on whether mail is acceptable) and maybe a simple string of why ("x@y.z isn’t a valid reply-to-thread slug, x@y.z isn’t a create-new-topic address, a@b.c isn’t allowed to post here, etc. It could even just be the existing “Email::Receiver::StrangersNotAllowedError” style error strings).

This could be more fine-grained (“is a@b.c a valid user?” “is x@y.z an available email target?”), but ultimately we want an API that says “should I drop this email right now or send it through to the system?” …it doesn’t have to, and probably shouldn’t, deal with rejections for things that need further processing (email is too short rejections, etc). We really just need to catch the obvious spam and malware vectors here.

Then that API call can be hooked up to /usr/local/bin/receive-mail in the mail-receiver Docker container, which can optionally return a permanent fail to the SMTP client instead of sending the email content to Discourse at all.

Does this sound like a reasonable approach? Does something like this exist already?

mpalmer · 27.Март.2017 23:15:38

Nothing like that exists already. It wouldn’t be impossible to add, but it wouldn’t be trivial, either. Postfix has some excellent hooks for allowing SMTP-time checks to occur, we’re just not using them because mail-receiver was very much a “simplest thing that could possibly work” implementation.

If you’d like to work on this, I’m happy to provide guidance, code review, and testing. I’d like to see it happen, it’s just not something that I think anyone here at CDCK World HQ will have any bandwidth to work on any time soon.

icculus · 05.Апрель.2017 06:50:16

I’ve posted a PR for the new API inside of Discourse here.

If I’m understanding Postfix correctly, we’d need something that talks over the Milter protocol or SMTP to properly reject email (it’s too late when /usr/local/bin/receive-mail runs). That seemed non-trivial, at least for the moment, so I’ll wait for feedback on the PR before worrying about that part.

mpalmer · 05.Апрель.2017 23:30:21

I’ll leave the core PR to those more experienced in the app, but for the Postfix integration, yes, you need something that will talk milter, SMTP or Postfix policy protocol to Postfix and HTTP to Discourse, to translate. I’d strongly recommend going the Postfix policy route, as the protocol is easy to parse, and you can avoid the need to manage spawning your own daemons by having the Postfix master process do it for you.

icculus · 06.Апрель.2017 06:58:08

or Postfix policy protocol

Oh wow, this policy protocol thing is so much less painful! Thanks for mentioning that.

Ok, I wrote a script to handle the SMTP portions of all this magic.

First, you’ll need the patch in the PR applied, and a sv restart unicorn or reboot or whatever to pick up the changes.

Then get this script:

discourse-smtp-fast-rejection.rb.gz (1.3 KB)

In the mail-receiver Docker container, you’ll want to put that file at /usr/local/bin/discourse-smtp-fast-rejection.rb, perhaps with a less verbose name if you want.

/etc/postfix/mail-receiver-environment.json in the mail-receiver Docker container needs a DISCOURSE_SMTP_SHOULD_REJECT_ENDPOINT entry, or just hardcode it in the script appropriately. It should look like this:

https://discourse.example.com/admin/email/smtp_should_reject.json

I don’t know how/where that JSON file gets generated. I ended up hardcoding it in my copy for now.

Add this to the end of /etc/postfix/master.cf …

policy  unix  -       n       n       -       -       spawn
    user=nobody argv=/usr/local/bin/discourse-smtp-fast-rejection.rb

…and this to the end of /etc/postfix/main.cf …

smtpd_recipient_restrictions = check_policy_service unix:private/policy

And run postfix reload inside the container, or just reboot or whatever.

Note that there’s a bug in my PR about reply-to addresses, but once that’s sorted out, this work might be complete. The mail-receiver container’s SMTP server is rejecting email appropriately before sending it on to Discourse.

mpalmer · 06.Апрель.2017 07:15:53

A PR against mail-receiver would be greatly appreciated, once the core changes have gone in.

icculus · 06.Апрель.2017 18:16:15

Okay, this is all sitting in various PRs now. You’ve probably gotten 12 emails with the links at this point, :), but here they are one more time for the general public:

https://github.com/discourse/discourse/pull/4793

https://github.com/discourse/mail-receiver/pull/2

https://github.com/discourse/discourse_docker/pull/344

watchmanmonitor · 07.Апрель.2017 03:45:25

This looks pretty slick, nice work.

There is mention of testing.. I’ve got a new discourse instance that’s up and running now, but not scheduled to be in production for a week or so. I can offer it for real-world testing (and can grant access as needed).

As for the setup of this, looking at the new variable in the mail-receiver, I wonder:

If DISCOURSE_SMTP_SHOULD_REJECT_ENDPOINT is going to be optional, should that be commented out by default?

If it should be on by default, maybe there should be just one variable to set, like

DISCOURSE_ADMIN_URL

I’m suggesting admin URL as it would have to take in to account any subfolder. Using this, the paths to /email/handle_mail and email/smtp_should_reject.json could be implied by the mail-receiver itself.

mpalmer · 07.Апрель.2017 04:13:13

I was pondering that myself; rather than having to list a whole bunch of URLs all over the place, when the internal structure is known and fixed. @icculus, if you’re feeling frisky that would be a nice change to make, although in the interests of getting things merged I wouldn’t consider it a blocker, it can always be done in the future.

icculus · 07.Апрель.2017 04:30:34

This is a good idea, I’ll fix this up in the morning.

icculus · 07.Апрель.2017 04:37:49

I’ve got a live instance running this already, and I’m trying to decide if there’s any reason I can’t just blow away my existing mail-receiver and build a new one to make sure I didn’t screw up the new parts of the Docker container configuration.

@mpalmer, I assume this line in mail-receiver.yml pulls in the contents of GitHub - discourse/mail-receiver · GitHub from somewhere?

base_image: discourse/mail-receiver:1.0.0

Is there an easy way to override that, so it picks up my changes? Or is that literally just cloning from the github repo when it sees that?

mpalmer · 07.Апрель.2017 05:30:16

That’s a reference to the docker hub image that comes pre-built. You can build your own image based on your changes, by running docker build -t my-funky-mail-receiver . in the mail-receiver repo, and then change base_image to my-funky-mail-receiver in the mail-receiver.yml, and everything will Just Work.

icculus · 07.Апрель.2017 17:06:37

If it should be on by default, maybe there should be just one variable to set, like
DISCOURSE_ADMIN_URL

I went further and changed it to DISCOURSE_BASE_URL; we can tack on the “admin” part too.

The PRs are updated with that now.

icculus · 07.Апрель.2017 17:33:10

change base_image to my-funky-mail-receiver in the mail-receiver.yml, and everything will Just Work.

This worked like a champ, thank you; discourse.libsdl.org is up and running with a freshly built mail-receiver container, so we can say with some confidence that the latest PR is safe to merge.

You can send email to any address @discourse.libsdl.org to get a rejection to see it working on your end.

robbyoconnor · 06.Июнь.2017 06:36:33

Should I work on this, or do you have it @icculus?

icculus · 06.Июнь.2017 12:10:02

If you have time now, go for it!

robbyoconnor · 06.Июнь.2017 21:33:45

Okay – just lemme know if you do before I do…I might do it if I get bored…maybe…

icculus · 08.Декабрь.2020 06:07:59

Привет, оживляю старую тему, потому что какая-то версия патча для получателя почты, который так и не был применён, всё же попала в систему контроля версий. Со временем, по мере устаревания кода, мой модифицированный контейнер перестал работать.

Я удалил свой контейнер получателя почты, создал новый контейнер и новый API-ключ. Входящая почта на моём экземпляре Discourse снова работает… но система, кажется, снова рассылает отказы в получении писем, как в 2017 году.

Действительно, оба ответа об отказе вернулись в мой почтовый ящик, хотя письмо с ошибкой BadDestinationAddress должно было быть отклонено на уровне SMTP ещё до передачи в Discourse для дальнейшей обработки и отправки ответа. Если спамер отправит на этот сервер письмо с несуществующим адресом, это приведёт к генерации backscatter.

При прямом обращении к SMTP-серверу я вижу, что он не пытается отклонять поддельные письма.

root@discourse:/var/discourse# telnet localhost 25
Trying 127.0.0.1...
Connected to localhost.
Escape character is '^]'.
220 ESMTP server
HELO sdfsdfsdf
250 discourse-mail-receiver.localdomain
MAIL FROM: sdfsdf@example.com
250 2.1.0 Ok
RCPT TO: sdfsdfsdf@discourse.libsdl.org
250 2.1.5 Ok

…всё это к тому, что я вижу: в стандартном образе получателя почты подключён скрипт быстрого отклонения, но, похоже, он ничего не отклоняет…?

/etc/postfix/master.cf:

policy     unix  -       n       n       -       -       spawn user=nobody
    argv=/usr/local/bin/discourse-smtp-fast-rejection

/etc/postfix/main.cf:

smtpd_recipient_restrictions = check_policy_service unix:private/policy

Нужно ли мне что-то настроить, чтобы это заработало, или есть способ глубже разобраться, почему это не работает? У других людей это работает?

Спасибо!

icculus · 26.Январь.2021 14:30:48

Поднимаю… У кого-то ещё есть эта проблема?

Simon_Manning · 07.Май.2022 22:51:38

Я тоже наблюдаю эту проблему, которую заметил, когда недавний спам использовал адрес для ответов (replies+abc123@instance) в поле «От кого», что вызвало у Discourse создание серии автоматических ответов самому себе.

Судя по всему, весь функционал быстрого отклонения был слит, и в README для mail-receiver упоминается использование BLACKLISTED_SENDER_DOMAINS, который является частью быстрого отклонения, однако быстрое отклонение отключено по умолчанию.

github.com/discourse/discourse_docker

Added environment variable for mail-receiver's smtp_should_reject support. (#344)

master ← icculus:smtp-fast-reject

closed 11:15PM - 22 Aug 19 UTC

icculus

+4 -5

These are the changes needed to the main Discourse container to fight backscatte…r. This isn't tested because I have no idea what I'm doing. :) The discussion that led to this is here: https://meta.discourse.org/t/reducing-backscatter-in-email-interface/59974 There is also a PR for discourse/mail-receiver that relies on this PR: https://github.com/discourse/mail-receiver/pull/2 And the new API that mail-receiver will expect to call is on this PR: https://github.com/discourse/discourse/pull/4793

Вышеупомянутое изменение так и не было слито, ссылаясь на необходимость проверить обратную совместимость, и из-за отсутствия DISCOURSE_BASE_URL быстрое отклонение отключено следующей строкой:

github.com/discourse/mail-receiver

lib/mail_receiver/fast_rejection.rb

08a25ae38


      
          
          @disabled = @env['DISCOURSE_FAST_REJECTION_DISABLED'] || !@env['DISCOURSE_BASE_URL']

Приложение mail receiver написано так, что оно обрабатывает либо DISCOURSE_BASE_URL, либо DISCOURSE_MAIL_ENDPOINT, как показано ниже, и в README не упоминается последний, поэтому я считаю, что будет безопасно применить изменение из того pull request, используя https вместо http, чтобы отразить более недавнее изменение в образце.

Изменение, предложенное в pull request, означает, что быстрое отклонение будет включено по умолчанию для новых операторов mail-receiver, то есть для тех, кто копирует образец в качестве отправной точки.

Существующие операторы продолжат иметь его отключенным, пока файл yml не будет вручную обновлен для использования DISCOURSE_BASE_URL. Если мои утверждения здесь точны, возможно, об этом стоит написать в #feature:announcements после обновления образца.

github.com/discourse/mail-receiver

lib/mail_receiver/discourse_mail_receiver.rb

08a25ae38


      
          def endpoint
            return @endpoint if @endpoint
          
            @endpoint = @env["DISCOURSE_MAIL_ENDPOINT"]
          
            if @env['DISCOURSE_BASE_URL']
              @endpoint = "#{@env['DISCOURSE_BASE_URL']}/admin/email/handle_mail"
            end
            @endpoint
          end

Также было бы несложно сделать что-то очень похожее на endpoint handle_mail для endpoint быстрого отклонения, то есть определять endpoint из DISCOURSE_MAIL_ENDPOINT, если DISCOURSE_BASE_URL отсутствует.

Тогда быстрое отклонение можно будет включить по умолчанию как для новых, так и для существующих операторов. Я с радостью подготовлю pull request для этого изменения, если включение его для всех (кто пересоберет mail-receiver после изменения) является желательным.

Тема		Ответов	Просм.
Email receive issue Bug mail-receiver , fixed	29	602	13.04.2026
Configure direct-delivery incoming email for self-hosted sites with Mail-Receiver Self-Hosting email , how-to , advanced-setup , mail-receiver	76	50109	27.03.2026
Mail-receiver won’t deliver mail to Discourse Self-hosting mail-receiver	16	476	26.01.2023
Seeing smtp_should_reject "400 bad request" failures on bounce processing Bug mail-receiver	13	287	24.11.2025
Bounced emails not being detected Support email	16	1544	07.02.2024

Reducing backscatter in email interface?

Связанные темы