Email retries during long email server shutdown

Wall-E · October 12, 2023, 6:14pm

I’m not the sys admin of the AWS EC2 instance running our Discourse instance, but I’m the admin of the Discourse instance itself. Our AWS SES email service was shut down 3 weeks ago for security reasons, and our cloud staff is only fixing it now. So for 3 weeks our site could not send emails, and I’m seeing more than 40000 failed jobs and as many retries. I’m not a web developer, so I don’t understand what the Sidekiq page is telling me, but I’m worried the failed jobs will be retried when our email server is back online, flooding people with outdated emails they didn’t get for 3 weeks. Will that be the case? Does Discourse resend emails that could not be sent because the email server was offline? If so, how can I disable that to avoid flooding people with emails from our site? Can we adjust the granularity, say, only send emails about activity since a given date?

RGJ · October 13, 2023, 8:46am

Your fear is valid.

I’m not sure how much time you have to fix this? One solution could be to set up and configure a mail server that accepts emails but just throws them away.

The really quick and (very) dirty way to resolve this is to use redis-cli and issue a flushdb command. That will remove all queued jobs. It will also log out all users. Then reboot your Discourse to make sure all regular jobs run again.

Wall-E · October 13, 2023, 1:44pm

Logging out all users is certainly not desirable… The email server should be fixed today, but I’m not sure if our sys admins will have the flexibility to set up the email server to throw everything away.

I’m seeing “kill all” and “delete all” buttons at the bottom of the “Retries” page of Sidekiq (see attached). Is that something that can help?

supermathie · October 13, 2023, 8:49pm

Purging all jobs from the queue of a certain type should do the trick.

(I would have to go back and try to dig out how to do this…)
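
For reference, a minimal sketch of what purging by job type could look like with Sidekiq's Ruby API. This is not a tested Discourse procedure. The helper name `purge_jobs` and the constant `EMAIL_JOB_CLASSES` are hypothetical, and `Jobs::UserEmail` is an assumed class name: check the job names listed on the Sidekiq Retries page for the ones actually present on your site.

```ruby
# Sketch: delete every entry in a Sidekiq job set whose job class is listed.
# `job_set` is anything enumerable whose entries respond to `klass` and
# `delete`, e.g. Sidekiq::RetrySet.new, Sidekiq::ScheduledSet.new or
# Sidekiq::Queue.new("default") once "sidekiq/api" has been required.
def purge_jobs(job_set, class_names)
  # Collect the matches first so nothing is deleted while still iterating.
  doomed = job_set.select { |job| class_names.include?(job.klass) }
  doomed.each(&:delete)
  doomed.size # number of entries removed
end

# Assumed email job class name (hypothetical list, adjust to your site).
EMAIL_JOB_CLASSES = ["Jobs::UserEmail"].freeze
```

From a Rails console on the server, after `require "sidekiq/api"`, the call would be `purge_jobs(Sidekiq::RetrySet.new, EMAIL_JOB_CLASSES)`. Unlike a Redis flushdb, this leaves user sessions and all other job types untouched.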

pfaffman · October 14, 2023, 3:32pm

I think you are sure. They took three weeks to fix it at all.

You could ask if they could Google how to purge jobs from sidekiq and delete the mail jobs. I think that’s your best bet.

I’m guessing you don’t have access to do it yourself or hire anyone to help. Can you ssh into the EC2 instance that it’s running on? Otherwise, you could try to delete all 50k from the web interface.

Wall-E · October 25, 2023, 10:20am

The Sidekiq page with the kill/delete options worked. No EC2 sys admin was needed: being the forum’s admin was enough to operate the Sidekiq page, and I could delete all queued emails.
After the email server was back online, no “queued” email was resent.
