Scheduled backup fails when Sidekiq is forcefully restarted

JagWaugh · March 23, 2018, 4:34pm

The log indicates that the zip operation runs out of space, but…

it leaves 3 files (two are .gz) spaced roughly 30 min apart.
the .gz files do not appear in the GUI.
It seems to happen when sidekiq gets restarted for using too much memory.

We’ve oodles of RAM, and enough space on the drive if it weren’t for having 3 (failed) backups for that day.

78G space on the drive, backups are 16G each

Here are the corpses left over after the job fails:

-rw-r--r-- 1 ubuntu www-data 17094434327 Mar 19 08:48 jag-lovers-forums-2018-03-19-083309-v20180309014014.tar.gz
-rw-r--r-- 1 ubuntu www-data 17099593947 Mar 19 09:19 jag-lovers-forums-2018-03-19-090524-v20180309014014.tar.gz
-rw-r--r-- 1 ubuntu www-data 17184337920 Mar 19 09:52 jag-lovers-forums-2018-03-19-093558-v20180309014014.tar
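
For anyone hitting the same thing, here is a minimal Python sketch for spotting leftovers like these and adding up how much space they hold. The helper names `leftover_backups` and `total_gib` are made up for illustration, and the backup directory is something you pass in yourself, since the thread does not give its path. It only reads file sizes and deletes nothing.

```python
from pathlib import Path


def leftover_backups(backup_dir):
    """Return (path, size_in_bytes) for every .tar and .tar.gz file in backup_dir.

    Results are sorted by file name. For timestamped names like the ones in
    the listing above, that is also chronological order.
    """
    root = Path(backup_dir)
    files = [
        p for p in root.iterdir()
        if p.is_file() and (p.name.endswith(".tar") or p.name.endswith(".tar.gz"))
    ]
    files.sort(key=lambda p: p.name)
    return [(p, p.stat().st_size) for p in files]


def total_gib(entries):
    """Sum the sizes of (path, size) pairs and convert bytes to GiB."""
    return sum(size for _, size in entries) / 1024**3
```

Calling `total_gib(leftover_backups(some_dir))` gives the space tied up in archives, finished or not. For the three files listed above, the sizes add up to 51,378,366,194 bytes, about 47.9 GiB, which is well over half of a 78G drive.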

We started seeing the sidekiq restarts in December.

Anyone else seeing this?

codinghorror · March 24, 2018, 5:47am

Yes, I have definitely seen this happen if sidekiq gets forcefully restarted (due to excess memory use) in the middle of a backup. cc @sam

Typically this has only been an issue during a global rebake. The post version was incremented in a commit about 1 month ago, which means every single post in the system must be rebaked, and that can take months. The global rebake process causes Sidekiq to run out of memory much more frequently over that time period.

JagWaugh · March 24, 2018, 6:11am

Our sidekiq restarts between 3 and 12 times/day. This started happening back in Dec.

We’ve oodles of RAM. Is there a way we can up the limit for sidekiq (it’s failing when it goes much above 500M)?

sam · March 25, 2018, 11:38pm

Yes absolutely, we do that on our heavy instances. Set:

env:
  UNICORN_SIDEKIQ_MAX_RSS: 1000

This will double the limit, from 500 MB to 1000 MB.

JagWaugh · April 3, 2018, 5:15am

To follow up on this:

It’s been a week since we increased the sidekiq headroom. Sidekiq restarts have gone from 3-12 restarts/day to zero, and the backup hasn’t failed once.

jomaxro · April 9, 2018, 10:00pm

This topic was automatically closed after 4 days. New replies are no longer allowed.
