"Your Redis network connection is performing extremely poorly"

I am consistently getting this in the logs, with values between ~100k and ~1.35m, though readings near 100k seem to be the most common:

Your Redis network connection is performing extremely poorly. Last RTT readings were [97069, 103986, 98459, 100762, 381617], ideally these should be < 1000. Ensure Redis is running in the same AZ or datacenter as Sidekiq. If these values are close to 100,000, that means your Sidekiq process may be CPU-saturated; reduce your concurrency and/or see https://github.com/mperham/sidekiq/discussions/5039

Does this indicate that Redis isn’t able to get enough CPU? There seems to be plenty of headroom for both CPU and RAM on the server itself, though.
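One way to sanity-check the RTT numbers yourself (a sketch, assuming the standard Discourse install layout under /var/discourse) is to measure latency with redis-cli from inside the container:

```shell
cd /var/discourse
./launcher enter app

# measure round-trip latency to Redis, in milliseconds (Ctrl-C to stop)
redis-cli --latency

# measure the host's intrinsic latency (scheduling/CPU stalls) for 5 seconds;
# high values here point at CPU saturation rather than the network
redis-cli --intrinsic-latency 5
```

If `--intrinsic-latency` reports large spikes, the delay is in the host/process scheduling rather than in Redis or the network, which matches the CPU-saturation hint in the warning message.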

Also:
Sidekiq is consuming too much memory (using: 3570.19M) for 'www.example.com', restarting

This is using the all-in-one app.yml with Discourse stable 3.3.2.

From the app.yml:

UNICORN_SIDEKIQS: 9
DISCOURSE_SIDEKIQ_WORKERS: 5

I also added this configuration to the host:

Sidekiq dashboard info:


It does seem like Redis is unable to exceed 1024M of memory usage.

If anyone has any ideas, I’d appreciate it! :meow_heart:

To follow up on this, I’m having the same issue with Jobs::PostAlert:

Those jobs often run for up to 15 minutes in current testing, using 4 Sidekiqs with 5 (default) threads each. It seems like Sidekiq's jobs-per-second throughput mostly depends on how many of those jobs are running simultaneously and how many threads are left free for the other jobs.

Increasing the Sidekiqs to 6 or higher (5 threads each) does increase the queue-clearing speed, but Postgres then crashes fairly regularly (I am guessing from too many Jobs::PostAlert jobs being run simultaneously).

This is on stable 3.3.2. The changes and fixes from the linked thread already seem to be implemented in 3.3.2, if I am not mistaken.

Postgres should never crash; a crash generally indicates a Postgres bug or some sort of larger problem.

Do you have logs?


Have you rebooted the server since making those kernel config changes?

Maybe the output of lscpu would also be helpful.
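If the kernel config changes included disabling transparent huge pages, the current state can be checked directly (these sysfs paths are standard on most Linux distributions):

```shell
# the bracketed value is the active setting; THP is only off if it reads [never]
cat /sys/kernel/mm/transparent_hugepage/enabled
cat /sys/kernel/mm/transparent_hugepage/defrag
```

A reading like `[always] madvise never` means THP is still active despite any config edits, which is worth ruling out before rebooting.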


You should never bump UNICORN_SIDEKIQS that high; only increase the workers. But:

This should never happen.

The possibilities are:

  1. You are constrained on resources, because either
    a) your site has outgrown the server resources, or
    b) you are misallocating resources
  2. There is a bug somewhere in the stack

I’d start by setting

UNICORN_SIDEKIQS: 1
DISCOURSE_SIDEKIQ_WORKERS: 20

which should release some RAM from your server.

For further information, you will need to run the offending jobs' queries in a PostgreSQL console and report back on what the bottleneck is.
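As a hedged sketch of how one might look for that bottleneck, assuming the pg_stat_statements extension is available (it is not necessarily enabled on a stock install) and the standard Discourse container layout, the slowest statements can be listed from psql:

```shell
cd /var/discourse
./launcher enter app

# list the ten statements with the highest mean execution time; the database
# name and postgres role are assumptions based on a standard install
su postgres -c "psql discourse -c \"
  SELECT calls,
         round(mean_exec_time::numeric, 1) AS mean_ms,
         left(query, 80) AS query
  FROM pg_stat_statements
  ORDER BY mean_exec_time DESC
  LIMIT 10;\""
```

Any Jobs::PostAlert query that surfaces near the top can then be run manually with `EXPLAIN (ANALYZE, BUFFERS)` to see where the time goes.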


Apologies for disappearing and thank you for the responses. :slight_smile:

I believe the main reason Redis was slow was that transparent huge pages (THP) were still enabled (when I had thought otherwise):
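For reference, one common way to actually turn THP off (an illustrative sketch, not necessarily the exact steps taken here):

```shell
# disable THP for the running kernel (does not survive a reboot)
echo never > /sys/kernel/mm/transparent_hugepage/enabled
echo never > /sys/kernel/mm/transparent_hugepage/defrag

# to make it persistent, add transparent_hugepage=never to the kernel
# command line (e.g. GRUB_CMDLINE_LINUX in /etc/default/grub), then
# regenerate the grub config and reboot
```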

For PG crashing, the main solution for me was adding this to the app.yml:

docker_args:
  - "--shm-size=34g"

The value is set to db_shared_buffers + 2 GB, with db_shared_buffers being 25% of the total host machine RAM.
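As a worked example of that formula (the 128 GB figure is assumed for illustration; substitute your host's actual RAM):

```shell
total_ram_gb=128                                # assumed host RAM
db_shared_buffers_gb=$((total_ram_gb / 4))      # 25% of host RAM -> 32
shm_size_gb=$((db_shared_buffers_gb + 2))       # plus 2 GB headroom -> 34
echo "--shm-size=${shm_size_gb}g"               # prints --shm-size=34g
```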

This overrides the default of 512m.


I looked back at your posting history, and I see in Very slow Sidekiq issue … massive numbers of unread user notifications that you were running a 32-core, 128 GB server with a very large and active userbase. In that context, I see why 34G is not such a large number! For context, though, it might be helpful (and interesting) to know the size of your setup, possibly here or even in your bio (maybe daily and monthly active users, database backup size, and server config: RAM, swap, disk, CPUs). Maybe even a thread where we all just share our stats, large and small.