Lockup with wait state at 90% Plus

tisawyer · June 24, 2020, 5:18pm

My Discourse is locking up with the CPU wait state (top wa:) at 90% or more. Is there a common reason other admins may have seen that causes this condition? I’m running Debian on AWS.

Falco · June 24, 2020, 5:36pm

Is the database in RDS or in the same container as the web is?

Is the machine disk a EBS network mount? Did you check if you run out of allowed IOPS ?

tisawyer · June 25, 2020, 10:58pm

The database is in the same docker container. The fellow that set this up for me created two EBS volumes, one is 8GiB the other is 32GiB. Volume types are GP2. Both volumes have 100 IOPS. Is that enough IOPS? I’m reading this Optimize the Performance of Amazon EBS Provisioned IOPS Volumes to learn but any hints pointing me in the right direction would be much appreciated.

Edit: I found that theQueue Length (mentioned in the above article) got very long during the last outage on the 19 (below chart). Question is now how do I find out what is doing that and how to prevent it?

Topic		Replies	Views
Connection timed out while connecting to upstream on AWS Hosting	12	3820	June 28, 2016
Another discourse mystery Installation	12	710	October 16, 2022
Trying to troubleshoot I/O Wait bottleneck Hosting	1	1044	October 30, 2020
I just hit my CPU cap on the Digital Ocean 2GB/2xCPU plan Hosting	35	17509	April 30, 2018
Discourse unavailable with high load average Support	21	2379	April 26, 2021

Lockup with wait state at 90% Plus

Related topics