More details on how the Redis cache is utilized?

griegm · September 15, 2021, 8:05pm

Hey there!

I’ve been performing some load tests on a Discourse instance and I noticed that when repeatedly fetching the same comment thread over and over the Redis cache hit rate went down, rather than up, which was a bit unexpected (for a mixture of reads/writes we saw a cache hit rate of up to 85%, for 100% reads it dipped down to as low as 22%).

I’ve done some searching around in the codebase and on the forums here, and how exactly the Redis cache is utilized is a bit unclear to me. The README states this:

We use Redis as a cache and for transient data.

I used redis-cli to dump the commands that were sent to the Redis cache during the aforementioned load test, and I primarily saw “get” commands for scheduled jobs and for keys prefixed with “__mb_backlog_id_n_” (I believe this is referring to MessageBus stuff).

I have the following questions:

Is there an “easy” way for me to search the codebase for which bits of data are cached in Redis? I’d love to be able to answer these questions on my own, but unfortunately I’m not super familiar with Ruby on Rails applications (or Ruby in general, really).
Does being logged-in/logged-out impact cache hit rates? For reference, the load test mentioned above was using an admin API key.
Are frequently queried/fairly static data such as post contents cached in Redis? Or is Redis primarily used for job scheduling and background processing with Sidekiq and whatnot?

Thanks in advance!

Falco · September 15, 2021, 11:28pm

This is the main thing here. The most aggressive cache happens for anonymous requests, so I’d suggest retrying the load test with some anon robots.

There are a handful of grepable methods, like Discourse.cache.fetch and DistributedCache.new.

We do cache some infrequent config blobs, but the approach to topics is mostly caching the whole response for the anons, allowing the app to craft a response while barely touching the DB.

Redis is heavily used for Sidekiq and MessageBus.

griegm · September 16, 2021, 5:15am

Awesome, thanks for the super helpful response!

griegm · September 17, 2021, 9:49pm

I just re-ran the load test but with anonymous requests this time around, and we saw a huge performance improvement! Previously we were only able to hit around 25 requests per second on a single host, now we can hit 380! The Redis cache hit rate also went up from ~22% to ~66%.

Just thought I’d circle back with the results in case anyone was curious.

Thanks again for the help!

system · October 17, 2021, 9:50pm

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
What type of details does redis store? Installation	2	415	March 21, 2023
What does Discourse use Redis for? Dev	5	3259	August 18, 2022
Using Redis then dump data is just great Dev	4	1007	April 12, 2016
How Discourse handle high HTTP request? Dev	2	573	March 12, 2019
Huge increase in Redis use after changing hosts Installation	9	1350	March 14, 2021

More details on how the Redis cache is utilized?

Related topics