Purpose of the Discourse shared volume in a high availability setup

nicktrav · April 4, 2019, 12:47am

Hi! I’m wondering what the purpose of the shared volume is in a Discourse deployment?

For context, we have Discourse up and running in a Kubernetes cluster (in GKE), but we’d like to scale out the number of instances of our deployment to make it more highly available. All instances would obviously continue to talk to the same Postgres database and Redis instance, but I’m wondering if all the webservers need to be talking to the same shared volume, or whether the webservers can be scaled independently (i.e. can each webserver instance just can have it’s own “shared” volume).

Or is there a hard requirement that all webservers utilize the same shared volume, in which case we’d have to look at mounting in something like an NFS volume into each of our containers.

Thanks!

sam · April 4, 2019, 12:52am

The shared volume is there as a value add you can get away without it. In a typical “uploads are on AWS”, PG / Redis somewhere central setup you will only use it for Rails/Unicorn/NGINX etc logs. You would then ship them somewhere central with some log aggregation service.

nicktrav · April 4, 2019, 12:59am

Perfect, thanks @sam!

Just wanted to check that there weren’t going to be issues with uploads going to one host, and then a request hits another host and isn’t available due to it running in a separate container with a separate mount.

Sounds like we’ll be ok here .

sam · April 4, 2019, 1:43am

Note, It will be an issue unless you use our s3 uploads provider

Topic		Replies	Views
Sharing shared volumes Support	11	980	November 9, 2020
Using multiple containers - what needs to be shared? Installation	2	888	March 3, 2021
How to scale discourse? Hosting	4	4189	February 9, 2015
Setting up a cluster? Installation	4	3437	February 12, 2017
App.yml shared volumes for a two website setup Installation	5	1217	July 3, 2019

Purpose of the Discourse shared volume in a high availability setup

Related topics