Upgrade Postgres com pouquíssimo espaço

The documented procedure on the forums lists two methods of upgrading Postgres.

  1. Just let Discourse handle it. This requires 3 times the disk space. So if your DB is 100GB, you would need an additional 200GB free to do the upgrade. Obviously a huge problem for people with large installs.
  2. Follow their “manual update” procedure. This requires 2 times the disk space, so if your DB is 100GB you would need an additional 100GB free. This is also a big problem for some.

In this post, @Falco suggested using the --link option to do the upgrade in-place using hard links. The docker container they suggest using supports that argument, but Discourse devs don’t suggest using it in the post.

So my question is this, should option 3 be:

  1. Run the command below, which will require a very small amount of additional disk space. So if your DB is 100GB, it might require, say, an additional 10GB? And if so, is this a recommended procedure by the Discourse devs, and has anyone actually done it before and lived to tell the tale?

New command to upgrade in-place:

docker run --rm \
	-v DIR:/var/discourse/shared/standalone/postgres_data:/var/lib/postgresql \
	tianon/postgres-upgrade:12-to-13 \
	--link

 

Compared to the old command to upgrade into a new directory (requiring double the space):

docker run --rm \
	-v /var/discourse/shared/standalone/postgres_data:/var/lib/postgresql/12/data \
	-v /var/discourse/shared/standalone/postgres_data_new:/var/lib/postgresql/13/data \
	tianon/postgres-upgrade:12-to-13

P.S.: I would have just replied to that PG13 upgrade thread, but it deletes posts after 7 days. Why do you have it configured that way? I know there was a lot of discussion when this first came up that would have been useful for reference.

1 curtida

If they have, they didn’t mention it here. Mostly instrucions here try to be as foolproof as possible and require as little system adminstration knowledge as possible. Most people here woud rather do something the safest, most tested way possible than some way designed to save a very few dollars.

If it works for you, you can update PostgreSQL 13 update accordingly, but before you do, do you feel comfortable recommending to someone who doesn’t know what bash is that they do it that way? You’re sure that it won’t hose their database and their site will be ruined forever?

The idea is that if some other good information is presented that it be added to the OP rather than asking people to read through year’s worth of posts that are likely to be unhelpful or wrong.

1 curtida

No I’m not sure, I don’t have much experience with postgres and was hoping one of the discourse devs could provide some assurances it would work.

Even if it does work I also wouldn’t recommend it as the default upgrade procedure as the old way keeps a separate copy of the DB for rollback. If it works it would be a great option for space-constrained environments though.

Another easy way is to spin up a new server, migrate the data, and turn off the old one. If you must use the old one, do the upgrade on a temporary server, so a fresh install on the original server (which probably needs an OS upgrade) and move it back.

That’s safe, easy, and well documented. Hundreds of people have done that.

Yes, but that would take a day or two. During that time we could either a) tell users their posts during this period will be lost or b) set the forum read-only. Neither is a great solution.

1 curtida

I don’t think that the server would be down a whole lot longer than during the rebuild. And if you move to the new server and stay there, you can leave the old server in read only mode while you make the move. If downtime is your concern then moving to a new server will be much, much better.

1 curtida

We have a pretty big forum, but I’ve never tried restoring a backup so I don’t know how long it would take. We would indeed stay on the new host if we did it. I would like to avoid that due to the extra work/annoyance if possible.

1 curtida

Yup, as I originally suggested here Discourse on postgres 12 breaks upgrades - #8 by merefield

I’d just bite the bullet?

1 curtida

All my posts here have been in an ongoing attempt to avoid doing that.

Você já conseguiu atualizar isso @Wingtip?

1 curtida

Outra forma de resolver o problema de upgrade com espaço limitado é fazer um backup, remover o diretório do postgres com rm -r, reconstruir e, em seguida, restaurar o backup. Fiz isso em um site na semana passada.

2 curtidas

O backup não ocupará quase tanto espaço quanto duplicar o diretório de dados (ou até mais, já que ele também precisa compactar)?

1 curtida

Não, nunca consegui atualizá-lo. Excluir o banco de dados e restaurar o backup parece bastante arriscado. Precisamos que a atualização no local funcione, basicamente.

Estamos executando o Ubuntu 18.04, que será descontinuado em 2023, então imagino que nesse ponto não teremos escolha a não ser migrar para um novo host de qualquer maneira, e planejamos engolir o sapo então, montar um novo host executando o 22.04 LTS e restaurar a partir do backup.

Hmm. Pode ser um empate. Acho que com o modelo de backup uma das cópias está compactada, o que pode fazer a diferença? E o site em que fiz isso tinha backups no S3. E era um site de teste, então as apostas eram baixas se houvesse um problema.

Exceto que os backups são usados com muito mais frequência em muito mais situações do que o upgrade in-place. Eu considero muito mais seguro.

1 curtida

Talvez, mas não tenho muita experiência com postgres e não me sinto confortável em fazer isso. Restaurar o site inteiro do backup em uma VM completamente diferente, isso eu me sinto confortável em fazer, no entanto, significaria perder posts por quantas horas forem necessárias para restaurar, então também não estou super entusiasmado com isso. Mas como o 18.04 está sendo descontinuado, não terei muita escolha no ano que vem.

1 curtida

A menos que seu banco de dados tenha dezenas de gigabytes, não levará horas. E você colocará o fórum em modo somente leitura antes de fazer backup e restaurar, então não perderá nenhum post. Não é tão difícil de fazer com virtualmente nenhum tempo de inatividade, apenas tempo somente leitura.

root@forum-app:/shared/postgres_data# du -sh
97G     .

Eu não colocaria em modo somente leitura, eu colocaria um banner dizendo às pessoas que suas postagens de hoje são efêmeras. Melhor deixá-los conversar sobre isso, mesmo que essas postagens sejam perdidas, na minha opinião.

1 curtida

Até lá, você também terá acesso ao chat integrado do Discourse, um recurso que será lançado na versão 2.9 (possivelmente desativado por padrão, mas em beta e com suporte para uso).

3 curtidas