Discourse Randomly Does Not run or Rebuild

joshhabka · March 20, 2025, 9:28pm

Out of nowhere, Discourse is no longer wanting to run and is not even rebuilding using ./launcher rebuild app. I commented out all plugins too.

Here is the logs when I try to start it: https://codefile.io/f/8XUuOqyEDd

Here are the logs when I use ./launcher rebuild app. I see something about “failed listening on port 6379 (TCP) aborting” but I have nothing running on that port!

https://codefile.io/f/zxCBRzEOA9

Canapin · March 20, 2025, 10:12pm

I don’t think it’s related to your issue. This warning often (always?) appear during a rebuild.

error: connection to server on socket "/var/run/postgresql/.s.PGSQL.5432" failed: Connection refused

I think your issue more likely comes from this.

Canapin · March 20, 2025, 10:17pm

This might give clues:

joshhabka · March 20, 2025, 11:40pm

Would this be what is causing it to not work when I run it(without doing any ./launcher rebuild app)?

joshhabka · March 21, 2025, 5:15am

I stopped all other services on my server and updated to the latest Ubuntu LTS and it still shows this:

PG::ConnectionBad: connection to server on socket "/var/run/postgresql/.s.PGSQL.5432" failed: Connection refused (PG::ConnectionBad)
        Is the server running locally and accepting connections on that socket?

which is what I would think the error is.

joshhabka · March 21, 2025, 5:37am

Swapping templates with 13 and even 15 did not solve the issue, which is what was shown in the referenced post.

Caused by:
PG::ConnectionBad: connection to server on socket “/var/run/postgresql/.s.PGSQL.5432” failed: No such file or directory (PG::ConnectionBad)
Is the server running locally and accepting connections on that socket?

mwaniki · March 21, 2025, 6:58am

timeout: down: postgres: 1s, normally up, want up

Seems like the database isn’t starting up correctly. The logs show it appears to occasionally start up properly, but only for a short time, so that could be a red herring.

ok: run: postgres: (pid 315501) 0s

The postgres logs could have some hint of the problem, especially when trying to start the app container.

tail -f shared/standalone/log/var-log/postgres/current

pfaffman · March 21, 2025, 1:38pm

Did you do the PostgreSQL 15 update

I too think it’s about an unclean shutdown. If you’ve got a backup, what I would do is spin up a new vm and restore it. You can follow Move a Discourse site to another VPS with rsync and exclude postgres_*.

The alternative, which is your only option if you don’t have a backup, will be to figure out a bunch of stuff about postgres that you don’t want to learn about.

joshhabka · March 22, 2025, 12:33am

How can I access my backups if my forum is down(as in I cannot go to admin settings and download a backup)?

I also did not try to migrate anything, I have been using it as normal and updating via the web ui? Why would the database have an unclean shutdown??

joshhabka · March 22, 2025, 12:33am

I will provide the Postgres logs, one second

joshhabka · March 22, 2025, 12:35am

2025-03-22 00:30:44.110 UTC [4922] FATAL: lock file “postmaster.pid” is empty
2025-03-22 00:30:44.110 UTC [4922] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.
2025-03-22 00:30:45.127 UTC [4964] FATAL: lock file “postmaster.pid” is empty
2025-03-22 00:30:45.127 UTC [4964] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.
2025-03-22 00:30:46.151 UTC [4966] FATAL: lock file “postmaster.pid” is empty
2025-03-22 00:30:46.151 UTC [4966] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.
2025-03-22 00:30:47.168 UTC [4970] FATAL: lock file “postmaster.pid” is empty
2025-03-22 00:30:47.168 UTC [4970] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.
2025-03-22 00:30:48.192 UTC [4977] FATAL: lock file “postmaster.pid” is empty
2025-03-22 00:30:48.192 UTC [4977] HINT: Either another server is starting, or the lock file is the remnant of a previous server startup crash.

joshhabka · March 22, 2025, 12:41am

-rw------- 1 syslog kvm 0 Mar 18 19:48 /var/discourse/shared/standalone/postgres_data/postmaster.pid

This is where my lockfile is

pfaffman · March 22, 2025, 3:23am

They are in /var/discourse/shared/standalone/backups/default

If you follow the rsync instructions I linked earlier, you’ll get them.

It crashed or the server rebooted or something things happen.

The database is “migrated” from one set of tables (tables get added and changed) to another on most upgrades.

You might try to stop the container and delete that lock file

And look in PG_VERSION to see what version you have, since I think you tried changing the template.

joshhabka · March 22, 2025, 3:24am

Yes, I did try to change after I saw the error.

So, would I do rm /var/discourse/shared/standalone/postgres_data/postmaster.pid ? to delete the lockfile then try to rebuild

Also thank you for helping me with this

joshhabka · March 22, 2025, 3:27am

I would do this command to delete the lockfile?

joshhabka · March 22, 2025, 3:51am

rm /var/discourse/shared/standalone/postgres_data/postmaster.pid was the solution, thank you!

system · April 21, 2025, 3:51am

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Newest Discourse Version breaks rebuild, Redis Port already in use Installation	8	1346	January 1, 2021
Error when starting Discourse, Redis port already in use Installation	12	1120	October 3, 2022
Moving to New Server woes Support	16	165	March 10, 2025
Cannot rebuild app, Support	2	93	February 3, 2025
Discourse update doesn't wait for Postgress DB to shut down Installation	5	875	January 28, 2022

Discourse Randomly Does Not run or Rebuild

Related topics