Health check API

axil · 4. Juni 2019 um 08:40

It would be nice to have some sort of health check API. We recently faced an issue with an update and Discourse was producing 500 errors.

However, curl returned 200:

curl -I https://forum.gitlab.com
HTTP/2 200
server: nginx

codinghorror · 4. Juni 2019 um 08:58

This exists, see

/srv/status

axil · 4. Juni 2019 um 11:53

Ah, thank you! I was searching “health check” and that didn’t yield any results.

ryancey · 29. Juli 2019 um 08:44

Post is 404. What are the endpoints?

codinghorror · 29. Juli 2019 um 08:56

I’ve updated my post.

michaeld · 29. Juli 2019 um 11:21

Actually I don’t think /srv/status would catch migration issues like the one mentioned above…

(and it would be pretty hard to build a check that does catch issues like that one)

sam · 30. Juli 2019 um 08:22

Yes… /srv/status is there as a very cheap test, all it does is ensures the apps middleware stack is working.

To catch issues where you auto deploy I would recommend monitoring 200s, if there is a large increase in non 200s alert.

downey · 28. Januar 2020 um 21:18

Would https://discourse.example.org/srv/status be a good place to point an uptime monitor? I am thinking it may not be enough to have a reliable measure of “is the site up”, but it would be nice to have something that produces less load on the system for monitoring purposes.

(Alternatively, might there be any plans to expand the components listed on this endpoint?)

sam · 28. Januar 2020 um 21:19

Yeah that is a reasonable spot, you could also point at a specific topic and search for text if you want something more fancy

downey · 28. Januar 2020 um 21:27

Yeah, we had been using /about but inclined to use this instead.

My old ops/on-call mind woke up to make me think it could still be interesting (and helpful occasionally for troubleshooting) if it were something like:

db ok
middleware ok
whatever-else ok
...
all systems ok

Thema		Antworten	Aufrufe
How to test /srv/status Support	1	713	17. März 2021
`/srv/status` returns OK even if database is broken Dev	6	659	18. Juli 2020
What URL should we monitor to be sure Discourse is up Support	3	1554	25. April 2016
Webhook for Discourse Uptime Monitoring? Dev	24	1783	16. Januar 2026
`/srv/status` monitoring endpoint doesn't catch some service unavailability issues - one example free space Feature	14	1528	26. April 2017

Health check API

Verwandte Themen