PG13 compatibility issue 🔥

RGJ · October 15, 2024, 10:28pm

What is this about?

This migration introduces NULLS NOT DISTINCT on an index on the problem_check_trackers table. (DEV: Fix problem check tracker unique index not respecting NULLs by Drenmi · Pull Request #29169 · discourse/discourse · GitHub)

What is the problem?

By default, when checking uniqueness on a tuple for the purposes of enforcing a unique index, PostgreSQL considers NULLs to be distinct values. Because of this we could incorrectly have multiple entries with { identifier: "rails_env", target: nil } created due to race conditions. This would then cause errors at runtime.
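To make the default behaviour concrete, here is a minimal sketch. It uses Python's built-in `sqlite3` module purely as a stand-in, on the assumption that SQLite's unique indexes treat NULLs as distinct in the same way PostgreSQL does by default. The table layout and index name are simplified for illustration and are not the real Discourse schema.

```python
import sqlite3

# SQLite is only a stand-in for PostgreSQL here (assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE problem_check_trackers (identifier TEXT NOT NULL, target TEXT)"
)
conn.execute(
    "CREATE UNIQUE INDEX idx_identifier_target "
    "ON problem_check_trackers (identifier, target)"
)


def insert_tracker(identifier, target):
    """Return True if the row was inserted, False if the unique index rejected it."""
    try:
        conn.execute(
            "INSERT INTO problem_check_trackers (identifier, target) VALUES (?, ?)",
            (identifier, target),
        )
        return True
    except sqlite3.IntegrityError:
        return False


# Two rows with the same identifier and a NULL target: both are accepted.
null_results = [insert_tracker("rails_env", None), insert_tracker("rails_env", None)]

# Two rows with the same identifier and the same non-NULL target: the second is rejected.
text_results = [insert_tracker("ram", "a"), insert_tracker("ram", "a")]

null_row_count = conn.execute(
    "SELECT COUNT(*) FROM problem_check_trackers WHERE target IS NULL"
).fetchone()[0]
```

The unique index only catches the duplicate when `target` holds an actual value, which is the gap the migration was trying to close.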

How does this solve it?

Drop the existing index and recreate it with the NULLS NOT DISTINCT option.

Problem

However, NULLS NOT DISTINCT was introduced in Postgres 15 beta2. The current Postgres version on a standard install is Postgres 13 and that does not support this feature.

Consequences

This change will not have any effect on PG13, since the NULLS NOT DISTINCT will be ignored (source).
Attempting to restore a backup from a PG15 server to a PG13 server will fail with the following error:

ERROR:  syntax error at or near "NULLS"
LINE 1: ...m_check_trackers USING btree (identifier, target) NULLS NOT ...
                                                             ^
EXCEPTION: psql failed:                                                              ^
/var/www/discourse/lib/backup_restore/database_restorer.rb:92:in `restore_dump'

(full line: CREATE UNIQUE INDEX index_problem_check_trackers_on_identifier_and_target ON public.problem_check_trackers USING btree (identifier, target) NULLS NOT DISTINCT;)
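For anyone who wants to spot this before a restore blows up, a rough pre-check could look like the sketch below. This is not how Discourse's backup restore works: `incompatible_statements` is a hypothetical helper, the `CREATE TABLE` line in the sample dump is made up, and the check only looks for this one clause, line by line.

```python
import re

# PostgreSQL 15 is the first major version that accepts NULLS NOT DISTINCT.
NULLS_NOT_DISTINCT_MIN_MAJOR = 15
NULLS_NOT_DISTINCT = re.compile(r"\bNULLS\s+NOT\s+DISTINCT\b", re.IGNORECASE)


def incompatible_statements(dump_sql, target_major):
    """Return the dump lines that a server of `target_major` would reject.

    Hypothetical helper: it only scans for NULLS NOT DISTINCT.
    """
    if target_major >= NULLS_NOT_DISTINCT_MIN_MAJOR:
        return []
    return [
        line.strip()
        for line in dump_sql.splitlines()
        if NULLS_NOT_DISTINCT.search(line)
    ]


# Sample dump: the first line is invented, the second is the statement from the error.
dump = """\
CREATE TABLE public.problem_check_trackers (identifier text, target text);
CREATE UNIQUE INDEX index_problem_check_trackers_on_identifier_and_target ON public.problem_check_trackers USING btree (identifier, target) NULLS NOT DISTINCT;
"""

flagged_on_13 = incompatible_statements(dump, 13)
flagged_on_15 = incompatible_statements(dump, 15)
```

Against a PG13 target the index statement is flagged; against PG15 nothing is.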

@tgxworld @drenmi

tgxworld · October 15, 2024, 11:20pm

@drenmi Looks like we have to revert the migration and consider another solution. This is probably causing self-hosted installs to error out.

ted · October 18, 2024, 4:35am

I have a PR up here that should work for PG13 as well.

github.com/discourse/discourse

FIX: Make problem check tracker unique constraint work on PG13

discourse:main ← discourse:fix/problem-check-target-pg13

opened 03:40AM - 18 Oct 24 UTC

Drenmi

+32 -11

### What is this change?

In #29169 we added a `NULLS NOT DISTINCT` option to the unique index on `problem_check_trackers`. This is to enforce uniqueness even when the `target` is `NULL`. (Postgres considers all `NULL`s to be distinct by default.)

However, this only works in PG15. In PG13 it does nothing.

This commit adds a default dummy string value `__NULL__` to `target`. Since it's a string, PG13 will be able to correctly identify duplicate records.

### Is it safe to run this?

Adding a default value will lock the table and can cause issues on large tables, but the `problem_check_trackers` table is constrained by the number of problem check classes, and is in the ballpark of 10-100 rows.

We’re still considering how realistic this case is in the wild, and how much we should potentially invest in working around it.
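The sentinel-default idea from the PR description can be sketched roughly as follows. Again this uses Python's `sqlite3` as a stand-in rather than PostgreSQL, with a simplified table and a hypothetical `track` helper; only the `__NULL__` value itself comes from the PR text.

```python
import sqlite3

SENTINEL = "__NULL__"  # dummy default value named in the PR description

# SQLite is only a stand-in for PostgreSQL here (assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE problem_check_trackers ("
    "identifier TEXT NOT NULL, "
    f"target TEXT DEFAULT '{SENTINEL}')"
)
conn.execute(
    "CREATE UNIQUE INDEX idx_identifier_target "
    "ON problem_check_trackers (identifier, target)"
)


def track(identifier):
    """Insert a tracker without a target; the column default fills in the sentinel."""
    try:
        conn.execute(
            "INSERT INTO problem_check_trackers (identifier) VALUES (?)",
            (identifier,),
        )
        return True
    except sqlite3.IntegrityError:
        return False


first = track("rails_env")   # accepted
second = track("rails_env")  # rejected: the sentinel is an ordinary string, so it collides
stored = conn.execute("SELECT target FROM problem_check_trackers").fetchall()
```

Because the sentinel is a regular string, the plain unique index is enough and no version-specific syntax is involved. Note that in this sketch a row inserted with an explicit NULL target would still slip past the index.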

RGJ · October 18, 2024, 7:33am

Well, I can’t speak for others and I don’t know whether I am truly representative (probably not) but I came across it twice within 3 days after that change was made…

So a workaround (and a revert!) would be much appreciated.

ted · October 21, 2024, 3:38am

Now that we have the workaround, which should work on PG15 as well, we should be able to remove the NULLS NOT DISTINCT.

Out of curiosity, what were you doing that necessitated restoring a PG15 backup on PG13? (It won’t affect the task above, just trying to understand what’s happening “in the wild” as much as possible.)

RGJ · October 21, 2024, 5:35am

We had one client who was attempting to restore a backup (I think they were self-hosted and had been trying things beyond their knowledge level), and we had another client who asked us to set up a staging site for custom plugin development purposes, so we took a backup from CDCK hosting.

In general, the built-in versioning mechanism in the backup metadata works really well for proactively determining whether something is going to blow up, but these kinds of situations* are like land mines.

(* Actually, the only other thing I can think of that is not covered by migration versioning is when a migration with an older date stamp is injected into main, but I digress)

ted · October 22, 2024, 2:29am

Thanks for the info @RGJ!

The PR for removing the NULLS NOT DISTINCT option is up:

github.com/discourse/discourse

DEV: Remove NULLS NOT DISTINCT from problem check trackers

discourse:main ← discourse:dev/remove-nulls-not-distinct-from-problem-check-trackers

opened 02:28AM - 22 Oct 24 UTC

Drenmi

+12 -0

### What is this change?

We added `NULLS NOT DISTINCT` to a unique index on `problem_check_trackers`. This option is only available in PG15+. It does not in itself break PG13, but restoring a PG15+ backup to PG13 currently errors out. It seems this is an operation that's more common than we first thought.

This commit fixes that by removing the `NULLS NOT DISTINCT`.

### Don't we need it, since we added it?

We already have another, backwards-compatible approach to do the same thing in place, so this shouldn't change existing behaviour.

gpoole · October 23, 2024, 1:16am

I can see the issue is already resolved but to add an in-the-wild experience: I had this issue trying to restore a backup created on a Discourse hosted instance to a dev container I had set up locally using Docker as part of setting up a dev environment. Seems that Discourse hosting runs PG 15 but the dev environment is 13?

sam · October 23, 2024, 1:27am

Yeah, this is the root of the issue: we need to update our open source container to 15. We will get to it over the next few months.

ted · November 18, 2024, 12:00am

This topic was automatically closed after 9 days. New replies are no longer allowed.
