I was migrating our Discourse instance from one server to another and came across an interesting issue…
We use S3 to store uploads from the forum. We enabled this several years ago, so it’s not something we introduced in this migration.
After fixing a couple of other issues, I was able to import the backup. However, it failed on an S3-related step with the following output:
Updating the URLs in the database...
Removing old optimized images...
Flagging all posts containing lightboxes for rebake...
72038 posts were flagged for a rebake
EXCEPTION: 257 posts are not remapped to new S3 upload URL. S3 migration failed for db 'default'.
After digging a bit, I was able to trace the issue to this line:
Then I went to the Rails console and was able to replicate the queries with the following:
discourse(prod)> SiteSetting.cdn_path("/uploads/#{@current_db}/original").sub(/https?:/, "")
=> "/uploads//original"
discourse(prod)> RailsMultisite::ConnectionManagement.current_db
=> "default"
discourse(prod)> cdn_path = SiteSetting.cdn_path("/uploads/default/original").sub(/https?:/, "")
=> "/uploads/default/original"
discourse(prod)> Post.where("cooked LIKE '%#{cdn_path}%'")
=> ...
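A side note on the first console result above: the empty segment in "/uploads//original" is most likely just a console artifact. `@current_db` is an instance variable that is not set in the console session, and an unset instance variable evaluates to `nil`, which interpolates as an empty string. A minimal plain-Ruby sketch of that behavior (no Discourse code involved; the variable names `unset_path` and `set_path` are made up for illustration):

```ruby
# Plain Ruby, no Rails or Discourse required.
# An instance variable that was never assigned evaluates to nil,
# and nil interpolates into a string as "".
unset_path = "/uploads/#{@current_db}/original"

# Assigning the value by hand (the "default" db name from the console output)
# produces the path used in the later queries.
@current_db = "default"
set_path = "/uploads/#{@current_db}/original"

puts unset_path
puts set_path
```

That is why the later queries hardcode "default" instead of relying on the instance variable.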
Then I went to those particular posts, and they were part of the Performance Reports (the screenshot is from after I ran a find-and-replace script):
Apparently, that check retrieves any post containing /uploads/default/original in the cooked field, even if it is not a legitimate asset. In this case, /uploads/default/original was used as “plain text”, so the migration job did not remap it.
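To illustrate the difference, here is a minimal plain-Ruby sketch. It is not Discourse's implementation: the sample posts, the `forum.example.com` host, and both helper methods are hypothetical. It compares a bare substring match (what the `LIKE '%...%'` query effectively does) with a stricter match that only counts the path inside a `src` or `href` attribute:

```ruby
# Hypothetical sample data; not taken from the real database.
UPLOAD_PATH = "/uploads/default/original"

cooked_posts = {
  image_post: '<p><img src="//forum.example.com/uploads/default/original/1X/abc.png" alt="chart"></p>',
  text_post:  '<p>Slow requests: /uploads/default/original/1X/abc.png (1.2s)</p>',
  other_post: '<p>Nothing to see here.</p>'
}

# Rough equivalent of cooked LIKE '%/uploads/default/original%':
# any occurrence of the path counts, including plain text.
def substring_matches(posts, path)
  posts.select { |_, cooked| cooked.include?(path) }.keys
end

# Stricter, hypothetical variant: only count the path when it appears
# inside a src="..." or href="..." attribute value.
def attribute_matches(posts, path)
  pattern = /(?:src|href)="[^"]*#{Regexp.escape(path)}[^"]*"/
  posts.select { |_, cooked| cooked.match?(pattern) }.keys
end

p substring_matches(cooked_posts, UPLOAD_PATH)  # image_post and text_post
p attribute_matches(cooked_posts, UPLOAD_PATH)  # image_post only
```

The attribute-based pattern is only an assumption about what a "legitimate asset" looks like; it would miss other attributes such as `srcset`, so treat it as a way to picture the false positive rather than a proposed fix.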
I'm not sure whether this is expected behavior?
Thank you!