I would like to re process all imported posts (700,000 + posts) …
Basic problem was that a super large number of images were from photobucket but the import process didn’t handle them correctly.
Where code like this:
Was turned into this:
So photobucket is the easy ones to handle because of the similarity of the URL’s - so I’ll tackle that one first and then look for other broken items.
… basically I would like loop over all posts (PM’s, forum content) and:
- Run a regular regex replace creating a new in-memory string (revised_content) based on the latest revision (original_content)
- If revised_content is different to the original_content
- Add new revision with revised_content
- This action should not bump the topics activity date.
Just wondering if anybody has done this kind of thing already and has a code-snippet or two to point me at before I get started?