Can 'download my posts' output have consistently useful inline images?

(Eli the Bearded) #1

Here on Meta, the “download my posts” files have a mostly complete URL in the src attribute of uploaded images, lacking only scheme, something like this:

<img src="//

But on other sites, such as the Discourse hosted the URLs lack even a hostname:

<img src='/uploads/nextthing/original/2X/f/f2cf3de9...

(I notice that the nextthing .csv file has some src="..." and some src='...', while I only see the first form in the .csv file from Meta.)

Full URL for image in text/plain emails
(Régis Hanol) #2

There should be the hostname (or the CDN if enabled) for local uploads… is that from an old post? Does the URL change after you rebuild the HTML of that post?

(Eli the Bearded) #3

That was output from the profile button “Download my Posts” from two sites that I am a user on, not an admin.

Both of those downloads were made yesterday. I’ve got such post archives from a number of sites, but not all of them do I have uploaded images on. I have started to consider also archiving the images I have with the posts, and to do that I need to parse out the HTML. It would be helpful if Discourse would use a consistent format.

(Jeff Atwood) #4

Is this still an issue?

(Eli the Bearded) #5

Yes, a fresh download of Meta and Nextthing posts show both are in the same format.

(Jeff Atwood) #6