Unable to archive Discourse pages with the Internet Archive...?


(Evert Meulie) #1

I noticed something peculiar: when I try to archive a page of a Discourse installation via http://web.archive.org/ , it seemingly succeeds, but the archived copy renders as a blank page.
Since the Internet Archive has no problem with any of the other pages & sites that I’ve tried to archive, I suspect the cause is something in the source code of Discourse pages.

Example: I tried to archive "Multisite configuration with Docker".
Result: Multisite configuration with Docker - Discourse Meta
(The source of that result does seem to contain the page though)

I’ll also pass this on to the Internet Archive, in case they happen to know what causes this peculiar behaviour.


Unable to archive Discourse with the Internet Archive "Save Page Now" button
(Erlend Sogge Heggen) #2

We should somehow tell the Internet Archive crawler to grab the no-JS version. Even if archiving works with JS, only a few pages will be stored, as exemplified by this NodeBB archive.
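
To make the idea concrete, here is a rough sketch of picking the no-JS version based on the request's user agent. This is not how Discourse actually does it; the function names, the layout names and the list of user-agent fragments are all assumptions made up for illustration.

```python
# Hypothetical list of user-agent fragments that identify crawlers.
# These values are assumptions for illustration, not taken from Discourse.
CRAWLER_FRAGMENTS = ("googlebot", "archive.org_bot", "ia_archiver")


def wants_static_html(user_agent):
    """Return True when the request looks like it comes from a crawler."""
    ua = (user_agent or "").lower()
    return any(fragment in ua for fragment in CRAWLER_FRAGMENTS)


def choose_layout(user_agent):
    """Pick the plain-HTML layout for crawlers, the JS app for everyone else."""
    # "crawler" and "application" are placeholder layout names.
    return "crawler" if wants_static_html(user_agent) else "application"
```

A request whose user agent contains one of the fragments would get the plain-HTML layout; any other request, including one with no user agent at all, would get the normal JS application.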


(Jeff Atwood) #3

Yes, let us know what they reply. Google has no trouble getting to the pages, so hopefully the Internet Archive can too.


(Sam Saffron) #4

What is the status on this?


(Kane York) #5

status: PR submitted

https://github.com/discourse/discourse/pull/2941


(Sam Saffron) #6

This seems fine, closing.


(Jeff Atwood) #9