Improving Discourse static HTML archive

saurabhp · March 25, 2019, 12:51pm

The recommended approach is to use HTTrack to take a static HTML dump of a Discourse site and host it as an archived static website. However, the layout served to crawlers is not very pretty when hosted as a static site. I will be working on improving that layout and adding the necessary data to the static website. You can see the current crawler layout at https://meta.discourse.org/?escaped_fragment which I will try to improve.

This topic is just a placeholder that I can link from the changes I make, so that anyone reviewing them can get more context.

Let me know if you have any suggestions on this topic.

Thanks

saurabhp · March 29, 2019, 12:33pm

I have created a few pull requests related to this and added screenshots to them:
https://github.com/discourse/discourse/pull/7250
https://github.com/discourse/discourse/pull/7270
https://github.com/discourse/discourse/pull/7286

Let me know if you have any suggestions.

tgxworld · April 2, 2019, 5:02am

Sorry in advance for my question, since I’m not very familiar with HTTrack. Why do we need to use HTTrack to take a dump of the static HTML pages and host that as a static archived website?

saurabhp · April 2, 2019, 6:29am

Hey,
You can go through these links to get more context related to this:

HTTrack will basically just crawl your website and create a static HTML dump which you can host as a static website.
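To make the crawling step concrete, here is a minimal sketch of how a spider might discover the pages to save. It is not HTTrack's actual implementation. The names `LinkCollector` and `same_site_links` and the `forum.example` host are made up for illustration. It only shows link discovery: it parses one page and keeps the links that stay on the same host.

```python
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag seen in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def same_site_links(page_url, html):
    """Return absolute, de-duplicated links that stay on page_url's host."""
    collector = LinkCollector()
    collector.feed(html)
    host = urlparse(page_url).netloc
    links = []
    for href in collector.hrefs:
        # Resolve relative links and drop any #fragment part.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if urlparse(absolute).netloc == host and absolute not in links:
            links.append(absolute)
    return links


if __name__ == "__main__":
    sample = '<a href="/t/topic/1">one</a> <a href="https://other.example/">two</a>'
    print(same_site_links("https://forum.example/latest", sample))
```

A real spider would repeat this for every discovered page, write each response to disk, and rewrite the links to point at the saved files.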

Quoting from the link above on why people want it.

Let me know if you have any other questions.

codinghorror · April 2, 2019, 6:30am

You do not “need” to use the HTTrack tool; you can use recursive wget and other similar command-line Linuxy spidering tools as well.
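As a rough illustration of the recursive wget route, the sketch below assembles one possible command line from Python. The helper name `wget_mirror_command`, the `forum.example` URL, and the `archive` directory are placeholders, and the flag selection is an assumption rather than a recommendation from this thread. The script only builds and prints the command; it does not run it.

```python
import shlex


def wget_mirror_command(site_url, output_dir):
    """Build (but do not run) a recursive wget invocation for site_url."""
    return [
        "wget",
        "--mirror",            # recursion with infinite depth plus timestamping
        "--convert-links",     # rewrite links so the copy can be browsed locally
        "--adjust-extension",  # save HTML pages with an .html suffix
        "--page-requisites",   # also fetch CSS, images, and scripts
        "--no-parent",         # never climb above the starting path
        "--wait=1",            # pause one second between requests
        f"--directory-prefix={output_dir}",
        site_url,
    ]


if __name__ == "__main__":
    # Placeholder site and directory, for illustration only.
    command = wget_mirror_command("https://forum.example/", "archive")
    print(shlex.join(command))
```

To actually run the mirror, the returned list could be passed to `subprocess.run`. Whether the resulting dump uses the crawler layout depends on how the site treats the client, which this sketch does not address.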

saurabhp · April 7, 2019, 6:33am

Just an update regarding this.

All 3 pull requests have been merged. I’m adding screenshots of the new static archive look below. Let me know if you have any suggestions on things to improve.
