I’m using other forums which based on Discourse and some of them have great discussion that I want to save/archive for future reference.
I can think of 1 way: start at the first discussion, scroll all the way back to the last messages, then save the page. But it’s very time consuming and not efficient. Especially when there are more discussion to come after my save, I have to repeat the whole process again.
Is there anyway I can do using Linux command line? Or is there any tool that is useful for this purpose?
Thank you very much. I’d appreciate your suggestion.
Here is another way to do this for Discourse and just about any other webpage:
Also available for Chrome. It takes a complete webpage as it currently appears in the browser and saves it as an HTML file with all images and other static content preserved.
To view the archive, you just open up the HTML file from your computer in a web browser.
This by itself would not be a problem, because the Save Page WE extension captures the current state of the DOM…
…but this will cause an incomplete archive, yes.
Ctrl+P is not working for me either, I get one cut-off reply on the first page and a blank second page. Given the JS-scroll behavior I think this wouldn’t work either.
I also tried loading a thread with JavaScript disabled. This gives a very simplified display with 20 posts shown.
With the print shortcut, Discourse switches to a printer friendly version of the style sheet, shows you a preview and then opens up the print dialog.
Other way you can see it in action is by just appending “print” to the end of the topic URL. And then printing it the way you want. When I do this in Safari on my iPad for the topic you shared, it shows me a preview of a 115-page PDF in a printer friendly format.
Yes I was using Cmd-P; that brings up the same dialog as your “/print” suggestion.
What is interesting though is I was not waiting for the full preview to load until I proceeded to print to PDF. That seems to be why I got an incomplete 7-page / 2Mb report.
On Chrome with my iMac and 32Gb RAM, it eventually fails with this message:
But on my Mac Pro with 128Gb RAM, if I am patient the full 101 page / 42 Mb printout appears and is beautiful:
So yes, you are right that you can print from PDF with Chrome - but on long threads, it is resource intensive. I am surprised that on the 32Gb Mac it just fails rather than swapping to disk and finishing the job.
I am pretty sure it is not resource intensive. I have had no trouble loading the previews in machines with RAM as low as 4GB, including the iPad I mentioned which is the cheapest of them all. It may be something to do with your browser configuration or some extensions you may be using. But I will let you figure that out.
I’m just curious but when I open that page on iPhone and use Print function of Safari, it just shows 8 pages. I also tested on my iPad but it only shows 9 pages.
Could you please show me the steps you took to get a preview of that 115-page PDF? I just want to learn if I miss something.