Doncaster Templestowe Historical Society: Backup DTHS Blogger Website

Method 1: Use Blogger to backup code (i.e. no images) in XML format. Suggest backup every 6 months.

Settings - Backup Content

Download

An XML file is downloaded to the computer

The XML File contains all formatting, text and links from pages, BUT not images or videos

Method 2: Use a page downloader to save PDF copies of all pages (includes images)

Clear the downloads folder of the computer in readiness for downloading the site web pages.

Install a browser extension link extractor like LinkClump

Go to the Website SiteMap

Hold down the z key on the keyboard and the left browser button while you drag a selection rectangle around the whole list of site pages.

Release the button and the z key.

Open a text file editor (e.g textedit or notepad.

Paste the links into the pad.

Go to sejda (URL to PDF conversion service)

Pay for 1 week access ($8 as at June 2022)

Go to the HTML to PDF page.

Copy 100 links:

Paste the 100 links into the text box:

Scroll down to the bottom and click "Options", then check the options are correct:

Click "Convert HTML to PDF"

After about 5 minutes, the task will complete and the compressed file can be downloaded:

Extract the files to see a pdf created from each web page including images and links.

Repeat the process for the next 100 links until all pages have been downloaded.

When finished, delete the files from the previous DTHS Website Page Backup, and paste the new backup into the folder.

This process, not only safeguards the information in the website, BUT also enables all website content to be included in the offline "Research Resources" so that a single offline desktop search covers all DTHS files whether on they are online or not.

Errors

If you get a timeout message from sejda, then eliminate this page from the download list and repeat the download.

Print the problem page manually.

Split the page content across 2 pages to reduce the size of the webpage before the next download.

Pages

Backup DTHS Blogger Website

Method 1: Use Blogger to backup code (i.e. no images) in XML format. Suggest backup every 6 months.

Method 2: Use a page downloader to save PDF copies of all pages (includes images)

Errors

No comments: