Script to Recover Pages

The Internet Archive Wayback Machine supports a number of different APIs to make it easier for developers to retrieve information about Wayback capture data. archive

We check the Internet Archive for pages that we know existed, based on our own federation scrape. github
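As a rough sketch of what one such check could look like, the snippet below queries the Wayback availability endpoint for a single page. This is not necessarily how check-archive.js does it. The helper names (availabilityUrl, closestSnapshot, checkPage) are hypothetical, and the assumption that a wiki page is captured at `site/slug.html` is ours.

```javascript
// Sketch only: helper names are hypothetical, not taken from check-archive.js.

// Build a Wayback availability query for one wiki page.
// Assumption: the page was captured at `${site}/${slug}.html`.
function availabilityUrl(site, slug) {
  const target = `${site}/${slug}.html`;
  return `https://archive.org/wayback/available?url=${encodeURIComponent(target)}`;
}

// Return the closest capture from an availability response, or null if none.
function closestSnapshot(response) {
  const closest = response?.archived_snapshots?.closest;
  return closest && closest.available ? closest : null;
}

// Ask the archive about one page. Needs network access (deno run --allow-net).
async function checkPage(site, slug) {
  const res = await fetch(availabilityUrl(site, slug));
  if (!res.ok) throw new Error(`availability lookup failed: ${res.status}`);
  return closestSnapshot(await res.json());
}
```

Calling `await checkPage('ward.dojo.fed.wiki', 'welcome-visitors')` would then yield either a capture record with its `url` and `timestamp`, or null when the archive has nothing for that page.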

The deno command line will run this script for any site passed in as the first parameter. deno

deno run --allow-net --reload 'https://raw.githubusercontent.com/WardCunningham/assets/master/pages/script-to-recover-pages/check-archive.js' ward.dojo.fed.wiki > export.json

This will report progress then write an export.json file that can be dropped on any wiki to view or restore pages.
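For readers curious about what lands in export.json, here is a minimal sketch under stated assumptions. We assume the export is a JSON object keyed by page slug, with each value a page holding a title, a story and a journal, and that a partially restored page carries its synopsis as one paragraph item. The helper names (randomId, asSlug, partialPage, buildExport) are hypothetical, and the real script may build its output differently.

```javascript
// Sketch only: the export shape and all helper names here are assumptions.

// 16 hex characters, in the style of wiki item ids.
function randomId() {
  let id = "";
  for (let i = 0; i < 16; i++) id += Math.floor(Math.random() * 16).toString(16);
  return id;
}

// Turn a page title into a slug: whitespace to dashes, drop punctuation, lowercase.
function asSlug(title) {
  return title.replace(/\s/g, "-").replace(/[^A-Za-z0-9-]/g, "").toLowerCase();
}

// A partial page: the recovered synopsis becomes its only paragraph.
function partialPage(title, synopsis, date) {
  const story = [{ type: "paragraph", id: randomId(), text: synopsis }];
  const journal = [{ type: "create", item: { title, story }, date }];
  return { title, story, journal };
}

// Collect recovered { title, synopsis } records into one export object.
function buildExport(records, date = Date.now()) {
  const out = {};
  for (const { title, synopsis } of records) {
    out[asSlug(title)] = partialPage(title, synopsis, date);
  }
  return out;
}
```

Printing `JSON.stringify(buildExport(records), null, 2)` to standard output and redirecting it to a file would produce something in the spirit of the export.json above.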

272 pages indexed in search
27 pages available in wayback
27 html pages retrieved
27 partial pages restored

For now we restore only the synopsis of each page, but we are willing to try to dig more out of the HTML if that would be useful. See this import for an idea of what we have working now.