How to get the Wayback Machine to instantly archive updates to your web page

Discussion in 'How To' started by Anonymous, Jul 24, 2013.

  1. Anonymous Member

    These steps are specifically for web pages that already have older versions on archive.org.
    (For a page that isn't on it yet, prefix the page's URL with http://web.archive.org/liveweb/ to get it on, then go to step 7.)
    1. Create a new bookmark in your web browser, name it "Wayback" or "Archive" or whatever, and for the URL (location) paste this in:
      Code:
      javascript:void(location.href='http://wwwb-live-lb.us.archive.org:3129/_web/'+location.href);
    2. Move your new bookmarklet to the bookmarks toolbar or wherever in the UI is easiest to access.
    3. Go to the web page you want archived, wait for it to finish loading, then click your bookmarklet. (This is enough to get it archived; the remaining steps show how to check that it worked.)
    4. You should get a dialog asking what to do with the file. Choose "Save" and rename the file so that it ends in .txt.gz
    5. Decompress the file with any good tool, preferably an open-source one such as PeaZip, and open it in any text editor.
    6. The first line of the file should look like this example:
      Code:
      http://whyweprotest.wikia.com/wiki/Chronology_of_publications_on_Scientology 199.27.77.194 20130724152320 text/html 261334
      The part you need is the timestamp between the IP address and text/html. In this example it's 20130724152320 (that is, 2013-07-24 15:23:20).
    7. Wait anywhere from a few days to a few weeks, depending on how heavily loaded archive.org is.
    8. Look up all copies of your chosen web page on the Wayback Machine. The example would appear at http://web.archive.org/web/*/http:/...iki/Chronology_of_publications_on_Scientology and you should see the string from step 6 in the URL of the matching date in the calendar view. Click it to view your archived web page.
    NOTES: No guarantees this will work forever, especially if/when archive.org changes any part of its process. In fact, at the time of this post the example archived web page hasn't appeared yet. Also, sometimes the file you're prompted to download is something other than a .txt.gz file, but I've found it still triggers the Wayback Machine to archive a copy.
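    If you'd rather not pick the timestamp out of that first line by hand, here's a rough sketch that does it. It assumes the line always has the five space-separated fields shown in the step 6 example (URL, IP address, timestamp, MIME type, size), and that the archived copy ends up at the usual http://web.archive.org/web/<timestamp>/<url> address. The function names parseCaptureLine and captureUrl are made up for this example; they are not anything archive.org provides.

```javascript
// Split the first line of the downloaded file into its fields.
// Assumed layout (from the step 6 example): <url> <ip> <timestamp> <mime type> <size>
function parseCaptureLine(line) {
  const fields = line.trim().split(/\s+/);
  if (fields.length < 5) {
    throw new Error('Expected 5 space-separated fields, got ' + fields.length);
  }
  const [url, ip, timestamp, mimeType, size] = fields;
  // Wayback timestamps are 14 digits: YYYYMMDDhhmmss
  if (!/^\d{14}$/.test(timestamp)) {
    throw new Error('Third field does not look like a timestamp: ' + timestamp);
  }
  return { url, ip, timestamp, mimeType, size: Number(size) };
}

// Build the address where the archived copy should eventually show up (step 8).
function captureUrl(capture) {
  return 'http://web.archive.org/web/' + capture.timestamp + '/' + capture.url;
}

// Usage with the example line from step 6:
const example = parseCaptureLine(
  'http://whyweprotest.wikia.com/wiki/Chronology_of_publications_on_Scientology ' +
  '199.27.77.194 20130724152320 text/html 261334'
);
console.log(captureUrl(example));
```

    As the notes say, the download is sometimes not a .txt.gz at all, so treat a parse error as "can't check this way" rather than "archiving failed".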
  2. Anonymous Member

    A couple of extra useful bookmarklets:
    Here's the one to use on an as-yet-unarchived web page:
    Code:
    javascript:void(location.href='http://web.archive.org/liveweb/'+location.href);
    If it's already archived, you'll just get redirected to the latest archived copy.

    This one is to get to the calendar view of all archived copies of a page:
    Code:
    javascript:void(location.href='http://web.archive.org/web/*/'+location.href);
    Another useful tip: To see the raw copy of the page without the archive.org overlay, insert im_ into the URL, so for example:
    http://web.archive.org/web/20130626030834/http://vorb.is/
    becomes
    http://web.archive.org/web/20130626030834im_/http://vorb.is/
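    The im_ trick can also be done with a small function instead of editing the URL by hand. This is only a sketch: it assumes the archived URL has the form shown above (a 14-digit timestamp right after /web/), and rawCopyUrl is just a name picked for this example.

```javascript
// Insert "im_" after the 14-digit timestamp of a Wayback Machine URL
// so the page is served without the archive.org overlay.
function rawCopyUrl(archivedUrl) {
  const match = archivedUrl.match(
    /^(https?:\/\/web\.archive\.org\/web\/\d{14})(\/.*)$/
  );
  if (!match) {
    // Also thrown for URLs that already carry a modifier such as im_
    throw new Error('Not a plain archived-page URL: ' + archivedUrl);
  }
  return match[1] + 'im_' + match[2];
}

// Usage with the example above:
console.log(rawCopyUrl('http://web.archive.org/web/20130626030834/http://vorb.is/'));
```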
  3. Anonymous Member

    This is relevant to my interests. Thank you.
  4. Anonymous Member

  5. Anonymous Member

  6. Anonymous Member

    The Wayback Machine has just switched to using https by default, and has made it a bit easier to instantly save a web page to archive.org.
    The earlier archiving bookmarklet stopped working, but I found out you need to change its URL to this:
    Code:
    javascript:void(location.href='https://web.archive.org/save/'+location.href);
    Testing shows it works on previously archived pages even better than the original bookmarklet: after you use it on your page, you'll be auto-redirected to the URL of the saved copy.
    The calendar-view bookmarklet still works for now, and if it's used on a page that hasn't been archived, the Wayback Machine now offers a link to click for instantly adding it to archive.org.
    If it stops working, just alter it to use https like so:
    Code:
    javascript:void(location.href='https://web.archive.org/web/*/'+location.href);
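    For anyone who wants to see what these two https bookmarklets do, here they are written out as ordinary functions. This is only a sketch: saveUrl and calendarUrl are names made up for the example, and like the bookmarklets they simply stick the page address onto the end without any encoding.

```javascript
const WAYBACK = 'https://web.archive.org';

// Address that asks the Wayback Machine to save a fresh copy of the page.
function saveUrl(pageUrl) {
  return WAYBACK + '/save/' + pageUrl;
}

// Address of the calendar view listing every archived copy of the page.
function calendarUrl(pageUrl) {
  return WAYBACK + '/web/*/' + pageUrl;
}

// In a browser, the bookmarklet is equivalent to:
//   location.href = saveUrl(location.href);
console.log(saveUrl('http://vorb.is/'));
console.log(calendarUrl('http://vorb.is/'));
```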
  7. Anonymous Member

  8. Anonymous Member

    Bump for useful knowledge. Bookmarked.
