In addition to the excellent tools from the tweeted picture, there are many #webarchiving tools in development by @WebSciDL and the @LosAlamosNatLab Research Library Prototeam.
github.com/oduwsdl
github.com/mementoweb

Some examples below #eResearchNZ19 #DH #eResearch
Ever wonder which archives have a capture of a given URL? What about for a specific time? Do you want to search multiple #webarchives at once?

The Memento TimeTravel service run by @LosAlamosNatLab uses the #Memento protocol for just that: timetravel.mementoweb.org

#webarchiving
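
For the curious: the #Memento protocol (RFC 7089) works by datetime negotiation. A client sends an `Accept-Datetime` header to a TimeGate, which points it to the capture closest to that time. Below is a minimal Python sketch that only builds such a request and does not send it. The TimeGate prefix `http://timetravel.mementoweb.org/timegate/` is an assumption on my part, and the helper names are made up for illustration, so check the service documentation before relying on either.

```python
from datetime import datetime, timezone
from email.utils import format_datetime
from urllib.request import Request

# Assumed TimeGate prefix for the Time Travel service; verify before use.
TIMEGATE_PREFIX = "http://timetravel.mementoweb.org/timegate/"


def accept_datetime(when):
    """Format an aware datetime as the HTTP date used by Accept-Datetime."""
    # Convert to UTC first so the header always ends in "GMT".
    return format_datetime(when.astimezone(timezone.utc), usegmt=True)


def build_timegate_request(url, when):
    """Build (but do not send) a datetime-negotiation request for url."""
    return Request(
        TIMEGATE_PREFIX + url,
        headers={"Accept-Datetime": accept_datetime(when)},
    )


if __name__ == "__main__":
    req = build_timegate_request(
        "http://example.com/",
        datetime(2015, 1, 1, tzinfo=timezone.utc),
    )
    print(req.full_url)
    # urllib stores header names capitalized as "Accept-datetime".
    print(req.get_header("Accept-datetime"))
```

Passing the request to `urllib.request.urlopen` would perform the actual lookup; the archive's response carries a `Memento-Datetime` header with the capture time.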
From your own machine, MemGator, by @ibnesayeed and @machawk1, also allows one to search across #webarchives using the #Memento protocol.

github.com/oduwsdl/MemGat…

ieee-tcdl.org/Bulletin/v13n1…

#webarchiving
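
Memento aggregators typically answer with a TimeMap, a list of captures in link format. Here is a rough Python sketch of pulling the mementos out of one. The sample TimeMap is invented for illustration (the `archive.example` host is not real), and the regular expressions cover only the simple quoted-attribute form shown here, not the full link-format grammar.

```python
import re

# One TimeMap entry looks like: <URI>; attr="value"; attr="value"
LINK_RE = re.compile(r'<([^>]*)>((?:\s*;\s*[\w-]+="[^"]*")*)')
ATTR_RE = re.compile(r'([\w-]+)="([^"]*)"')

# Invented sample data; a real TimeMap would come from an aggregator.
SAMPLE = (
    '<http://example.com/>; rel="original",\n'
    '<http://archive.example/timemap/link/http://example.com/>; '
    'rel="self"; type="application/link-format",\n'
    '<http://archive.example/web/20100101000000/http://example.com/>; '
    'rel="first memento"; datetime="Fri, 01 Jan 2010 00:00:00 GMT",\n'
    '<http://archive.example/web/20150101000000/http://example.com/>; '
    'rel="last memento"; datetime="Thu, 01 Jan 2015 00:00:00 GMT"'
)


def parse_timemap(text):
    """Return one dict per link, holding its URI and attributes."""
    links = []
    for uri, attrs in LINK_RE.findall(text):
        entry = {"uri": uri}
        entry.update(ATTR_RE.findall(attrs))
        links.append(entry)
    return links


def mementos(text):
    """Return (datetime, uri) for every link whose rel includes 'memento'."""
    return [
        (link.get("datetime"), link["uri"])
        for link in parse_timemap(text)
        if "memento" in link.get("rel", "").split()
    ]


if __name__ == "__main__":
    for when, uri in mementos(SAMPLE):
        print(when, uri)
```

The `rel` value is split on whitespace because a single link can carry several relations, such as `first memento`.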
MementoEmbed produces a social card or thumbnail of a #memento from #webarchives for inclusion in blog posts and other web pages. It is by @shawnmjones, with contributions by @ibnesayeed and @machawk1.

github.com/oduwsdl/Mement…

ws-dl.blogspot.com/2018/08/2018-0…

#webarchiving
For seamlessly browsing the web of the past, I use the Chrome web extension developed by @hvdsomp and @hariharshankar from @LosAlamosNatLab.

bit.ly/memento-for-ch…

Just pick a date and browse the web as if it were that date!

#webarchiving
ArchiveNow saves a live web resource to multiple #webarchives at once! It is by @maturban1, with contributions by @machawk1, @ibnesayeed, @ruebot, and myano.

github.com/oduwsdl/archiv…

ws-dl.blogspot.com/2017/02/2017-0…

#webarchiving
InterPlanetary Wayback (IPWB) disseminates WARCs via the distributed web's InterPlanetary File System (IPFS). It is a project by @machawk1 and @ibnesayeed.

github.com/oduwsdl/ipwb

slideshare.net/ibnesayeed/int…

#webarchiving
CarbonDate allows one to find the earliest existence of a webpage using #webarchives and other services. It is by @hanysalaheldeen, @acnwala, and @grantcatkins, with contributions by DarkAngelZT, @ibnesayeed, and @machawk1.

github.com/oduwsdl/Carbon…

ws-dl.blogspot.com/2017/09/2017-0…

#webarchiving
Archive-It Utilities (AIU), by @shawnmjones, extracts metadata from @archiveitorg collections.

github.com/oduwsdl/archiv…

ws-dl.blogspot.com/2018/07/2018-0…

#webarchiving
TMVis provides a visualization of thumbnails from #webarchives, allowing users to see changes over time. It is based on a paper by @aalsum, and work by @machawk1, @mir_smi, and @mrgunn.

github.com/oduwsdl/tmvis

ws-dl.blogspot.com/2017/10/2017-1…

#webarchiving
MementoDamage, a service for quantifying #memento quality, is based on research by @justinfbrunelle. The code was developed by @erikaris, with contributions by @ibnesayeed, @machawk1, @grantcatkins, and soedomoto.

github.com/oduwsdl/web-me…

ws-dl.blogspot.com/2017/11/2017-1…

#webarchiving
The Off-Topic-Memento-Toolkit (OTMT) detects soft-404 pages and other off-topic pages in a web archive collection. It is based on work by @yasmina_anwar with code and further research by @shawnmjones.

github.com/oduwsdl/off-to…

doi.org/10.17605/OSF.I…

#webarchiving
.@machawk1 has created #webarchiving Chrome extensions:
* Create WARCs from Chrome with WARCreate: warcreate.com
* Find archived versions of your current page, and/or save the current page to multiple web archives with Mink: chrome.google.com/webstore/detai…

#webarchiving
.@WebSciDL also has Twitter services, like @_wdill and @icanhazmemento run by @acnwala.

@_wdill shows web page evolution over time: whatdiditlooklike.mementoweb.org

@icanhazmemento allows one to save a link to #webarchives from within Twitter!
ws-dl.blogspot.com/2015/07/2015-0…

#webarchiving
A presentation on the @WebSciDL #webarchiving tools was given at WADL as part of @jcdl2018 by @weiglemc:
slideshare.net/mweigle/enabli…

She also wrote about some of the tools for @SSRC_org: parameters.ssrc.org/2018/09/on-the…
This is not a comprehensive list of tools (or even @WebSciDL projects), but I hope some find this list to be useful. Many of these tools address use cases that most people didn't know that they had. I hope one or more finds its way into your #webarchiving toolkit.
I also use AUT (archivesunleashed.org/aut/) by @unleasharchives, warcio (pypi.org/project/warcio/) by @IlyaKreymer, and @webrecorder_io. I have not tried the new release of #Heritrix yet.

If you know of other #webarchiving tools I should try, please comment.