@WebSciDL will be represented by:
* @acnwala
* @ibnesayeed
* @OpenMaze
* @weiglemc
* @phonedude_mln
We'll be presenting 3 full papers, 1 poster, and 2 #WADL2019 workshop presentations.
Links below.
Social media posts with lots of links demonstrate domain expertise & produce quality seeds otherwise missed w/ SERPs & hashtags
arxiv.org/abs/1905.12220
github.com/anwala/MicroCo…
#JCDL2019
Store fixity info about archived resources in... archives... *different* archives. Evaluates two methods: Block (faster) & Atomic (easier for humans to inspect).
arxiv.org/abs/1905.12565
github.com/oduwsdl/archiv…
#JCDL2019
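A rough sketch of the fixity idea, not the paper's implementation: hash the archived payload, keep the record somewhere else, and re-check it later. The record fields, the function names, and the way "atomic" (one manifest per resource) and "block" (many records batched into one manifest) are modeled here are illustrative assumptions only.

```python
import hashlib
import json


def fixity_record(uri, captured_at, body):
    """Build a fixity record for one archived resource (hypothetical fields)."""
    return {
        "uri": uri,
        "captured_at": captured_at,
        "sha256": hashlib.sha256(body).hexdigest(),
    }


def atomic_manifests(records):
    """Assumed 'atomic' style: one small, human-readable manifest per resource."""
    return [json.dumps(r, sort_keys=True) for r in records]


def block_manifest(records):
    """Assumed 'block' style: many records batched into a single manifest."""
    ordered = sorted(records, key=lambda r: r["uri"])
    return json.dumps(ordered, sort_keys=True)


def verify(record, body):
    """Re-hash the payload and compare it with the stored digest."""
    return record["sha256"] == hashlib.sha256(body).hexdigest()


records = [
    fixity_record("http://example.com/a", "20190601000000", b"page A"),
    fixity_record("http://example.com/b", "20190601000000", b"page B"),
]
```

Either manifest form could then be pushed to a *different* archive; `verify(records[0], b"page A")` checks a later replay against the stored digest.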
Example: @ArquivoWeb_PT has 4.9B mementos, but only 3% of MemGator's 5M requests are hits there.
MementoMap describes what archives *actually* hold.
arxiv.org/abs/1905.12607
github.com/oduwsdl/Mement…
#JCDL2019
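To illustrate why a holdings summary helps, here is a minimal sketch of an aggregator checking a per-archive summary before sending a request. It does not reproduce the actual MementoMap format; the SURT-style host keys, the `summary` set, and both function names are assumptions made for this example.

```python
def surt_host(host):
    """Reverse a hostname into a SURT-style key, e.g. 'www.example.pt' -> 'pt,example,www'."""
    return ",".join(reversed(host.lower().split(".")))


def likely_holds(summary, host):
    """Return True if any prefix of the host's key appears in the archive's summary."""
    parts = surt_host(host).split(",")
    # Try the most specific key first, then progressively shorter prefixes.
    for end in range(len(parts), 0, -1):
        if ",".join(parts[:end]) in summary:
            return True
    return False


# Hypothetical summary for an archive that mostly holds .pt sites plus one .com domain.
summary = {"pt", "com,example"}
```

With a summary like this, an aggregator could skip the archive for `www.bbc.co.uk` and still query it for `www.publico.pt`, instead of sending every request and getting mostly misses.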
DFS: file system to standardize the metadata representation of datasets
DDU: scalable architecture based on DFS for semi-automated metadata generation & data recommendation in the cloud
cs.odu.edu/~sampath/publi…
#JCDL2019
#WADL2019 workshop presentations:
* MementoMap (described above)
* Our "cookie violations" blog post about how Heritrix-based web archives have difficulty with how Twitter supports multiple languages: ws-dl.blogspot.com/2019/03/2019-0…
#JCDL2019