Ben Lee Profile picture
Assistant Professor @uw_ischool | essays in @gawker @WIRED @curaffairs etc. | https://t.co/qOVo88RbdJ
Nov 18, 2025 8 tweets 4 min read
1/ Announcing GovScape – multimodal search for 10 million government PDFs (70 million pages) from the End of Term Web Archive! GovScape offers visual search, semantic textual search, and keyword search.

Website: govscape.net
ArXiv link: arxiv.org/abs/2511.11010 Image 2/ GovScape is built on top of the End of Term Web Archive () and contains all renderable PDFs of length 50 pages or fewer from the 2020 crawl, documenting the first Trump administration. An overview of GovScape’s search functionality can be found here: eotarchive.orgImage
Sep 15, 2020 10 tweets 9 min read
1/ With @LC_Labs, #NDNP, and @dsweld, I’m excited to share the Newspaper Navigator search app: train your own AI navigators to search over 1.5 million historic newspaper photos by visual similarity! (desktop viewing recommended) #ChronAm

news-navigator.labs.loc.gov/search A screenshot of an AI navigator trained to retrieve images o 2/ For the first phase of my @librarycongress Innovator in Residence project, I created the #NewspaperNavigator dataset: extracted visual content from 16+ million newspaper pages in #ChronAm. This search app provides new ways of searching the dataset.

news-navigator.labs.loc.gov