Our open-source archive of California COVID data has expanded again.

A nerdy new milestone: 100% of our web scrapers are running free and open via @github's action framework.

That's 80+ routines, written by a half-dozen team members, running in unity.

github.com/datadesk/calif…
@github GitHub's action system, which came into its own during the pandemic, has changed the web scraping game.

This approach allows us to store, script and schedule our data-gathering systems. For free.
@github Here's how it works:

1️⃣ Scrapers repo acquires data.

2️⃣ Private repo pulls it in, processes and publishes tracker. Also via actions, and (almost) entirely automated.

3️⃣ Processed files are pushed to open-source data archive for reuse.

github.com/datadesk/calif…
@github Today's release includes dozens of scrapers that pry data tracking places, neighborhoods and ZIP Codes from county-run dashboards.

They're monitored by an ingenious @rdmurphy hack that automatically files a pull request if something goes wrong.

github.com/datadesk/calif…
And, to be clear, when I say free, I mean both gratis and libre, both beer and speech.

That means this portion of our most ambitious data gathering effort ever has zero cost.

That's because @github doesn't charge for open-source repositories.

docs.github.com/en/billing/man…
This morning, I expanded our export pipeline to include a few more files.

You can now find cleaned up and regularly updating data tracking hospital load and county-level demographic trends in vaccination.

github.com/datadesk/calif…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Ben Welsh

Ben Welsh Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @palewire

8 Oct 19
Alright, nerds

I filed a FOIA appeal and won the infamous NROL-39 surveillance satellite logo as a PDF.

github.com/palewire/nrol-…

This ain't a scanned powerpoint. This is a resizable vector. You know what you must do. Unleash the swag.
One can never be sure, but I think the FOIA officer wanted to make this one happen.
What do you dorks think? Should @caseymmiller be assigned to make the next item in her nerdleisure line?

Read 4 tweets
29 Apr 18
Two weeks ago new @latimes owner @drpatsoonshiong said he will move the newsroom to El Segundo.

The next day I led an off-the-cuff @Twitter tour of the historic #DTLA HQ being left behind. More than 650,000 people followed along.

Here it is, in one post palewi.re/posts/2018/04/…
Reposting the material to my personal blog allows me to better archive the material. And clean up the many typos I littered on the tour via my smartphone keyboard.

I continue to be blown away by the response. I did not expect anyone to care when I started.
Since then I've continued to document as much of the @latimes building as I can before we go.

This thread catalogs the eccentric fixtures, knobs, lights and buttons around the building.
Read 4 tweets
14 Apr 18
The new @latimes owner is moving the newsroom from its historic HQ to El Segundo.

I've been lucky enough to inhabit and explore the interlocking buildings at 1st and Spring for over a decade.

I'd like to share it with you. It's a beautiful day in #DTLA. Shall we take a wander?
The buildings wer constructed and built by the Chandler Family.

The different sections of the block have different cornerstones set by succeeding generations.
The lobby at 2nd and Spring with it's beautiful fixtures, including -30-, the traditional newspaper code for a story's end, hung over the exit.
Read 78 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(