The computer science maxim "garbage in, garbage out," (#GIGO) dates back at least as far as 1957. It's an iron law of computing: no matter how powerful your data-processing system is, if you feed it low-quality data, you'll get low-quality conclusions.

1/ Image
And of course, machine learning (AKA "AI") (ugh) does not repeal GIGO. Far from it. ML systems that operate on garbage data produce garbage predictive models, which produce garbage conclusions at vast scale, coated with a veneer of algorithmic objectivity facewash.

2/
The scale and credibility of ML-derived GIGO presents huge risks to our society in domains as varied as the credit system, criminal justice, hiring, education - even whether your kids will be taken away by Child Protective Services.

3/
To make this all worse, the vast data-sets used to train ML systems are in scarce supply, which leads multiple ML models to be trained on the same data, enshrining the defects of that data in all kinds of systems.

4/
One of the most significant training datasets is Imagenet, a collection of 14m labeled images that jumpstarted the ML revolution in 2012. As @willknight writes for @WIRED, Imagenet's labels came from low-waged, undersupervised workers.

wired.com/story/foundati…

5/
Imagenet is one of the data-sets examined in new research from MIT's Curtis Northcutt and colleagues, who found that Imagenet and other comparable datasets have a typical error rate of about 6%.

TK

6/
This small margin of error has big consequences: first, because the errors aren't evenly distributed, and instead cluster around the kinds of biases that labelers have (for example, labeling images of woman medical professionals with "nurse" and men with "doctor").

7/
And second, because the incorrect labels obscure relative performance differences between models. When one model does better than another, you can't know if that's because it is a better model, or because it's less sensitive to incorrect labels.

8/
ETA - If you'd like an unrolled version of this thread to read or share, here's a link to it on pluralistic.net, my surveillance-free, ad-free, tracker-free blog:

pluralistic.net/2021/03/31/vac…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Cory Doctorow

Cory Doctorow Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @doctorow

2 Apr
Today's Twitter threads (a Twitter thread).

Inside: The zombie economy and digital arm-breakers; and more!

Archived at: pluralistic.net/2021/04/02/inn…

#Pluralistic

1/ Image
The zombie economy and digital arm-breakers: Debts that can't be paid won't be paid, but still...



2/ Image
#5yrsago 33 state Democratic parties launder $26M from millionaires for Hillary counterpunch.org/2016/04/01/how…

#1yrago Turn on wifi sharing pluralistic.net/2020/04/02/eff…

#1yrago How you subsidize the otherwise unprofitable Fox News pluralistic.net/2020/04/02/eff…

3/ Image
Read 15 tweets
2 Apr
It's a zombie economy. For 40 years, we've eroded the wages of workers and transfered their share of profit and productivity to owners of capital. This is a problem, because people need money to buy things, and if they run out of money, they stop buying and profits vanish.

1/ Image
Time and again, capitalism has kicked any reckoning over this down the road. First came the great liquidation: pension cashouts, raided savings, reverse mortgages. Then came consumer borrowing, a tidal wave of unrepayable debt.

2/
That's the zombie part: all the unpayable debt, which has been turned into bonds that enrich debt-holders. As Michael Hudson has told us again and again, debt that can't be paid, won't be paid. Our debt-based economy is the walking dead, a zombie.

3/
Read 49 tweets
1 Apr
SUPERMAN: THE MOVIE (1978)
dir. Richard Donner
atomic-chronoscaph.tumblr.com/post/647295745…
SUPERMAN: THE MOVIE (1978)
dir. Richard Donner
atomic-chronoscaph.tumblr.com/post/647295745…
SUPERMAN: THE MOVIE (1978)
dir. Richard Donner
atomic-chronoscaph.tumblr.com/post/647295745…
Read 9 tweets
1 Apr
Today's Twitter threads (a Twitter thread).

Inside: Ontario's drug-dealer premier is shockingly bad at distributing vaccines; and more!

Archived at: pluralistic.net/2021/04/01/inc…

#Pluralistic

1/ Image
Ontario's drug-dealer premier is shockingly bad at distributing vaccines: How is it that Doug Ford was so good at slinging hash, and is so bad at vaccinating?



2/
#5yrsago Among a Thousand Fireflies: children’s book shows the sweet, alien love stories unfolding in our own backyards memex.craphound.com/2016/04/01/amo…

#1yrago Snowden’s Box: the incredible, illuminating story of the journey of Snowden’s hard drive memex.craphound.com/2020/03/31/sno…

3/ Image
Read 14 tweets
1 Apr
Ontario politics are a wild ride, but they rarely escape the province, or, at most, the nation. Which is weird, because Ontario has been a leading indicator of neoliberalism's cruelty, paranoia, and surrealism since (at least) the mid-nineties.

1/ Image
Start with the 1995 election of Conservative Premier Mike Harris, a bland, dead-eyed sociopath whose "Common Sense Revolution" slashed Ontario's excellent public services and implemented a forced-labor program for poor people, AKA "workfare."

2/
Harris was a Romneyish sort of fellow: a personality-free, interchangeable suit who didn't raise anyone's pulse but excelled at administration. His major achievement was the amalgamation of Toronto: a forced merger of the City of Toronto with its heretofore separate suburbs.

3/
Read 31 tweets
1 Apr
The Muppet Show (1976), “Mark Hamill” adventurelandia.tumblr.com/post/647267540…
The Muppet Show (1976), “Mark Hamill” adventurelandia.tumblr.com/post/647267540…
The Muppet Show (1976), “Mark Hamill” adventurelandia.tumblr.com/post/647267540…
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!