Charlie
Mar 16 · 3 tweets · 2 min read
50 years ago statistician Frank Anscombe warned us that #dataviz is essential to good statistical analysis

To demonstrate the point he generated what is now known as “Anscombe’s quartet”.

Four datasets that share 10 statistical measures but are from different populations (1/2)
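
To see the point in numbers, here is a minimal Python sketch that computes a handful of summary statistics for each of the four datasets. The data values are the ones published for Anscombe's quartet; the names `ANSCOMBE` and `summarize` are made up for this illustration, and only a few of the shared measures are computed.

```python
from math import sqrt
from statistics import mean, variance

# Anscombe's quartet. Sets I-III share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
ANSCOMBE = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Return a few summary statistics for paired samples x and y."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx  # least-squares slope of y on x
    return {
        "mean_x": mx,
        "mean_y": my,
        "var_x": variance(x),  # sample variance (n - 1 denominator)
        "var_y": variance(y),
        "corr": sxy / sqrt(sxx * syy),  # Pearson correlation
        "slope": slope,
        "intercept": my - slope * mx,
    }


for name, (x, y) in ANSCOMBE.items():
    stats = summarize(x, y)
    print(name, {k: round(v, 2) for k, v in stats.items()})
```

The four printed rows come out nearly identical (a fitted line of roughly y = 3 + 0.5x in every case), and nothing in them hints at how different the scatter plots look. That gap is what only a chart reveals.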
In 2016 @AlbertoCairo produced the Datasaurus, which was later extended into the Datasaurus Dozen, to make this point even more dramatically.

It’s absolutely wild that 50 years later we still need to communicate this to folks who have recently been awarded STEM degrees.

Dataviz needs to be a fundamental part of data education
But also, we need to be forgiving of instructors who already have overburdened syllabi and would find it hard to include more than a passing mention of this issue.

Just like we need to be forgiving of me missing the actual 50-year anniversary by 4 days 😬

More from @charliejhadley

Sep 6, 2018
If you want to make code/data “available”, GitHub isn’t enough.

You must deposit at a DOI-issuing data repository. @figshare & @ZENODO_org are both free & awesome, and can be synced w/ a GitHub repo

Why is GitHub not enough? 1/4
#OpenAccess #OpenData
GitHub is a place for things to be worked on, not for them to live forever.

- Links are fragile (username, repo name)
- Users can delete repos
- GitHub could make your code/data unavailable in the future.

DOI-issuing data repositories preserve your stuff for the future 2/4
Depositing on @KaggleDatasets isn’t good enough for #OpenAccess #OpenData either.

- No API for accessing files without an account
- Fragile URLs
- Kaggle Datasets is a commercial thing.

Do all three! GitHub repo, Kaggle Dataset and @figshare or @ZENODO_ORG 3/4