So excited I can FINALLY share our new work on machine learning practitioners' data documentation perceptions, needs, challenges, and desiderata, which will appear in #CSCW2022!

arxiv.org/abs/2206.02923

Joint work w/ @AmyHeger, @lizbmarquis, @mihaela_v, and @hannawallach 1/n
Data is central to the development & evaluation of ML models. Using problematic or inappropriate datasets can lead to harms.

Data documentation frameworks like datasheets & data nutrition labels were proposed to encourage transparency and deliberate reflection on datasets. 2/n
But do these frameworks meet the needs of ML practitioners who create and consume datasets?

We conducted a series of semi-structured interviews with 14 ML practitioners and had them answer a list of questions borrowed from datasheets for datasets. 3/n
To me, one of our most surprising findings is that while data doc frameworks are often motivated from the perspective of responsible AI (RAI), practitioners struggled to connect the questions they were answering to their RAI implications.

There's a big gap here we need to address! 4/n
Participants want data doc frameworks to be adaptable, integrated into existing tools and workflows, and as automated as possible.

They have trouble prioritizing the needs of dataset consumers and providing info that someone unfamiliar with their datasets might need to know. 5/n
In the paper, we provide 7 design requirements for future data documentation frameworks based on our findings.

Our hope is that these will help make data documentation more widespread and practical and contribute to responsible AI practices. 6/n
If you're interested in data documentation for ML, also check out:
- datasheets (dl.acm.org/doi/10.1145/34…)
- the data nutrition project (datanutrition.org)
- data statements for NLP (aclanthology.org/Q18-1041/)
- @PartnershipAI's ABOUT ML project (partnershiponai.org/workstream/abo…)

7/7
