Tweet

Jenn Wortman Vaughan

Jun 8 • 7 tweets • 4 min read

@AmyHeger

So excited I can FINALLY share our new work on machine learning practitioners' data documentation perceptions, needs, challenges, and desiderata, which will appear in #CSCW2022!

arxiv.org/abs/2206.02923

Joint work w/ @AmyHeger, @lizbmarquis, @mihaela_v, and @hannawallach 1/n

Data is central to the development & evaluation of ML models. Using problematic or inappropriate datasets can lead to harms.

Data documentation frameworks like datasheets & data nutrition labels were proposed to encourage transparency and deliberate reflection on datasets. 2/n

But do these frameworks meet the needs of ML practitioners who create and consume datasets?

We conducted a series of semi-structured interviews with 14 ML practitioners and had them answer a list of questions borrowed from datasheets for datasets. 3/n

To me, one of our most surprising findings is that while data doc frameworks are often motivated from the perspective of responsible AI, practitioners struggled to connect the questions they were answering to their RAI implications.

There's a big gap here we need to address! 4/n

Participants want data doc frameworks to be adaptable, integrated into existing tools and workflows, and as automated as possible.

They have trouble prioritizing the needs of dataset consumers and providing info that someone unfamiliar with their datasets might need to know. 5/n

In the paper, we provide 7 design requirements for future data documentation frameworks based on our findings.

Our hope is that these will help make data documentation more widespread and practical and contribute to responsible AI practices. 6/n

@PartnershipAI

If you're interested in data documentation for ML, also check out:
- datasheets (dl.acm.org/doi/10.1145/34…)
- the data nutrition project (datanutrition.org)
- data statements for NLP (aclanthology.org/Q18-1041/)
- @PartnershipAI's ABOUT ML project (partnershiponai.org/workstream/abo…)

7/7

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @jennwvaughan

Jenn Wortman Vaughan

@jennwvaughan

Feb 4

I want to talk about burnout. A brief 🧵...

I was well aware I was burnt out in the fall, but it's hard to fully appreciate the impact of burnout in the moment.

After 2 weeks of vacation and a month of aggressively blocking daily focus time, the impact has become more clear:

- I am less angry. This is huge.

- I feel less like every situation is adversarial, more willing to give people the benefit of the doubt.

- I feel less threatened by other people's actions, more in control of my own outcomes.

- I feel optimistic about those outcomes. (2/n)

- I can see paths forward where there were only roadblocks before.

- I can brainstorm and get excited about ideas.

- I can focus on what people are saying in meetings and feel less of a need to (poorly) multitask.

- As a result, I'm more interested in talking to people. (3/n)

Read 6 tweets

Jenn Wortman Vaughan

@jennwvaughan

Jan 8, 2020

@hannawallach

Ok, people! Are you looking for something to read at the intersection of machine learning and HCI?

Three new papers posted online today that you should check out, all with my amazing colleague/BFF @hannawallach!

Ready? I'm gonna try a thread!

@hannawallach

@hannawallach #1 Our upcoming book chapter/survey/position paper/labor of love on "A Human-Centered Agenda for Intelligible Machine Learning"

jennwv.com/papers/intel-c…

@hannawallach

@hannawallach #2 Our CHI 2020 paper on "Interpreting Interpretability"

Developing interpretability tools is not enough! Why user-centric evaluation of interpretability tools is necessary...

With @harmankkaur, @hannawallach, and other awesome non-twitter people.

jennwv.com/papers/interp-…

Read 4 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Jenn Wortman Vaughan

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @jennwvaughan

Jenn Wortman Vaughan

Jenn Wortman Vaughan

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?