Talk by Chris Sweeney at #FAT2020 on "Reducing sentiment polarity for demographic attributes in word embeddings using adversarial learning," with @Maryam_Najafian. Image
There are several types of bias encoded in language models, and this paper focuses on sentiment bias, where certain identity terms encode a more positive sentiment than others. #FAT2020 ImageImage
Various papers have studied the different possible sources of this bias, and this paper focuses on the word vectors themselves. #FAT2020 Image
In particular, they define sentiment polarity by using various positive and negative words to obtain a positive/negative sentiment axis, and look at where various identity terms fall on that axis when their word vectors are projected onto that axis. #FAT2020 ImageImageImage
A given identity term's sentiment score is where its embedding's projection lies on this axis. Goal is to reduce the polarization of a set of identity terms while preserving semantic meaning. #FAT2020 ImageImage
They use an adversarial technique, learning to minimize the distance between polarized/depolarized word vectors, while the adversary maximizes the error between sentiment polarity and groundtruth. #FAT2020 Image
They evaluate whether the resulting embeddings are depolarized and the effect on fairness/accuracy in downstream tasks. They show that they reduce the polarity of names typically associated with different demographic groups. #FAT2020 ImageImage
Case study uses Equality Evaluation Corpus, and they show improvement according to Sentiment valence regression metric for different demographic categories. #FAT2020 ImageImageImageImage

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Ben Packer

Ben Packer Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!