So happy my first major piece of work at DeepMind is now published!

We consider a problem at the intersection of cogsci, social science and AI - can AI be used as a force for good, to help groups of people who disagree to find consensus?

Paper: dpmd.ai/3XR3Bm3

🧵…
We generated thousands of political questions and posed them to human participants.
e.g. should there be a tax on sugary foods?

Instead of a poll, people wrote their opinions out and explained their thinking
We sent these opinions into a large language model, and asked it to produce potential consensus statements that capture the group thinking overall.

Sometimes the model-generated consensus’s weren’t great, so we asked the same people to rate how much they agreed with each one
Promoted models were ok, but we knew we could do better!

We trained our model on these human preferences, so that with each new phase of human interactions, it got better and better at producing consensus statements that people like
We find that after training, our model produces statements that people prefer over the best human-written opinions.
And that the model is sensitive to the particular opinions the group has - it hasn’t just aligned to the preferences of a ‘generic’ user, but appreciates that different users can have different beliefs.
There are more details in the paper ofc, but we think this highlights the potential that large language models have as tools for social good, to help humans align their values with one another.

End of 🧵.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Hannah Sheahan

Hannah Sheahan Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(