Tweet

Hannah Sheahan

Dec 1 • 7 tweets • 2 min read

https://twitter.com/deepmind/status/1598293523862032385

So happy my first major piece of work at DeepMind is now published!

We consider a problem at the intersection of cogsci, social science and AI - can AI be used as a force for good, to help groups of people who disagree to find consensus?

Paper: dpmd.ai/3XR3Bm3

🧵…

https://twitter.com/deepmind/status/1598293523862032385

We generated thousands of political questions and posed them to human participants.
e.g. should there be a tax on sugary foods?

Instead of a poll, people wrote their opinions out and explained their thinking

We sent these opinions into a large language model, and asked it to produce potential consensus statements that capture the group thinking overall.

Sometimes the model-generated consensus’s weren’t great, so we asked the same people to rate how much they agreed with each one

Promoted models were ok, but we knew we could do better!

We trained our model on these human preferences, so that with each new phase of human interactions, it got better and better at producing consensus statements that people like

We find that after training, our model produces statements that people prefer over the best human-written opinions.

And that the model is sensitive to the particular opinions the group has - it hasn’t just aligned to the preferences of a ‘generic’ user, but appreciates that different users can have different beliefs.

There are more details in the paper ofc, but we think this highlights the potential that large language models have as tools for social good, to help humans align their values with one another.

End of 🧵.

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Hannah Sheahan

People who liked this thread also liked...

Try unrolling a thread yourself!

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!