Jessy Lin Profile picture
Apr 18 8 tweets 6 min read
How can agents infer what people want from what they say?

In our new paper at #acl2022nlp w/ @dan_fried, Dan Klein, and @ancadianadragan, we learn preferences from language by reasoning about how people communicate in context.

Paper: arxiv.org/abs/2204.02515
[1/n]
@dan_fried @ancadianadragan We’d like AI agents that not only follow our instructions (“book this flight”), but learn to generalize to what to do in new contexts (know what flights I prefer from our past interactions and book on my behalf) — i.e., learn *rewards* from language. [2/n]
@dan_fried @ancadianadragan The challenge is that language only reveals partial, context-dependent information about our goals and preferences (when I tell a flight booking agent I want “the jetblue flight,” I don’t mean I always want a jetblue flight — just in this particular case!). [3/n]
@dan_fried @ancadianadragan On the other hand, we have a lot of techniques in inverse reinforcement learning to go from actions -> underlying rewards, but these methods will miss the fact that language naturally communicates *why* people want those actions. [4/n]
@dan_fried @ancadianadragan To study this, we collect a dataset of natural language in a new task, FlightPref, where one player (the “assistant”) has to infer the preferences of the “user" while they book flights together.

Lots of rich, interesting phenomena in the data (to be released!): [5/n]
@dan_fried @ancadianadragan We build a pragmatic model that reasons that language communicates what agents should do, and the *way* people describe what to do reveal the features they care about. Both enable agents to make more accurate inferences. [6/n]
@dan_fried @ancadianadragan There’s many directions to take FlightPref / reward learning from language further: building agents that learn to ask questions based on uncertainty, studying adaptation to different humans, and seeing how these ideas extend to inferring real preferences in the wild! [7/n]
@dan_fried @ancadianadragan More broadly, it’s an exciting time to be working on language + action/RL! A lot of work has been focused on e.g. language for generalization, but our work hints at how language humans use to *communicate* present distinct challenges for grounded agents (pragmatics, etc.!). [8/8]

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Jessy Lin

Jessy Lin Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(