Roma Patel Profile picture
currently cs phd student @BrownUniversity. language & rl & multi-agent rl & interpretability. intern @googleai '19, @deepmind '20. (she/her)

Apr 28, 2020, 6 tweets

this paper from @shaohua0116 on guiding RL agents with program counterparts of natural language instructions was one of my favourites at #ICLR2020. here is why i think it's exciting and quite different from existing work. 1/6

there's a large literature of #NLProc work on semantic parsing (converting language->executable meaning representations) for a variety of tasks. this is helpful e.g., for database operations, for goals/rewards for planners, to ground to predefined robotic actions etc. 2/6

apart from select works, a lot of the time, the programs are treated as static---their executions are pre-defined, are usually used once at some beginning/end-point (e.g. to produce a goal state for some RL algorithm/planner) and do not extend over time or with interactions. 3/6

this work uses programs that are more _functional_, involving control flows, conditionals, as well as nested subtasks. agents therefore perceive the environment and interact with it following the control flows---which is far more natural than previous RL/language setups. 4/6

this should allow better handling of temporal conditions and lifelong learning guided by program synthesis. moreover, lots of dependencies/signals in natural language that couldn't be captured by static goal-based programs, now have a better chance of being realised! 5/6

there's still a lot left to do on the language side here (e.g., this has rule-based program interpreters and doesn't do the language->program pipeline), but i'm pretty excited about this direction and future work that does! 6/6

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling