Jessy Lin Profile picture
PhD @Berkeley_AI, visiting researcher @AIatMeta. Interactive language agents ๐Ÿค– ๐Ÿ’ฌ
Aug 27 โ€ข 10 tweets โ€ข 5 min read
๐Ÿ” How do we teach an LLM to ๐˜ฎ๐˜ข๐˜ด๐˜ต๐˜ฆ๐˜ณ a body of knowledge?

In new work with @AIatMeta, we propose Active Reading ๐Ÿ“™: a way for models to teach themselves new things by self-studying their training data. Results:

* ๐Ÿ”๐Ÿ”% on SimpleQA w/ an 8B model by studying the wikipedia docs (+๐Ÿ‘๐Ÿ๐Ÿ‘% vs plain finetuning)
* a domain-specific expert model: ๐Ÿ๐Ÿ”๐ŸŽ% vs FT on FinanceBench knowledge
* an 8B wikipedia expert competitive w/ 405B on factuality (๐Ÿ’ฅopen-sourced!)

๐Ÿงต[1/n]Image Currently, we train models by doing a single pass over the data.

Contrast w/ how humans learn: when we read a textbook, we use many strategies to internalize new info: thinking about a concept in different ways, imagining practice problems, or relating to things we already know.

We apply this idea to LLMs: for each training doc, we have the model itself propose study strategies, "actively reading" to synthesize its own augmented training corpus.

๐Ÿงต [2/n]
Jun 1, 2023 โ€ข 10 tweets โ€ข 5 min read
How can agents like LLMs become decision-making partners for humans?

๐Ÿ’ฌ Excited to share a new paper + suite of envs for ๐˜ฅ๐˜ฆ๐˜ค๐˜ช๐˜ด๐˜ช๐˜ฐ๐˜ฏ-๐˜ฐ๐˜ณ๐˜ช๐˜ฆ๐˜ฏ๐˜ต๐˜ฆ๐˜ฅ ๐˜ฅ๐˜ช๐˜ข๐˜ญ๐˜ฐ๐˜จ๐˜ถ๐˜ฆ๐˜ด, where agents + humans collab to solve hard everyday problems. [1/n]

Site: collaborative-dialogue.github.io A lot of everyday problems involve making decisions with messy constraintsโ€”from researching a laptop to buy to prioritizing a company roadmap.

Agents could help us make these decisions! But they need to integrate the fuzzy real-world knowledge and preferences that we know.
Apr 18, 2022 โ€ข 8 tweets โ€ข 6 min read
How can agents infer what people want from what they say?

In our new paper at #acl2022nlp w/ @dan_fried, Dan Klein, and @ancadianadragan, we learn preferences from language by reasoning about how people communicate in context.

Paper: arxiv.org/abs/2204.02515
[1/n] @dan_fried @ancadianadragan Weโ€™d like AI agents that not only follow our instructions (โ€œbook this flightโ€), but learn to generalize to what to do in new contexts (know what flights I prefer from our past interactions and book on my behalf) โ€” i.e., learn *rewards* from language. [2/n]