Tweet

Jacob Andreas

Dec 6 • 8 tweets • 2 min read

Speculative (!!!) paper arguing that big LMs can model agency & communicative intent: arxiv.org/abs/2212.01681 (somehow in EMNLP findings). Briefly:

1. LMs do not in general have beliefs or goals. An LM trained on the Internet models a distribution over next tokens *marginalized*

over all the authors who could have produced the context. All large-scale training sets are generated by a mixture of authors w/ mutually incompatible beliefs & goals, and we shouldn't expect a model of the marginal dist. over utts to be coherent.

BUT

2. B/c single training documents are written by individuals who do have specific communicative intentions, and b/c understanding these intentions helps w/ next-word prediction, we should expect LMs to *infer and represent* the latent beliefs/intentions/etc that give rise to a ctx

3. There's lots of evidence that current LMs already do this for coarse notions of "belief", "goal", "intention", etc. The one I find most convincing is the fact that LMs get better at TruthfulQA if you prefix the input w a description of a truthful agent: arxiv.org/abs/2109.07958

So even if an LM isn't a model of a specific agent in general, a good LM can model the communicative behavior of an agent in context, and will do so by building internal representations that have the same causal relation to output that the agent's internal states have.

Obviously current models still make lots of errors when doing this. ChatGPT is especially interesting, since it was clearly trained to model a specific agent rather than the entire distribution, but can be tricked into modeling other agents with carefully designed contexts!

There are likely also architectural limitations that prevent current archs from doing agent-inference-and-simulation in a general & reliable way, not least of which is that it's hard to fully specify beliefs & goals in a few thousand tokens. So even training on text alone,

better models might do a better job of explicitly modeling the latent author of the text the LM is trying to generate, and provide downstream users more transparency and control over this inferred author's internal state.

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @jacobandreas

Jacob Andreas

@jacobandreas

Sep 21, 2020

New TACL paper from Semantic Machines! A first look at the approach we've been developing for modeling complex, task-oriented conversations using dataflow graphs.

aka.ms/AA9oxf3

I am extremely excited about this work for several reasons:

1/ It highlights one of the most consequential but overlooked consequences of neural models in dialogue: explicit representations of intent still matter (a production system can't lie about what it did even 1% of the time) but now *we can predict whatever representations we want*

If you look at the earliest dialogue research (e.g. Grosz & Sidner's Shared Plans), the community used to be way more ambitious about the dialogue phenomena we tried to represent. Most of that went out the window when contemporary ML approaches weren't up to modeling it.

Read 9 tweets

Jacob Andreas

@jacobandreas

Jul 9, 2020

@alexandersclark

Was also very happy to get multiple pointers to @alexandersclark & Eyraud's work on substitutable languages (dl.acm.org/doi/pdf/10.555…). Haven't done a full read yet but "weak substitutability <=> syntactic congruence" is exactly what (1-fragment, full-context) GECA assumes---

so you can think of GECA as attempting to constrain models to substitutable languages by computing the closure of the training data over observed syntactic congruences. This closure need not produce a CFL (which answers a question I had about GECA!)...

@XiangLisaLi2

...and I think the constraint could be straightforwardly extended to *unobserved* sequences using a posterior regularization approach like this one from @XiangLisaLi2 and @srush_nlp virtual.acl2020.org/paper_main.243….

Read 5 tweets

Jacob Andreas

@jacobandreas

Jul 6, 2020

Thoughts on Kathy McKeown's #acl2020nlp keynote:

someone should go back and categorize all the "what's most important about deep learning" responses---literally nobody agrees! Accessibility, scalability, empirical performance, representations of lexical meaning, feature representations more broadly, support for new inputs, ...

Always good to be reminded that apart from Aravind, almost all of the "founding members" of what's now the NLP community were women: Spärck-Jones, Webber, Grosz, Hajicova.

Read 7 tweets

Jacob Andreas

@jacobandreas

Jul 3, 2020

https://twitter.com/cainesap/status/1279076489267462144

sing minutes grammars and three transducers

https://twitter.com/cainesap/status/1279076489267462144

neural semantic persons

invariant to lexical choice by replacing words with their citizens

Read 8 tweets

Jacob Andreas

@jacobandreas

Jun 26, 2020

@jayelmnop

New preprint led by Jesse Mu (@jayelmnop) on discovering compositional concepts in deep networks! You've heard of the "cat neuron" and the "sentiment neuron"; now, meet the green-and-brown-water neuron, the castle-or-surgery neuron, and the cheating-at-SNLI neuron. 1/

https://twitter.com/jayelmnop/status/1276558715508928515

Earlier work from Berkeley (arxiv.org/abs/1707.08139) and MIT (netdissect.csail.mit.edu) automatically labels deep features with textual descriptions from a predefined set. In our new work, we generate more precise & expressive explanations by composing them on the fly. 2/

(Think of this as labeling each neuron with a *program* by running inductive program synthesis with respect to a set of primitive detections and composition fns to approximate the target neuron's output as well as possible.) 3/

Read 4 tweets

Jacob Andreas

@jacobandreas

May 15, 2020

@aggielaz

Reading arxiv.org/pdf/2005.07064… by @aggielaz Potapenko & Tieleman. Nice side-by-side comparison of different combos of sup grounding loss & communicative / task loss for NLG. Even when you lock down semantics, agents trained w task loss form weird *pragmatic* conventions.

This last obs is a clear illustration of the fact that RSA (at least as usually applied to captioning) only explains a limited subset of pragmatic implicatures! In settings like the one studied here, learners are free to come up with their own conventions for the rest.

(Compare the "Mike has a yellow hat" example in the paper to Horn's "he broke a finger" -> "he broke his own finger".)

Read 4 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Jacob Andreas

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @jacobandreas

Jacob Andreas

Jacob Andreas

Jacob Andreas

Jacob Andreas

Jacob Andreas

Jacob Andreas

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!