Laura Ruis Profile picture
Jan 19 12 tweets 3 min read
I simultaneously have chatGPT-fatigue and also think about it every day - it truly changed the discourse around LLMs. Plus, chatty-P provided us with _so_ many memes. The following is my current mental model of chatGPT (now popularly synonymous to LLMs) and what might be missing.
Contrary to what we sometimes read on twitter, language is meaningfully different from other types of communication; it's a complex system of symbolic reference.
How? Symbols relate to each other in a system; allowing to describe each in terms of others; they don't need to co-occur with physical events (we can talk about democracy and the future); and once a new one is acquired the relational system allows generalisation to novel context.
With lots of effort, we can teach chimpanzees a similar form of symbolic system (a language). Even though lots of non-human animals have incredibly complex forms of communication, language does not _naturally_ show up in any of them.
LLMs acquired a noisy system of symbols from inordinate amounts of data and speak fluently. This is really cool (and scary, thanks safety ppl for your work 🙇‍♀️) and something that many didn't think possible years ago.
But base LLMs are exceedingly toxic. (RL)HF aligned them (somewhat) to polite modes of speaking (at a surface level subject to T&Cs and caveats etc etc etc don't believe anything chatty-P says basically but then again also don't believe anything I say).
There's still lots of failure modes and as of yet, I think chatGPT won't pass the true Turing test (if the judge knows what they're doing). Although an impressive story-teller and, at times, hilarious, chatGPT can be glaringly incoherent. We've all seen examples of it.
I think an important missing component is #agency (setting goals and achieving them). There are levels to agency. Lizards can find food under uncertainty. A monkey sets diverse goals and achieves them under more uncertainty. We are at the top of this self-defined agency-pyramid.
When we speak, there's complex underlying intentions and beliefs at play. These intentions are what make us mostly coherent and give our language meaning. LLMs simulate this agency to some extent (arxiv.org/abs/2212.01681), but are no agent and hence show incoherence sometimes.
(RL)HF might make this seemingly better, because the LLMs seem to more coherently adhere to a smaller set of beliefs (be polite! be helpful! don't be racist!), but it's not a fundamental change to the fact that there is no agency, and hence no coherence or "meaning".
Language might require something different (to be added! with all the DL!! I don't hate chatty-P trust me!!!). I don't know if embodiment is part of this picture or whether agency can exist separately, but I'm excited to think about it because actually I don't know what agency is
Sources of inspiration:
- Ape language: From conditioned response to symbol by Savage-Rumbaugh
- The Symbolic Species by Deacon
- Symbolic Behaviour in AI by Santoro, Lampinen et al
- The Evolution of Agency by Michael Tomasello
- Language models as agent models by Andreas

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Laura Ruis

Laura Ruis Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @LauraRuis

Dec 4, 2022
Lots of people have been sending me implicatures we used as examples in our paper that #chatGPT understands (i.e. explains well when prompted). So cool! Happy to see people interested in this. I wanted to write a short thread about my thoughts of what this means. ⬇️
E.g. left is chatGPT, right is Davinci-2 ImageImage
Before we started the paper, I would try lots of implicatures I came up with myself on Davinci-2, in different wordings, with moderate success. Some always solved, some half of the time depending on the wording (meaning random performance, since the test is a binary yes/no one).
Read 13 tweets
Nov 20, 2022
Some of the best #advice I got early in my PhD was from @rockt urging me to set up a good note-keeping system. 1 year in my notes have helped me in so many situations! It took a few iterations, and I'll share the things that were most helpful for me personally here ⬇️🐊 1/13
Everything below I use @obsdmd for. I love how it's simply a markdown editor and file-structure visualiser at first but you can add all the plugins you need to make it ever more complex.
I fill out the same note template for every paper I read. It asks for metadata, content tags, a personal brief thought about the paper, etc. I make sure to write a tl;dr in my own words after reading. DM me if you want the template :) (it adjusted from @y0b1byte's template)
Read 13 tweets
Oct 28, 2022
Before giving up on Twitter, check out our recent finding 🔥 LLMs are *not* zero-shot communicators!

We show a limitation of #LLMs in interpreting language given the context of shared experience of the world, a ubiquitous part of communication ⬇️🧵

arxiv.org/abs/2210.14986

1/22
Work with awesome collaborators @akbirkhan @BlancheMinerva @sarahookr @_rockt @egrefen

If this thread is long, it’s only because I had to keep the meme-to-information ratio at a respectable level (and because there’s a lot to unpack)

lauraruis.github.io/2022/09/29/com…

2/22
Let’s start off with an example of the type of language we’re investigating. Consider the following exchange between a human and InstructGPT

(text-davinci-002 temp. 0) 3/22
Read 22 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(