Latest Twitter Threads by @_jennhu on Thread Reader App

Apr 4, 2024 • 8 tweets • 2 min read

New preprint w/ @mcxfrank:

How can we ascribe cognitive abilities to language models? We evaluate them! But evals impose challenges separate from the underlying ability of interest. These "task demands" affect LM performance, esp. for smaller models! 1/8 bit.ly/3VMdl2u

A shared goal in psychology and AI is to ascribe cognitive capacities to black-box agents. For example, we might be interested in whether a young child has theory of mind, or whether an LM can distinguish grammatical and ungrammatical sentences. 2/8

Dec 16, 2022 • 6 tweets • 3 min read

New paper with Sammy Floyd, @OlessiaJour, @ev_fedorenko, @LanguageMIT! Non-literal language understanding is an essential part of communication. But what is the role of mentalizing vs. language statistics in pragmatics? & how well do NLP models capture human prag behaviors? 🧵1/6 We explore this through a fine-grained comparison of LMs and humans on 7 pragmatic tasks. Our eval materials are an expert-curated set of multiple choice q's. Each answer option represents different strategies for solving the task (pragmatic, literal, low-level heuristics). 2/6

Sep 11, 2021 • 8 tweets • 2 min read

Excited to share our new preprint (w/ @smallhannahe & @ev_fedorenko): Our results support the idea that language comprehension & production draw on the same knowledge representations, which are stored in the language-selective network. 1/7biorxiv.org/content/10.110… A network of left frontal and temporal brain regions has been implicated in language comprehension & production, but what is the precise role of this ‘language network’ in production? Across 4 fMRI expts, we characterize the response of the lang regions to production demands. 2/7

Share this page!

Enter URL or ID to Unroll