Research Scientist @allen_ai, PhD in NLP 🤖 UofA. Ex @GoogleDeepMind @MSFTResearch @MilaQuebec
🚨🚨 NEW BLOG about o1 models: https://t.co/PPVoY25Ofe
May 31, 2023 • 7 tweets • 5 min read
🚀📢 GPT models have blown our minds with their astonishing capabilities. But, do they truly acquire the ability to perform reasoning tasks that humans find easy to execute? NO⛔️
We investigate the limits of Transformers *empirically* and *theoretically* on compositional tasks🔥
We find that GPT3, ChatGPT, and GPT4 cannot fully solve compositional tasks even with in-context learning, fine-tuning, or using scratchpads. To understand when models succeed, and the nature of the failures, we represent a model’s reasoning through computation graphs.