CS PhD student @stanfordnlp @StanfordAILab. Language for machine learning, not machine learning for language. he/him
May 4, 2023 • 4 tweets • 2 min read
New LM eval just dropped—Google has no moat??
Problem w/ this article is the cited LM evals are misleading: they don't measure frontier capabilities but a very narrow task distr. Claims that closed LMs have no moat must evaluate OSS models on actual knowledge work, not stuff like "name a restaurant"
Excited to present 3 #NeurIPS2022 papers on a trend I've been very excited about recently: blurring the boundaries between language models and RL agents
(+a bonus 4th paper on active learning!)
🧵(0/7)
PS: I'm on the industry job market!
1️⃣ Improving Intrinsic Exploration with Language Abstractions
Using language abstractions to guide exploration in RL, e.g. by self-designing a curriculum of increasingly difficult language goals