A research workshop on large language models, gathering 1000+ researchers from around the world
Follow the training of the 176B multilingual model live @BigScienceLLM
Oct 18, 2021
First modeling paper out of BigScience is here!
T0 shows zero-shot task generalization on English natural language prompts, outperforming GPT-3 on many tasks, while being 16x smaller!