BigScience Research Workshop
A research workshop on large language models, gathering 1000+ researchers around the world. Follow the training of the 176B multilingual model live @BigScienceLLM
Oct 18, 2021
First modeling paper out of BigScience is here!

T0 shows zero-shot task generalization on English natural language prompts, outperforming GPT-3 on many tasks, while being 16x smaller!

Model: huggingface.co/bigscience/T0pp
Repo: github.com/bigscience-wor…
Paper: arxiv.org/abs/2110.08207

This was an international collaborative effort, with over 40 people across more than 25 organizations. The group included dedicated researchers and engineers from different universities, companies, and think tanks.