First modeling paper out of BigScience is here!

T0 shows zero-shot task generalization on English natural language prompts, outperforming GPT-3 on many tasks, while being 16x smaller!

Model: huggingface.co/bigscience/T0pp
Repo: github.com/bigscience-wor…
Paper: arxiv.org/abs/2110.08207
This was an international collaborative effort, with over 40 people across more than 25 organizations. The group included dedicated researchers and engineers from different universities, companies, and think tanks.
Our approach uses natural language prompts that allow us to share a format for a large variety of NLP tasks. We used a prompted format with the goal of allowing our model to generalize to unseen prompted tasks.
To collect the prompts for these tasks we built an open-source system for prompt engineering at a tremendous scale (as of now, there are ~2’000 prompts for 170+ datasets). The tool PromptSource is open-source and available on github.com/bigscience-wor…
To create T0, we fine-tuned T5 on a multi-task mixture of prompted datasets from Promptsource. When evaluated on zero-shot tasks, we found that it matched or exceeded GPT-3's performance on 9 of 11 datasets.
On a subset of BIG-Bench (github.com/google/BIG-ben…), a new collection of diverse and novel tasks intended to be difficult for large language models, T0 outperforms 6x larger language models on 13 out of 14 datasets.
We have released T0 models in the Hugging Face Model Hub and you can try it out in your browser here: huggingface.co/bigscience/T0pp
We are also releasing all of our prompts as the Public Pool of Prompts (P3). You can see them on bigscience.huggingface.co/promptsource.
This is the first results coming out of the modeling working group, focusing on testing the method on English first. We are excited to work on extending the approach to multiple languages especially for languages with fewer existing datasets!
This project was made possible through the support of the TPU Research Cloud (@TensorFlow) and @Genci_fr who provided computational resources to train and evaluate our models.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with BigScience Research Workshop

BigScience Research Workshop Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(