Self-Supervised Learning making strides in speech recognition.

Wav2Vec 2.0 from FAIR uses a kind of contrastive SSL for pre-training.

This is the first time an SSL system reaches the very best results on a number of different ASR tasks.

arxiv.org/abs/2006.11477
1/N
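
As a rough illustration of what "contrastive" means here: in the linked paper, the model has to pick the true quantized latent for a masked time step out of a set of distractors, scored by cosine similarity with a temperature. The sketch below shows only that term for a single time step. The function names, the toy vectors, and the default temperature are assumptions for illustration, and the paper's additional diversity term is left out.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def contrastive_loss(context, target, distractors, temperature=0.1):
    """Negative log-probability of picking `target` among target + distractors.

    context     -- context vector at a masked time step (hypothetical input)
    target      -- the true latent for that step
    distractors -- list of latents taken from other steps
    temperature -- illustrative default, not a value taken from the thread
    """
    # Similarity of the context to the true target comes first in the list.
    logits = [cosine(context, target) / temperature]
    logits += [cosine(context, d) / temperature for d in distractors]

    # Numerically stable log-sum-exp over all candidates.
    m = max(logits)
    log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))

    # -log softmax probability assigned to the true target.
    return log_denominator - logits[0]


# The loss is small when the context is aligned with the true target
# and large when it is aligned with a distractor instead.
aligned = contrastive_loss([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]])
confused = contrastive_loss([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
print(aligned < confused)
```
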
After SSL pre-training, a word error rate of 10% can be obtained with just 10 minutes of labeled training data.

With only 1h of labeled data, this beats the best previous SSL method trained on 100h of labeled data.
2/N
Wav2vec 2.0 matches the best known word error rate when trained on the full 960h of labeled data in LibriSpeech, but with a rather simpler architecture.
3/N
N=3
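
For readers unfamiliar with the metric quoted in the thread, word error rate is conventionally the word-level edit distance (substitutions + deletions + insertions) between the recognizer output and the reference transcript, divided by the number of reference words. A minimal sketch follows, assuming whitespace tokenization and no text normalization; the function name and example sentences are made up for illustration, and this is not the evaluation code behind the reported numbers.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# One wrong word out of ten reference words gives a 10% word error rate.
reference = "a b c d e f g h i j"
hypothesis = "a b c d e f g h i x"
print(word_error_rate(reference, hypothesis))
```
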

Keep Current with Yann LeCun
