Follow @ylecun

12,399 views

Yann LeCun

Follow @ylecun

, 3 tweets, 1 min read

My Authors

Self-Supervised Learning making strides in speech recognition.

Wav2Vec 2.0 from FAIR uses a kind of contrastive SSL for pre-training.

This is the first time an SSL system reaches the very best results on a number of different ASR tasks.

arxiv.org/abs/2006.11477
1/N

After SSL pre-training, a word error rate of 10% can be obtained with just 10 minutes of labeled training data.

With only 1h of labeled data, this beats the best previous SSL method trained on 100h of labeled data.
2/N

Wav2vec 2.0 matches the best known word error rate when trained on the full 960h of labeled data in LibriSpeech, but with a rather simpler architecture.
3/N
N=3

Missing some Tweet in this thread? You can try to force a refresh.

Keep Current with Yann LeCun

Stay in touch and get notified when new unrolls are available from this author!

This Thread may be Removed Anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Try unrolling a thread yourself!

More from @ylecun see all

Embed code for your website

Did Thread Reader help you today?