This neural network architecture that was showcased at the @Tesla AI day is a perfect example of Deep Learning at its finest. Mix and match all the greatest innovations to do something drastic and super ambitious. Congrats!
Treating the job of figuring out valid "lanes" from images as language is brilliant. Combining CNNs, transformers, attention, pointer networks, etc., you essentially write a set of instructions to build up the graph by connecting the dots, start new lanes, set curvature, etc.
This isn't ML-new, but who cares? Applied at the level of ambition of full-scale real world impact, with the right team, execution, (and compute/data!), you can do things that felt impossible before. Both the architecture and cool use of language heavily reminded me of AlphaStar.
With all the debates on who-claimed-what-when, ego wars, research-prophecies, and other non-sense that is filling my Twitter feed these days, it is sobering to see what good execution can achieve. Congratulations to all those involved in this and similar projects!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Oriol Vinyals

Oriol Vinyals Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @OriolVinyalsML

Jan 5
2021 personal highlights, a🧵. Despite being a challenging year globally due to the pandemic 😷🦠, but thanks to many incredible collaborators, it's been an exciting year research-wise 🤖 Some highlights below.👇
Diversity and inclusion. I kept engaged through our efforts @DeepMind, mentorship and as a member of @Khipu_AI community, supporting AI in Latin America, where we had a fireside chat w/ @geoffreyhinton (). I was also a mentor for: docs.google.com/spreadsheets/d…
Perceiver. Being able to treat every modality as a sequence of bytes has been a personal deep learning dream. Perceiver is a transformer-derived architecture proposing a few modifications to achieve this.

Papers: arxiv.org/abs/2103.03206 & arxiv.org/abs/2107.14795
Read 11 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(