, 7 tweets, 2 min read Read on Twitter
If I write a paper one day that is 1/10th as creative and interesting as this one, I will be very happy:
arxiv.org/pdf/1904.11111…

Idea - build a dataset for Monocular Depth Estimation by collecting a set of videos from the "Mannequin Challenge" 1/n
Then use motion parallax to infer the depth of all static points (which is possible because everyone/thing in the video is being helpfully very still). This is already cool, but in addition, they use a 2D segmentation model to mask out the people. 2/n
Why is this cool? Well, for the static videos, the people don't move, so it's easy to predict their depth. But in actual videos, people do move - so if the network is also trained to predict the depth of the masked out humans, it can transfers nicely to a real video 3/n
They show in the paper that you can use this for all kinds of cool visual effects, but the real killer application is Monocular SLAM/3D mapping (monocular depth estimation was hard when the camera is also moving, this looks like a massive step towards fixing that). 4/n
The extensions to this are extremely obvious - use a segmentation model for self-driving cars and slurp up a bunch of data from google maps. Mask out the cars, and boom - you've got the same thing for any moving object that you can accurately segment. Pretty cool. 5/n
I wish in NLP we had such creative datasets! The overwhelming feeling at the moment is "We enslaved a bunch of Turkers who don't care about your task or data..... hopefully this captures all of semantics/syntax/common sense?" (please point me to such datasets if they exist) 6/n
Missing some Tweet in this thread?
You can try to force a refresh.

Like this thread? Get email updates or save it to PDF!

Subscribe to Mark Neumann
Profile picture

Get real-time email alerts when new unrolls are available from this author!

This content may be removed anytime!

Twitter may remove this content at anytime, convert it as a PDF, save and print for later use!

Try unrolling a thread yourself!

how to unroll video

1) Follow Thread Reader App on Twitter so you can easily mention us!

2) Go to a Twitter thread (series of Tweets by the same owner) and mention us with a keyword "unroll" @threadreaderapp unroll

You can practice here first or read more on our help page!

Follow Us on Twitter!

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just three indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!