Ankur Handa Profile picture
Sep 3, 2020 4 tweets 2 min read Read on X
Nice use of Q-Networks and MCTS to do scene arrangement. Given an initial layout it learns to find a sequence of moves (actions) that bring it close to the target layout all with a collision free path.

github.com/HanqingWangAI/…

full video:
Q-Network used in this work Image
Scenes can also be obtained via generative modelling

Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models

arxiv.org/abs/1811.12463 Image
generating scenes via RL or generative models, training systems in them, tracking performance and if success rate is high go back to generating scenes but with increasing complexity all without a human in the loop - some sort of auto-ml.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Ankur Handa

Ankur Handa Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ankurhandos

Dec 20, 2020
Some simulation platforms that caught my eye this year 🧵
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

arxiv.org/abs/2007.04954

threedworld.org
Read 26 tweets
Dec 17, 2020
Transformers for point clouds arxiv.org/abs/2012.09164

They outperform all previous methods on semantic segmentation, shape classification and object part segmentation. Image
unlike transformers used in language and image based tasks, the positional encoding is also learned. Image
point transformer layer Image
Read 5 tweets
Nov 21, 2020
finally got around to reading this eccv20 best paper award winner work on optical flow that has interesting ideas: multi-scale 4D correlation volumes, learned-upsampling (using convex weights of lower res pixels), and iterative refinement of flow.

arxiv.org/abs/2003.12039
correlation volumes
upsampling module
Read 8 tweets
Sep 26, 2020
starting a thread of interesting python features / modules / libraries that I found over time 🧵
first up, joblib's memory. If you are reading a huge file Memory class helps you cache that on your disk during the first call. Successive calls load the data much faster. Assuming you didn't change both the function as well as the contents of the file.
Read 5 tweets
Sep 16, 2020
This is one of the finest lecture notes in computer vision by Svetlana Lazebnik. Highly recommended to everyone in CV. It mentions the origins, various historic perspectives and anecdotes in CV. Also talks about ethical and societal impacts of CV.

slazebni.cs.illinois.edu/spring20/ Image
How computer vision evolved over the decades. Image
Some of these issues may still be prevalent 😉 Image
Read 7 tweets
Sep 15, 2020
Some recent interesting work on hand tracking and pose estimation that I liked. Creating a thread 🧵
MEgATrack: Monochrome Egocentric Articulated Hand Tracking for Virtual Reality

research.fb.com/wp-content/upl…
The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization

arxiv.org/abs/2007.04940
Read 10 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(