There are different computer vision problems you need to solve in a self-driving car.
• Object detection
• Lane detection
• Drivable space detection
• Semantic segmentation
• Depth estimation
• Visual odometry
Details below
Object Detection
One of the most fundamental tasks - we need to know where other cars and people are, and which signs, traffic lights and road markings need to be considered. Objects are identified by 2D or 3D bounding boxes.
Relevant methods: R-CNN, Fast(er) R-CNN, YOLO
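One computation every detection pipeline relies on is Intersection-over-Union (IoU), used both to match predictions to ground truth and inside non-maximum suppression. A minimal sketch for axis-aligned 2D boxes:

```python
def iou(box_a, box_b):
    """Intersection-over-Union of two boxes given as (x1, y1, x2, y2)."""
    # Corners of the intersection rectangle.
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    # Clamp to 0 so non-overlapping boxes get zero intersection.
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

# Two 10x10 boxes overlapping in a 5x5 patch -> IoU = 25 / 175.
score = iou((0, 0, 10, 10), (5, 5, 15, 15))
```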
Distance Estimation
After you know what objects are present and where they are in the image, you need to know where they are in the 3D world.
Since the camera is a 2D sensor, you first need to estimate the distance to the objects.
Relevant methods: Kalman Filter, Deep SORT
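The Kalman Filter and Deep SORT track objects over time; for a single frame, a common monocular shortcut is the pinhole-camera relation with an assumed real-world object size. A sketch (the 1.5 m car height and the calibration numbers are illustrative assumptions, not from the thread):

```python
def distance_from_height(focal_px, real_height_m, pixel_height):
    """Pinhole-camera distance estimate: Z = f * H / h.

    focal_px      -- focal length in pixels (from camera calibration)
    real_height_m -- assumed real-world object height in metres
    pixel_height  -- observed bounding-box height in pixels
    """
    return focal_px * real_height_m / pixel_height

# A car assumed ~1.5 m tall, 100 px high in an image with f = 1000 px:
z = distance_from_height(1000, 1.5, 100)   # -> 15.0 metres
```

The estimate is only as good as the size assumption, which is why trackers usually fuse it with other cues over time.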
Lane Detection
Another critical piece of information the car needs is where the lane boundaries are. You need to detect not only lane markings, but also curbs, grass edges, etc.
There are different methods for this - from traditional edge-detection-based methods to CNNs.
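To illustrate the traditional side: bright lane markings produce strong horizontal-intensity gradients against dark asphalt, which classical pipelines threshold and then fit lines to. A crude gradient sketch on a synthetic road image (the pixel values are made up):

```python
import numpy as np

def edge_map(gray):
    """Absolute horizontal gradient - a crude building block of
    traditional lane detection."""
    gx = np.zeros_like(gray, dtype=float)
    # Central difference along the horizontal axis.
    gx[:, 1:-1] = gray[:, 2:].astype(float) - gray[:, :-2].astype(float)
    return np.abs(gx)

# Synthetic road: dark asphalt (20) with one bright marking stripe (200).
road = np.full((4, 9), 20)
road[:, 4] = 200
edges = edge_map(road)
# The strongest responses sit at the two borders of the marking.
```

Real systems add smoothing, perspective warping and a line/spline fit on top of such an edge map.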
Driving Path Prediction
An alternative is to train a neural network that directly outputs the trajectory the car needs to drive. This can be used as a substitute for centering between the lane markings when they are not visible, for example.
Drivable Space Detection
The goal here is to detect which parts of the image represent the space where the car can physically drive.
The methods here are usually very similar to the semantic segmentation methods (see below).
Semantic Segmentation
Not all parts of the image can be described by a bounding box or a lane model, e.g. trees, buildings, the sky. Semantic segmentation methods classify each pixel in the image.
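A minimal sketch of that per-pixel classification step, assuming a network has already produced per-pixel class scores (the tiny logit values and the class names are hand-made for illustration):

```python
import numpy as np

# Hypothetical per-pixel class scores for a 2x3 image and 3 classes
# (0 = road, 1 = car, 2 = sky), shaped H x W x num_classes as a
# segmentation network would output them.
logits = np.array([
    [[2.0, 0.1, 0.3], [0.2, 3.1, 0.0], [0.1, 0.2, 4.0]],
    [[1.5, 0.0, 0.2], [1.7, 0.3, 0.1], [0.0, 0.1, 2.5]],
])

# Semantic segmentation = pick the highest-scoring class per pixel.
label_map = logits.argmax(axis=-1)
```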
Depth Estimation
The goal is to estimate the distance to every pixel in the image, in order to build a better 3D model of the surroundings.
Methods like stereo and structure-from-motion are now being replaced by self-supervised deep learning models working on single images.
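In the classical stereo case, depth follows directly from disparity via Z = f * B / d. A sketch (the camera numbers are illustrative):

```python
def depth_from_disparity(focal_px, baseline_m, disparity_px):
    """Classic stereo relation: Z = f * B / d.

    focal_px     -- focal length in pixels
    baseline_m   -- distance between the two cameras in metres
    disparity_px -- horizontal pixel shift of a point between the views
    """
    return focal_px * baseline_m / disparity_px

# f = 700 px, 0.5 m baseline, 10 px disparity -> 35 m depth.
z = depth_from_disparity(700, 0.5, 10)
```

Note the inverse relation: small disparities mean large depths, which is why stereo accuracy degrades quickly with distance.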
Visual Odometry
While we know the movement of the car from the wheel sensors and the IMU, estimating the motion directly from the camera images can be more accurate - for example, for the pitch angle.
Visual odometry estimates the 6-DoF movement of the camera between two frames.
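Chaining those per-frame 6-DoF estimates gives the camera trajectory. A sketch composing two homogeneous transforms (restricted to yaw rotation to keep it short; the motion values are made up):

```python
import numpy as np

def se3(yaw_rad, t):
    """4x4 homogeneous transform: yaw rotation plus translation t.

    A full VO pipeline estimates all 6 DoF (3 rotations, 3 translations);
    one rotation axis keeps the sketch compact.
    """
    c, s = np.cos(yaw_rad), np.sin(yaw_rad)
    T = np.eye(4)
    T[:3, :3] = [[c, -s, 0], [s, c, 0], [0, 0, 1]]
    T[:3, 3] = t
    return T

# Two consecutive per-frame motions, as VO might estimate them:
# drive 1 m forward, then 1 m forward while turning 90 degrees.
pose = se3(0.0, (1.0, 0.0, 0.0)) @ se3(np.pi / 2, (1.0, 0.0, 0.0))
position = pose[:3, 3]   # accumulated camera position
```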
Summary
There are of course many other computer vision problems that may be helpful, but this thread will give you an overview of the most important ones.
As you see, nowadays, deep learning methods (and especially CNNs) dominate all aspects of computer vision...
If you liked this thread and want to read more about self-driving cars and machine learning follow me @haltakov!
I have many more threads like this planned!
Interesting results from the small experiment...
This was actually a study reported in a Nature paper. Most people offer additive solutions (adding bricks) instead of subtractive solutions (removing the pillar).
In this example, the most elegant solution is to remove the pillar completely and let the roof lie on the block. It will be simpler, more stable and won't cost anything.
Some people quickly dismiss this option, assuming it is not allowed - but it actually is.
This isn't because people don't recognize the value, but because many don't consider the subtractive solution at all. Me included!
The paper shows that this happens a lot in real life, especially in regulation. People tend to add new rules, instead of removing old ones.
End-to-end approach to self-driving
I recently wrote about the classical software architecture for a self-driving car. The end-to-end approach is an interesting alternative.
The idea is to go directly from images to the control commands.
Let me tell you more...
This approach is actually very old, dating back to 1989 and the ALVINN model by CMU. It is a 3-layer neural network using camera images and a laser range finder.
A modern example is NVIDIA's PilotNet - a convolutional neural network with about 250k parameters, which takes the raw camera image as input and directly predicts the steering angle of the car.
No explicit lane boundary or freespace detection needed!
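A toy stand-in for the idea - not PilotNet's actual architecture: map a raw frame straight to a steering value. The weights are random and the layers are dense rather than convolutional; only the input/output shape of the mapping is meant seriously (66x200x3 is PilotNet's input size):

```python
import numpy as np

rng = np.random.default_rng(0)

# Random weights for a tiny two-layer network: this only illustrates
# the image-in, steering-out mapping, it has not learned to drive.
W1 = rng.normal(size=(64, 66 * 200 * 3)) * 0.01
W2 = rng.normal(size=(1, 64)) * 0.1

def predict_steering(image):
    x = image.reshape(-1)            # flatten the raw camera image
    h = np.tanh(W1 @ x)              # hidden representation
    return float(np.tanh(W2 @ h))    # steering value in (-1, 1)

frame = rng.random((66, 200, 3))     # fake camera frame
angle = predict_steering(frame)
```

Training such a model end-to-end means regressing recorded human steering angles directly from the corresponding camera frames.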
Open-Source Self-Driving Car Simulators
Do you want to play around with self-driving car software and gather some experience? Check out these open-source self-driving car simulators!
Details below
CARLA
CARLA is a great software developed by Intel. You can use it to work on any step of the pipeline, model different sensors, maps, traffic. It also integrates with ROS.
Another great simulator by Voyage - the self-driving company that was recently acquired by Cruise. It is built on the Unreal Engine and supports lots of features.
Useful online courses on self-driving cars
Here is a list of useful courses if you want to learn about software for self-driving cars.
Some of the courses are paid, but all platforms offer regular discounts and financial aid if you can't afford them.
Thread
Udacity Self-Driving Car Nanodegree
This program offers hands-on experience on all kinds of relevant topics like perception, localization, planning and control. It takes a lot of time, but it is worth it.