Santiago Profile picture
14 Feb, 13 tweets, 3 min read
A collection of the most interesting threads I've written about machine learning.

๐Ÿงต x ๐Ÿงต
Batch strategies to train neural networks.

Understanding Accuracy, Precision, and Recall.

Splitting datasets in train, validation, and test sets.

Feature Engineering and the Titanic dataset.

Is it reasonable for someone to dive into machine learning with a shallow knowledge of math?

20 machine learning questions that will make you think.

A quick introduction to some categories of Machine Learning problems.

โ€ข โ€ข โ€ข

Missing some Tweet in this thread? You can try to force a refresh
ใ€€

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

13 Feb
It takes a single picture of an animal for my son to start recognizing it everywhere.

Neural networks aren't as good as we are, but they are good enough to be competitive.

This is a thread about neural networks and bunnies.

๐Ÿงต๐Ÿ‘‡
A few days ago, I discussed how networks identify patterns and use them to extract meaning from images.

Let's start this thread right from where we ended that conversation.



(2 / 16)
Let's assume we use these four pictures to train a neural network. We tell it that they all contain a bunny ๐Ÿ‡.

Our hope is for the network to learn features that are common to these images.

(3 / 16)
Read 17 tweets
11 Feb
Today let's talk about why we keep "splitting the data" into different sets.

Besides machine learning people being quirky, what else is going on here?

Grab your coffee โ˜•๏ธ, and let's do it!

๐Ÿงต๐Ÿ‘‡
Imagine you are teaching a class.

Your students are getting ready for the exam, and you give them 100 answered questions, so they prepare.

You now need to design the exam.

What's the best way to evaluate the students?

(2 / 19)
If you evaluate the students on the same questions you gave them to prepare, you'll reward those who just memorized the questions.

That won't give you a good measure of how much they learned.

๐Ÿ˜‘

(3 / 19)
Read 21 tweets
10 Feb
Interviews aren't broken.

A lot of people complain about them, yet few have any experience hiring.

This is a rant full of my own biases and limited perspective โ€”a break from machine learning threads.

๐Ÿงต๐Ÿ‘‡
Building a team is incredibly hard.

Building a good team is even more challenging.

Building a good, diverse team is a nightmare.

๐Ÿ‘‡
Imagine you are starting a new company and you need a couple of developers.

โ–ซ๏ธ Where do you find them?
โ–ซ๏ธ How do you know they are any good?
โ–ซ๏ธ How much do you pay them?

How do you make somebody come and work for you, a nobody?

๐Ÿ‘‡
Read 25 tweets
9 Feb
Seriously though, how the heck can a computer recognize what's in an image?

Grab a coffee โ˜•๏ธ, and let's talk about one of the core ideas that makes this possible.

(I'll try to stay away from the math, I promise.)

๐Ÿ‘‡
If you are a developer, spend a few minutes trying to think about a way to solve this problem:

โ†’ Given an image, you want to build a function that determines whether it shows a person's face.

2/ Image
It gets overwhelming fast, right?

What are you going to do with all of these pixels?

3/
Read 27 tweets
7 Feb
Everything you need to know about the batch size when training a neural network.

(Because it really matters, and understanding it makes a huge difference.)

A thread.
Gradient Descent is an optimization algorithm to train neural networks.

The algorithm computes how much we need to adjust the model to get closer to the results we want on every iteration.

2/
We take samples from the training dataset, run them through the model, and determine how far away our results are from the ones we expect.

We call this "error," and using it, we compute how much we need to update the model weights to improve the results.

3/
Read 19 tweets
5 Feb
The one million dollar question:

"Is it reasonable for someone to dive into machine learning with a shallow knowledge of math?"

โ–ซ๏ธ The short answer is "yes."
โ–ซ๏ธ The more nuanced answer is "it depends."

Let me try and unpack this question for you.

๐Ÿงต๐Ÿ‘‡ Image
You can think about machine learning as a spectrum that goes all the way from pure research to engineering.

The more you move towards a research position, the more you can benefit from your math knowledge. If you move in the other direction, you'll get away with less of it.

๐Ÿ‘‡
I have friends that got a Ph.D. and became college professors.

For them, math is an absolute requirement!

Not only are they working on research projects, but they are teaching the next generation of scientists and engineers.

๐Ÿ‘‡
Read 11 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!