Tweet

Santiago

Follow @svpino

11 Mar, 10 tweets, 2 min read

Before you start building a machine learning model, you need a baseline.

I find it helpful to think about 3 different levels and tackle them in order.

Here is how I do this: ☕️🧵👇

▫️ Level 1: The human baseline

Before anything else happens, I find it useful to understand how humans do when solving the problem.

This gives me the ceiling that I should aspire to (or maybe even beat it, if I'm lucky!)

Sometimes, the human baseline will hint at whether a model is a feasible solution for the problem.

For example, if the data doesn't contain information that we can use to make predictions, humans will do very poorly. This will save us a lot of work!

To get a human baseline, you can annotate the data twice and treat one set of results as the ground-truth, and the other set as the predictions.

▫️ Level 2: The data-independent baseline

Now that we have an idea of how humans do with the problem, the next step is to build a straightforward baseline model that's completely independent of the data.

The idea is for the baseline model to simply guess the answers.

A couple of examples of independent baselines:

▫️ Random guessing the answer
▫️ Always returning the same answer

I like this baseline because I can put it together very quickly and it's the first step to measure any models.

▫️ Level 3: The model baseline

After I know how random guessing does, I put together the simplest possible model that can beat it.

The idea here is not to be fancy but focus on getting better than the level 2 baseline with very little work.

This baseline will tell us whether there's enough signal in the data to build a model with predictive capabilities.

After we have one, our goal is to beat it with a better model.

I like to keep replacing this level 3 baseline.

Every time I build a better model, it becomes the new yard-stick that I use to measure future progress.

Beating the baseline becomes a cool game to play.

Hey, I know sometimes machine learning sucks.

But we can do this shit together!

Follow me, and every week I’ll help you navigate this thing from doing it, failing at it, learning, and fixing it.

🦕

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @svpino

Santiago

@svpino

12 Mar

Do you like Star Wars?

Let's talk about John, a machine learning engineer.

He is building a model to predict whether people will like a future Star Wars episode.

This is the story of how John screwed things up.

☕️🧵👇

They are screening the new Star War episode, and the theater seems like a good place to collect some information to create the model.

John goes right in and gives an optional survey to the people sitting in the middle of the theater right before the movie starts.

John does the same for a couple of weeks, then goes back to his computer and puts together a model.

It turns out that the model sucks.

The new episode doesn't do well despite the model predicting the opposite.

What happened?

Read 8 tweets

Santiago

@svpino

12 Mar

“Autoencoders and rotten bananas” is the first story trying to teach you something new about machine learning.

A lot of work to get this out, and I hope you enjoy it as much as I did writing it.

digest.underfitted.io/archive/364757

https://twitter.com/HenryDouglas47/status/1370376327149187077?s=20

This is such a great question!

Why in the world do we need autoencoders when we have pretty good compression algorithms?

Autoencoders will never do better than our existing techniques (like for example, JPEG encoding).

https://twitter.com/HenryDouglas47/status/1370376327149187077?s=20

JPEG works for any image. Autoencoders only work for the type of images that it was trained on.

That's a big disadvantage.

But only if you are trying to use them to compress images.

Here is a killer application of autoencoders: anomaly detection.

Read 5 tweets

Santiago

@svpino

11 Mar

"Do I need a Ph.D. or a Master's degree to work as a machine learning engineer?"

No.

A lot of companies ask for degrees to weed out people that apply to jobs prematurely.

If you have the required skills and show your experience, the degree will not matter.

https://twitter.com/dry79911736/status/1369981859577298948

Both are valid paths and it will come down to your personal preference or what you need to accomplish

If you aren’t sure, online courses give you less risk upfront.

https://twitter.com/dry79911736/status/1369981859577298948

https://twitter.com/dennisjunior247/status/1369961361455874048?s=20

There are specific positions that do require (and will continue to do so) a degree in a related field.

But the industry has changed in the last few years. Fewer places require degrees anymore, and the tendency will continue in that direction.

https://twitter.com/dennisjunior247/status/1369961361455874048?s=20

Read 5 tweets

Santiago

@svpino

10 Mar

Let's talk a little bit about machine learning in the real world.

A seemingly simple classification problem that turns ugly quickly.

Hopefully, this gives you an idea of what it takes to put some of the pieces together.

Grab your ☕️, and let's do this thing!

🧵👇

Let's imagine a system where you could sell your stuff.

You submit a bunch of pictures of an object, and the system recommends a price range in which the object could be sold.

Let's focus on classifying the object from the pictures.

An image classification problem sounds simple enough. There are 1,000 examples out there!

Unfortunately, getting value from these systems requires a lot of considerations.

Let me throw a lot of ideas to you. This 🧵 is messy, just like a potential solution to the problem.

Read 19 tweets

Santiago

@svpino

9 Mar

Here is an underrated machine learning technique that will give you important information about your data and model.

Let's talk about learning curves.

Grab your ☕️ and let's do this thing!

🧵👇

Start by creating a model. Something simple. You are still exploring what works and what doesn't, so don't get fancy yet.

We are now going to plot the loss (model error) vs. the training dataset size. This will help us answer the following questions:

▫️ Do we need more data?
▫️ Do we have a bias problem?
▫️ Do we have a variance problem?
▫️ What's the ideal picture?

Read 16 tweets

Santiago

@svpino

8 Mar

This week, I'll be on Twitter Spaces with amazing company!

We'll be talking about some cool machine learning techniques. Each one of us, a different one.

Save the date, and you can join us from your mobile phone right from the Twitter application.

We are planning to record this session, but... But, we will be recording the screen of an iPhone and some other weird stuff to try and get the audio out.

Not the best process, but we will try to get clean audio out of this.

@haltakov

If everything works, @haltakov will make the audio available (likely in the form of a podcast.) Where and how are details that we'll share when we know.

If the audio comes out too crappy, we will probably not bother because it won't be useful for you anyway.

Read 6 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Share this page!

Santiago

Try unrolling a thread yourself!

More from @svpino

Santiago

Santiago

Santiago

Santiago

Santiago

Santiago

Did Thread Reader help you today?

Like this author's thread?