Santiago

Follow @svpino

2 Mar, 16 tweets, 3 min read

Let's talk about how you can build your first machine learning solution.

(And let's make sure we piss off half the industry in the process.)

Grab that ☕️, and let's go! 🧵

Contrary to popular belief, your first attempt at deploying machine learning should not use TensorFlow, PyTorch, Scikit-Learn, or any other fancy machine learning framework or library.

Your first solution should be a bunch of if-then-else conditions.

Regular, ol' conditions make for a great MVP solution to a machine learning wannabe system.

Pair those conditions with a human, and you have your first system in production!

Conditions handle what they can. Humans handle the rest.
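Here's a minimal sketch of what "conditions plus a human" could look like in code. The expense-claim domain, the `ExpenseClaim` fields, the thresholds, and the `triage` function are all hypothetical, made up for illustration. The point is only the shape: rules decide the clear cases, and everything else lands in a human queue.

```python
from dataclasses import dataclass

NEEDS_HUMAN = "needs_human"


@dataclass
class ExpenseClaim:
    # Hypothetical fields, chosen only for this example.
    amount: float
    has_receipt: bool
    category: str


def triage(claim: ExpenseClaim) -> str:
    """Return a decision when a rule clearly applies, else defer to a person."""
    # Rule 1 (assumed policy): no receipt means automatic rejection.
    if not claim.has_receipt:
        return "reject"
    # Rule 2 (assumed policy): small, routine claims are approved automatically.
    if claim.amount <= 50 and claim.category in {"meals", "transport"}:
        return "approve"
    # Conditions handle what they can. Humans handle the rest.
    return NEEDS_HUMAN


claims = [
    ExpenseClaim(12.5, True, "meals"),
    ExpenseClaim(900.0, True, "equipment"),
    ExpenseClaim(30.0, False, "transport"),
]
decisions = [triage(c) for c in claims]
```

Every claim that comes back as `NEEDS_HUMAN` is also a labeled example in the making, which helps if you later decide the problem does need a model.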

"But, wait! Conditions? How in the world are we going to get any results with that?"

I'm glad you asked. There are three possibilities:

1. Conditions are all you'll ever need.
2. Conditions give you a mediocre baseline.
3. There's nothing you can predict with conditions.

Turns out that when all you have is a hammer, everything starts looking like a nail. Avoid this trap.

Do you really need machine learning to solve a problem?

Ask yourself this question 20 times before moving on. You'll be surprised at what you find.

The first rule of machine learning: you may not need machine learning.

Google said it best.

Sometimes you need machine learning to get good results, but a few conditions can give you a lot of benefits.

Pair this with a human, and you have a solid system.

For example, can you predict invalid samples without using machine learning at all? Let humans handle the rest.
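As a rough sketch of that idea, plain checks can reject samples that are obviously invalid and pass the rest to people. The field names (`text`, `age`) and the limits below are assumptions invented for this example, not something from a real dataset.

```python
def invalid_reasons(sample: dict) -> list:
    """Collect the reasons a sample is obviously invalid (empty list if none)."""
    reasons = []
    text = sample.get("text")
    if not isinstance(text, str) or not text.strip():
        reasons.append("empty text")
    elif len(text) > 10_000:  # assumed upper bound
        reasons.append("text too long")
    age = sample.get("age")
    if not isinstance(age, int) or not 0 <= age <= 120:  # assumed valid range
        reasons.append("age out of range")
    return reasons


def split_samples(samples):
    """Separate samples the rules reject from those a human should review."""
    rejected, for_humans = [], []
    for sample in samples:
        if invalid_reasons(sample):
            rejected.append(sample)
        else:
            for_humans.append(sample)
    return rejected, for_humans


samples = [
    {"text": "Looks fine", "age": 34},
    {"text": "   ", "age": 34},
    {"text": "Also fine", "age": 250},
]
rejected, for_humans = split_samples(samples)
```

Keeping the reasons around (instead of a bare True/False) makes it easy to see which rule is doing the work and which one never fires.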

And of course, there's the case where there's nothing you can do with simple conditions.

This is usually the case when dealing with unstructured data (images, videos, audio.)

But you can still follow the same "simplicity" principle.

Find out what the low-hanging fruit is and focus on that. Then let humans deal with the hard cases.

Building a model that finds what's wrong in a circuit board is much harder than finding images that aren't circuit boards at all.

Do that instead.

Imagine you need humans reviewing 1,000 pictures every day to decide which are broken circuit boards.

20% of those images aren't even circuit boards.

Your model can trim those images. Now your humans only have to deal with 80% of the load.

You just saved 20% of their time!
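The arithmetic is simple enough to write down. This sketch uses the numbers from the example (1,000 images, 20% junk). The `filter_recall` parameter is my own addition, not part of the thread: it assumes the filter might catch only some of the junk, and that it never throws away a real circuit board.

```python
def human_workload(daily_images, junk_fraction, filter_recall=1.0):
    """Return (images left for humans, fraction of their load saved).

    filter_recall is the share of junk images the filter actually removes.
    Assumption: the filter never discards a real circuit board.
    """
    removed = round(daily_images * junk_fraction * filter_recall)
    remaining = daily_images - removed
    saved = removed / daily_images
    return remaining, saved


# The thread's scenario: a filter that catches all non-circuit-board images.
remaining, saved = human_workload(1000, 0.20)

# A less optimistic scenario: the filter only catches half of the junk.
remaining_half, saved_half = human_workload(1000, 0.20, filter_recall=0.5)
```

With a perfect filter, humans review 800 images instead of 1,000. Even a filter that catches only half of the junk still takes 100 images off their plate every day.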

The power of this idea is in the approach, not in any specific technique.

Try to get an end-to-end solution as soon as possible.

Every time you frame the problem with a human in the loop, you give yourself a huge advantage!

If you try building a system capable of replacing humans from day 1, you'll have a long road ahead.

Grow to that, but don't start there.

As long as you can avoid the "do-the-best" mentality, you should. And most times, you can.

The pragmatic approach that delivers with minimal headaches:

"Build the simplest system that provides value under human supervision."

Everything else is just gravy.

Building machine learning that works outside a notebook is hard. It sucks for me, and probably for you too.

Follow me if you want some company doing this thing!

As soon as the office closes, I come here every day to tell you what I learned. That might help!

https://twitter.com/jamwalvikram/status/1366785366485835785?s=20

I agree with you here. That's the most important point.

https://twitter.com/1anre/status/1366852840547827722?s=20

The ML industry is heavily tilted towards academics, which has a very different purpose.

More from @svpino

Santiago

@svpino

4 Mar

When designing your neural network, you first want to focus on your training loss.

Overfit the heck out of your data and get that loss as low as you can!

Only after that should you start regularizing and focusing on your validation loss.

☕️🧵👇

Always try to overfit first.

Getting here is a good thing: you know your model is working as it should!

If you can't get your model to overfit, there's probably something wrong with your configuration.

How do you overfit? Pick a model that's large enough for the data.

Large enough means it has enough parameters (layers, filters, nodes) to memorize your data.

You can also try to overfit a portion of your dataset. Fewer samples will be easier to overfit.
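To make the "overfit a small portion first" check concrete, here's a toy sketch in plain Python. It is not a neural network: it uses a tiny linear model on quadratic features as a stand-in, with exactly as many parameters as samples, so it can memorize them. The data points, learning rate, and step count are arbitrary choices for illustration.

```python
def features(x):
    # Three features, so the model has three parameters.
    return [1.0, x, x * x]


def predict(w, x):
    return sum(wi * fi for wi, fi in zip(w, features(x)))


def mse(w, data):
    return sum((predict(w, x) - y) ** 2 for x, y in data) / len(data)


def fit(data, steps=2000, lr=0.1):
    """Plain batch gradient descent on the mean squared error."""
    w = [0.0, 0.0, 0.0]
    for _ in range(steps):
        grad = [0.0, 0.0, 0.0]
        for x, y in data:
            err = predict(w, x) - y
            for j, fj in enumerate(features(x)):
                grad[j] += 2 * err * fj / len(data)
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w


# A tiny subset: three samples, three parameters, so memorizing is possible.
subset = [(-1.0, 2.0), (0.0, -1.0), (1.0, 0.5)]
initial_loss = mse([0.0, 0.0, 0.0], subset)
final_loss = mse(fit(subset), subset)
```

If the training loss on a handful of samples doesn't drop close to zero, that points to a problem in the setup (data pipeline, loss, learning rate) rather than a need for regularization.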

Read 9 tweets

Santiago

@svpino

3 Mar

The more you grow, the more you realize that the language you use doesn't matter at all.

JavaScript, Python, or whatever you use represents exactly $0 of your take-home pay every month.

The value you produce using these languages is the remaining 100%.

https://twitter.com/dannysteenman/status/1367226076716498945

I’ve never had a conversation with a client that cared about a specific language, other than those wanting to build on top of an existing codebase.

https://twitter.com/stephenwnuchia/status/1367219373442695170

Every line of code is a liability.

Corollary: The best code is the one nobody wrote.

Read 4 tweets

Santiago

@svpino

3 Mar

The two questions related to neural networks that I hear most often:

▫️ How many layers should I use?
▫️ How many neurons per layer should I use?

There are some rules of thumb that I'll share with you after you get your ☕️ ready.

🧵👇

First, let's get this out of the way:

A neural network with a single hidden layer can model any function regardless of how complex it is (assuming it has enough neurons.)

Check the "Universal Approximation Theorem" if you don't believe me.

↓

So, if we can do it all with a single layer, why bother adding more layers?

Well, it turns out that a neural network with a single layer will overfit really quickly.

The more neurons you add to it, the better it will become at memorizing stuff.

That is bad news.

↓

Read 12 tweets

Santiago

@svpino

2 Mar

If you want to start with Machine Learning and need some guidance, I want to give you access to my entire course for $10. Today only.

And if you don't like it, you pay $0. But I promise you'll love it!

Thanks to the 100+ of you who already bought it!

👉 gumroad.com/l/kBjbC/50000

If you can’t afford this, reply below explaining how you think this will help you. I’ll give away 10 copies for free.

Thanks to everyone that has taken advantage of this offer so far!

There are still a few more hours left.

If starting with machine learning feels overwhelming, then this is for you.

gumroad.com/l/kBjbC/50000

Read 4 tweets

Santiago

@svpino

2 Mar

Some hard skills that I use every day as a Machine Learning Engineer:

▫️ A whole lot of Python
▫️ TensorFlow, Keras, Scikit-learn
▫️ AWS SageMaker
▫️ Jupyter
▫️ SQL
▫️ Probabilities, Statistics
▫️ Google Spreadsheets (seriously!)
▫️ Software Engineering

https://twitter.com/Ivy48462095/status/1366741134337212422?s=20

General notions of linear algebra are useful, especially when you want to understand how certain things happen behind the scenes.

That being said, I don't consider myself an expert and it's not part of the day-to-day.

https://twitter.com/sirwallax/status/1366708106441486339?s=20

You could also use Excel.

I use Google Spreadsheets because it's in the cloud, and it's convenient for me. I don't have Microsoft Office installed, and as long as spreadsheets aren't crazy large, Google has what I need.

Read 6 tweets

Santiago

@svpino

1 Mar

Let's talk about learning problems in machine learning:

▫️ Supervised Learning
▫️ Unsupervised Learning
▫️ Reinforcement Learning

And some hybrid approaches:

▫️ Semi-Supervised Learning
▫️ Self-Supervised Learning
▫️ Multi-Instance Learning

Grab your ☕️, and let's do this👇

Supervised Learning is probably the most common class of problems that we have all heard about.

We start with a dataset of examples and their corresponding labels (or answers.)

Then we teach a model the mapping between those examples and the corresponding label.

[2 / 19]

The goal of these problems is for a model to generalize from the examples that it sees to later answer similar questions.

There are two main types of Supervised Learning:

▫️ Classification → We predict a class label
▫️ Regression → We predict a numerical label

[3 / 19]

Read 19 tweets
