Santiago Profile picture
23 Apr, 14 tweets, 2 min read
I've been teaching people how to start with machine learning for more than a year now.

This is a thread with what I've learned and some advice if you are looking to start.

↓ 1/14
People enjoy the process of getting ready to start something new.

Like the first time they go to the gym: they buy supplements, shorts, shoes, and a new headband.

Unfortunately, this is not enough.

↓ 2/14
Most people never stop preparing.

They keep collecting books, tutorials, the "best" videos and ask the same "how to start" questions.

The only thing they forget: taking a concrete step forward.

↓ 3/14
The best course you can watch is the one that you already know of.

The best book is already in your bookcase.

Stop chasing "the best" and pay attention to what's right in front of you.

↓ 4/14
People think they need a roadmap.

They don't. A roadmap is distracting.

What they really need is a single step that sends them in the right direction.

The next step always becomes clear after you start walking.

↓ 5/14
Almost everyone who asks for problems to solve isn't ready to solve problems in the first place.

↓ 6/14
Those who tell you that machine learning is easy don't really understand what machine learning is.

Those who tell you that it is hard to start with machine learning have never started themselves.

↓ 7/14
Those who make a huge deal of needing to learn Python instead of using their favorite language rarely make it.

If you are afraid of learning a new syntax, wait until you find out what machine learning really is.

↓ 8/14
Those who make good progress almost always do it as part of a group.

Find like-minded people that want to take this journey with you.

It will increase your odds significantly.

↓ 9/14
If you finish a course, and the only thing you have to show for it is inside your head, you didn't do it right.

Apply what you learn, solve exercises, write a journal, build a portfolio, tweet about it.

↓ 10/14
Most people are scared about their math skills. Funny enough, it's their programming skills what really hold them back.

Invest in becoming a good developer.

↓ 11/14
Machine learning is a hard, life-long commitment.

If you are looking for quick money, more job opportunities, and less complexity, there are better options for you out there.

↓ 12/14
Machine learning is hard.

But if you put in the work and stick with it, nothing will ever compare to your ability to solve some of the hardest problems humans face.

↓ 13/14
A few weeks ago, I wrote 16 takeaways that are very relevant for this conversation, so check that thread out:



14/14

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

22 Apr
A 13-tweet introduction to one of the most basic structures used in machine learning: a tensor.

Understanding how tensors work is fundamental. They aren't complex but working with them may get confusing if you don't understand all the pieces.

Let's solve that today.

↓ 1/13
Three primary attributes define a tensor:

• Rank: Number of axes.
• Shape: Number of dimensions per axis.
• Data type: Type of data contained in it.

↓ 2/13
The rank of a tensor refers to the tensor's number of axes.

Examples:

• Rank of a matrix is 2.
• Rank of a vector is 1.
• Rank of a scalar is 0.

↓ 3/13
Read 14 tweets
21 Apr
700 people have watched "How To Get Started With Machine Learning." 86 have rated it.

Let's celebrate!

• You can buy the course today for $7.
• $0 if you don't like it.
• Back to $15 tomorrow.

gum.co/kBjbC/only7

If you can't afford it, keep reading:

For every copy I sell today, I'll give away one for free.

To apply for the free copy, reply below with why you think this course will help you.

I'll prioritize the best stories I read.

If you want to support my content, like/retweet this thread, so more people see it.
So far, 8 copies sold, and 8 free copies shared.

Thanks for the continuous support! It helps tremendously!
Read 8 tweets
21 Apr
Creating a good machine learning model is really sexy. That's what's different and where everyone focuses all of their attention.

But machine learning is much more than that.

A thread with a few thoughts about the real job.

1/9
Machine learning engineers spend a lot of time designing and training new models, but this is just a small fraction of their job.

2/9
In reality, dealing with data and operationalizing models is much more time-consuming and sometimes even harder and more involved than creating the models in the first place.

3/9
Read 10 tweets
20 Apr
The backbone of my end-to-end machine learning setup:

• A 48-page Field Notes
• Python
• NumPy, Pandas, Matplotlib, OpenCV
• Scikit-Learn, XGBoost
• TensorFlow
• Google Colab, Jupyter, VSCode
• Docker, Flask
• AWS SageMaker
I personally don't use C/C++.

That doesn't mean it's not useful. I know plenty of people in the industry that rely on C/C++ to do their work.

It just means that I personally haven't needed it.

There are a lot of satellite tools that I have to use depending on the project. Kinesis, Airflow, SQS... the list is endless.

I just tried to list the core of what I need, and it rarely varies.

Read 5 tweets
20 Apr
Yesterday, @PrasoonPratham posted a step-by-step guide to solve the Titanic challenge on Kaggle.

I thought it'd be fun to engineer some features that can help build an even better model.

Here are some ideas worth considering.

↓ 1/10
Attached you can find the original set of input variables that come with the data to solve the problem.

We are going to transform some of these into features that should help our model produce better results.

This is what Feature Engineering is all about.

↓ 2/13
Keep in mind that these are just hypotheses that you'll have to try and validate.

Some of these suggestions might not improve the results or could even make the model perform worse.

This is an exercise to try and think creatively about the data we are getting.

↓ 3/13
Read 15 tweets
19 Apr
Is 10 twice as worse as 5? Sometimes it is, but sometimes it's even worse.

This is the question I always ask myself when deciding how to penalize my models.

Read on for more details and a couple of examples:

↓ 1/11
When we are training a machine learning model, we need to compute how different our predictions are from the expected results.

For example, if we predict a house's price as $150,000, but the correct answer is $200,000, our "error" is $50,000.

↓ 2/11
There are multiple ways we can compute this error, but two common choices are:

• RMSE — Root Mean Squared Error
• MAE — Mean Absolute Error

Both of these have different properties that will shine depending on the problem you want to solve.

↓ 3/11
Read 12 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!