Santiago Profile picture
26 Dec, 13 tweets, 2 min read
11 key concepts of Machine Learning.

β€” Supervised Learning Edition β€”

πŸ§΅πŸ‘‡
😜

Before starting, remember that, if you follow me, one of your enemies will be immediately destroyed (and you'll get to read more of these threads, of course.)

And if you don't follow me, well, you just hurt my feelings.

😜
1. Labels

(Also referred to as "y")

The label is the piece of information that we are predicting.

For example:

- the animal that's shown in a picture
- the price of a house
- whether a message is spam or not

πŸ‘‡
2. Features

(Also referred to as "x")

These are the input variables to our problem. We use these features to predict the "label."

For example:

- pixels of a picture
- number of bedrooms of a house
- square footage of a house

πŸ‘‡
3. Samples

(This is also known as "examples.")

A sample is a particular instance of data (features or "x.") It could be "labeled" or "unlabeled."

πŸ‘‡
4. Labeled sample

Labeled samples are used to train and validate the model. These are usually represented as (x, y), where "x" is a vector containing all the features, and "y" is the corresponding label.

For example, a labeled sample could be:

([3, 2, 1500], 350000)
5. Unlabeled sample

Unlabeled samples contain features, but they don't contain the label: (x, ?)

We usually use a model to predict the labels of unlabeled samples.

πŸ‘‡
6. Model

A model defines the relationship between features and the label.

You can think of a model as a set of rules that, given certain features, determines the corresponding label.

For example, given the # of bedrooms, bathrooms, and square footage, we get the price.

πŸ‘‡
7. Training

Training is a process that builds a model.

We show the model labeled samples during training and allow the model to gradually learn the relationships between features and the label.

πŸ‘‡
8. Validation

Validation is the process that lets us know whether a model is any good.

Usually, we run a set of (unseen) labeled samples through a model to ensure that it can predict the labels.

πŸ‘‡
9. Inference

Inference is the process of applying a trained model to unlabeled samples to obtain the corresponding labels.

In other words, "inference" is the process of making predictions using a model.

πŸ‘‡
10. Regression

A regression model predicts continuous values, for example:

- the value of a house
- the price of a stock
- tomorrow's temperature

πŸ‘‡
11. Classification

A classification model predicts discrete values, for example:

- the picture is showing a dog or a cat
- the message is spam or not
- the forecast is sunny or overcast

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

27 Dec
An introduction to one of the the most basic structures used in machine learning: a tensor.

πŸ§΅πŸ‘‡
Tensors are the data structure used by machine learning systems, and getting to know them is an essential skill you should build early on.

A tensor is a container for numerical data. It is the way we store the information that we'll use within our system.

(2 / 16)
Three primary attributes define a tensor:

▫️ Its rank
▫️ Its shape
▫️ Its data type

(3 / 16)
Read 16 tweets
22 Dec
The 8-step quick-start guide to learn Machine Learning.

πŸ§΅πŸ‘‡
1⃣ Start with Python 🐍

Yes, you can do other languages, but Python is by far the most straightforward option.

πŸ‘‡
2⃣ Get familiar with numpy, pandas, and matplotlib

These three libraries are probably the most common Python libraries you'll have to use every day.

(Even if you don't end up doing machine learning, these libraries are awesome and useful.)

πŸ‘‡
Read 10 tweets
21 Dec
Rumors are going around that Twitter cripples tweets that include links.

There's nothing in their official documentation, but a lot of people think that's the case.

I thought this through, and I will not play along.

πŸ§΅πŸ‘‡
Allegedly, if I disable the links, my tweets will get much more impressions because Twitter will push them to more people.

Assuming this is the case, what's the cost?

My followers will have to start copying, then pasting any links that I post.

(2 / 4)
Seems like a small nuance, but reading comments on tweets with disabled links, the process is very error-prone and a lot of people have trouble accessing the content.

This is not what I want.

(3 / 4)
Read 4 tweets
3 Dec
Transitioning from Software Engineering to Machine Learning.

πŸ§΅πŸ‘‡
I'll tell you my story.

It might work for you. It might not.

Hopefully, it gives you another perspective. Hopefully, it helps.

(2 / 14)
Many people see "Software Engineering" and "Machine Learning Engineering" as two completely different specialization areas.

There are many differences, for sure.

But I personally like to think about them as a single, fluid, all-encompassing position.

(3 / 14)
Read 14 tweets
2 Dec
The HTML + CSS Twitter conspiracy.

A tread πŸ§΅πŸ‘‡
A lot of people out there recommend starting with HTML and CSS to aspiring developers.

They suggest this combination is a stepping stone for you to reach your goals.

That's nonsense.

(2 / 9)
There's absolutely nothing wrong with HTML and CSS.

But they aren't necessarily the foundation that you need when starting out.

Yes, they are simple to learn compared to a fully-fledged programming language, but they are also very different.

(3 / 9)
Read 9 tweets
1 Dec
Django versus Flask versus FastAPI.

🐍 πŸ§΅πŸ‘‡
Django

▫️ Rapid development
▫️ A lot of out-of-the-box functionality
▫️ Easy for building complex, full web applications
▫️ MVC design paradigm
▫️ Robust security features
▫️ Extensible (a lot of components out there)
▫️ Large community

πŸ‘‡
Flask

▫️ Very light
▫️ Doesn’t make decisions for you
▫️ Doesn’t bring anything that you don’t need
▫️ Modular, so it’s easy to extend
▫️ You can plug in your favorite ORM
▫️ Great documentation
▫️ Very easy to start with
▫️ Large community

πŸ‘‡
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!