Santiago Profile picture
25 Nov, 14 tweets, 6 min read
Working on problems is the best way to learn Machine Learning.

Here are 10 projects to start your journey.

🧵👇
I picked all 10 projects from Kaggle.

When you are getting started, having a community ready to help is very important.

Also, every one of these problems has been solved by many people, and you can find those answers if you get stuck!

👇
I sorted the problems in the way I'd recommend you to start.

They more or less increase in complexity as you move through the list.

Let's get started!

👇
1. Titanic: Machine Learning from Disaster

This is the perfect project to get started with classification algorithms.

It will teach you some data engineering practices, and you can solve it with simple Decision Trees.

kaggle.com/c/titanic

(1 of 10)
2. House Prices

Now that you understand classification problems, it's time to look into regression.

Here you'll predict the price of a house given its characteristics.

kaggle.com/c/house-prices…

(2 of 10)
3. Wine Quality

Another classic problem where you can explore regression and classification algorithms is the Wine Quality challenge.

kaggle.com/rajyellow46/wi…

A great candidate to reaffirm what you've learned so far.

(3 of 10)
4. Mall Customer Segmentation Data

Now it's time to look into Unsupervised Learning techniques, and this dataset is a great start:

kaggle.com/vjchoudhary7/c…

Your goal here is to determine which customers are a good target for your marketing department.

(4 of 10)
5. Digit Recognizer

This is the "Hello World" of Computer Vision, so you want to start here before getting any deep.

kaggle.com/c/digit-recogn…

You'll learn how to "read" images to determine the specific digit they show.

(5 of 10)
6. Dogs vs. Cats

After getting the hang of neural networks, you can upgrade to a more complex Computer Vision problem: the Dogs vs. Cats dataset.

kaggle.com/c/dogs-vs-cats

Your goal is simple: does the input picture show a dog or a cat?

(6 of 10)
7. Bag of Words Meets Bags of Popcorn

As soon as you want to get into Natural Language Processing, this is a great problem to pick up:

kaggle.com/c/word2vec-nlp…

Here you'll be doing sentiment analysis on IMDB movie reviews.

(7 of 10)
8. The Walmart Challenge

Time series analysis is another big area covered by Machine Learning, and the Walmart dataset will get you started.

kaggle.com/bletchley/cour…

Here you will be predicting Walmart's weekly sales based on past data.

(8 of 10)
9. The Movies Dataset

Do you want to build a recommendation system (similar to what Netflix does)? You can start here:

kaggle.com/rounakbanik/th…

Recommendation systems are very useful and have multiple applications.

(9 of 10)
THIS PAGE INTENTIONALLY LEFT BLANK

(10 of 10)
11. Open Images 2019 - Object Detection

I wanted to end the list with another extremely useful application of Computer Vision: Object Detection.

kaggle.com/c/open-images-…

Here you have 9 million images to play with.

(11 of 10)

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

24 Nov
Machine Learning doesn't need to be overwhelming.

Here is a strategy that you can use to get started without too many distractions.

🧵👇
If you start today, you'll probably feel overwhelmed by how much —apparently— you need to understand.

But it doesn't need to be like that.

You can take a much more practical approach to learn what you need and start providing value right away.

👇
Instead of starting "from the beginning," you can hack your way "from within."

The idea is simple:

1. Pick a simple problem —or an area— that's interesting to you.

2. Take the necessary steps to learn how to solve that problem.

3. Keep adding complexity as you see fit.

👇
Read 11 tweets
22 Nov
10 questions that spark conversations, make you think, and give you a solid foundation of practical Machine Learning.

🧵👇
(Some) interviews are broken.

They focus on trivia and expect candidates to recall concepts that aren't even relevant for the job.

This is garbage.

Instead, focus on problems that scientists and engineers face every day while doing their jobs: 👇
Acme Inc. is building a model to classify images in several different categories.

Unfortunately, they don't have a lot of images for some of the classes.

How would you handle such an imbalanced dataset?

(1 of 10)
Read 13 tweets
21 Nov
A plan to get a job as a Machine Learning Engineer.

🧵👇
Put in the work, level up, and get ready to demonstrate that you can deliver value.

You'll have to answer technical questions. Study up.

(If you aren't prepared, you won't pass the first round of interviews.)

(1 of 10)
Focus on showing, not telling.

What can you do today that will serve you as an asset when justifying your experience?

Creating a strong portfolio showing what you are capable of is the most important step you can take.

(2 of 10)
Read 12 tweets
19 Nov
Everything I know about great Software Developers.

🧵👇
1. Great Software Developers are humble.

They never put themselves above anyone else. They are willing to leverage existing solutions and listen to others.

(1 of 15)
2. Great Software Developers are self-motivated to learn.

They never stop improving and never get complacent. They understand the importance of growing their skills.

(2 of 15)
Read 24 tweets
24 Oct
33 applications of Machine Learning, 3 different categories.

(And there are so many more it's not even funny!)

It doesn't matter what you enjoy in life. There's something here for you!

🧵👇
▫️ Natural Language Processing Applications

1. Speech recognition
2. Answering questions
3. Translation
4. Generating content
5. Summarizing documents
6. Sentiment analysis
7. Virtual assistants
8. Classifying text
9. Autocorrection
10. Urgency detection
11. Text extraction

👇
▫️ Computer Vision Applications

1. Face recognition
2. Image captioning
3. Image coloring
4. Object detection
5. Image classification
6. Pose estimation
7. Image transformation
8. Image analysis
9. Automatic drone inspections
10. Defect detection
11. Image restoration

👇
Read 4 tweets
22 Oct
A quick, non-technical explanation of Dropout.

(As easy as I could make it.)

🧵👇
Remember those two kids from school that sat together and copied from each other during exams?

They aced every test but were hardly brilliant, remember?

Eventually, the teacher had to set them apart. That was the only way to force them to learn.

👇
The same happens with neural networks.

Sometimes, a few hidden units create associations that, over time, provide most of the predictive power, forcing the network to ignore the rest.

This is called co-adaptation, and it prevents networks from generalizing appropriately.

👇
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!