Santiago Profile picture
15 Jun, 19 tweets, 4 min read
Do you know what's holding you back?

The same boring projects than everyone else is working on. How do you break off the mold and make a difference?

Thread: 7 machine learning projects that will teach you the technical skills you need to succeed out there.
Today, companies are dumping insane amounts of money on people with the right skills.

But here we are, showing up with the same "MNIST Digit Recognition" and "Iris dataset" experience.

These are good to start, but you need to take your learning to another level.
The good news for you: Most people can't be bothered.

Most people will bookmark this thread and will never do anything with it.

Even if you do one of these 7 projects, you'll be doing more than 99% of everyone who reads this!

It has never been easier to stand out!
Why should you care about solving these projects?

1. They will give you experience.
2. They will stretch your abilities.
3. They will force you to learn.
4. They will show off beautifully in your portfolio.
5. They will make you stand out.

The first one is critical!
What makes these projects "interesting"?

They aren't the same projects that everyone is working on.

These are hard but doable. They are different, and they match the skills that companies are hiring for.

These projects will get you a job!
Here are your 7 projects:

1. Detect movement in a video
2. Stock trading bot
3. Track a person throughout a video
4. Speech emotion recognition
5. License number plate recognition
6. Customer segmentation
7. Identify the language used in a text

Let's now talk specifics.
First, I'm not adding a lot of details on purpose.

These are open-ended ideas that you can take whereever you want, depending on your skills and interest.

Creativity is a feature.
Where can you get the data for these?

• 1, 3: You won't need any data.
• 4, 6: Link to datasets below.
• 2, 5, 7: Easy to find online.

Speech emotion recognition dataset:
drive.google.com/file/d/1wWsrN2…

Customer segmentation dataset:
drive.google.com/file/d/19BOhwz…
You are going to need a few things to solve these problems. Some suggestions:

1. Python
2. Deep learning
3. Regular machine vision algorithms (OpenCV library)
4. Facebook's fastText model
5. Reinforcement learning

This is not an exhaustive list.
Something to keep in mind:

There's no "Solving these problems at 100%." These are hard, and there's no end to any of these.

But, you can definitely come up with a solution that's good enough for a controlled scenario. That's a win!
For example, building an automatic trading bot for one stock that beats one arbitrary index is possible.

Doing that for any stock is not that easy.

Detecting *every* language out there? Yeah, that's hard too.
A rough list of steps:

• Pick a problem
• Define your boundaries
• Break it down in smaller pieces
• What's out there that can help you?
• Learn what you need
• Build as much as you can
• Publish it online for others to see

Important: write about what you learned.
I post threads like this every week.

Stay tuned as I help you get to the core of practical machine learning.

You can find the rest of my threads here: @svpino.
TL;DR:

7 machine learning projects you can build:

1. Detect movement in a video
2. Stock trading bot
3. Track a person throughout a video
4. Speech emotion recognition
5. License number plate recognition
6. Customer segmentation
7. Identify the language used in a text
For automatic trading bots, there's a course in Udacity called "Machine Learning for Trading."

It will give you what you need.

Here are another 5 ideas:

1. Recommendation system for e-commerce
2. House Price prediction
3. Music Genre detection
4. Sentiment Analysis from tweets
5. Titanic Survival ← Start here.
This is a great idea, for sure! You have “unlimited” high-quality images to play with.
The Automatic Trading Bot seems to be the most popular project in the list.
🪨 📄 ✂️ is another great problem you can try using your webcam.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

15 Jun
If you are planning to get started with Machine Learning:

My introductory course is on sale:

• 50 copies @ $5, starting right now.
• Free copies for those who can't afford this.

gum.co/kBjbC/five

If you want to support my content, like/retweet this. Everyone wins.
10 copies sold, 40 copies left @ $5.
20 copies sold, 30 copies left @ $5.
Read 11 tweets
14 Jun
How close are we to building a truly intelligent agent?

Most scientists think we are still decades away, but today, a group of scientists from @DeepMind claims they know how to get there.

Let's talk about what's going on.
What is "Artificial General Intelligence" (AGI)?

An agent capable of learning any intellectual task than a person can also learn.

Today, AI has been limited to systems that can learn particular tasks. A system that can learn anything you teach it, just like a human, is AGI.
Unfortunately, there's no way to build such a general, intelligent agent without formulating a custom solution for every individual task.

This sucks. This doesn't scale. This doesn't get us to AGI.

But maybe we aren't that far off...
Read 16 tweets
12 Jun
6 lies you have been told about machine learning:

1. You need a lot of math to start
2. You need a Ph.D. to get a job
3. You always need a lot of data
4. You need to buy expensive hardware
5. It's hard to become proficient in it
6. It's the solution for most problems

Bullshit.
In the last 6 months, I've posted more than 100 threads here on Twitter talking about machine learning and how you can build a career on it.

And I'm just getting started!

Stay tuned. A lot more is coming.
First misconception: All machine learning is hardware-hungry.

Deep learning stretches you, but outside that, it gets much better.

If you need GPUs/TPUs, there are many free/cheap options you can use, especially while learning.

Read 9 tweets
11 Jun
Software developers want to get into machine learning.

Many make the same mistakes. I've seen a few, and I have some ideas on how to avoid them.

This is what I've learned ↓
Lesson 1

Most people love the idea of starting something new. Only a few take the first step.

Preparing for something new is fun and exciting. It can also turn into glorified procrastination.

Stop collecting resources. Take what's right in front of you and run with it.
Lesson 2

Learning is a marathon, not a sprint. Strap yourself for a long, lifelong road.

If you are looking to make a quick buck, look elsewhere. If you are looking for shortcuts, this ain't it.

Make sure you come for the right reasons and are willing to go the distance.
Read 12 tweets
10 Jun
Machine learning superpower: Be the one that makes better predictions.

I can teach you how to do this by putting 2+ models together.

Thread: beginner-friendly introduction to Ensembles.

• What are they?
• Why do they work?
• Real-life examples.
• Practical tips.

A group of models working together is called an "ensemble."

Instead of using a single model, you could build 2 different models and have them vote to select the best answer.

You could also build 3, 4, or however many models you want.

This is powerful.
Here is the surprising part:

Ensembles usually perform better than all of their individual models.

Let's look at one example.
Read 22 tweets
9 Jun
There are thousands of machine learning algorithms out there, but that's mostly noise.

You'll rarely need more than a handful.

A good start:

• Linear/Logistic Regression
• Decision Trees
• Neural Networks
• XGBoost
• Naive Bayes
• PCA
• KNN
• SVM
• t-SNE
Why these 9 instead of your favorite ones?

No specific reason. Your list will certainly include algorithms that I haven't even heard about.

But you must start somewhere, and these are certainly a good foundation.
In case you are curious, I also studied the following algorithms in the first couple of years of starting with machine learning:

• Random Forest
• AdaBoost
• K-Means
• Expectation Maximization
• Simulated Annealing
• Genetic Algorithms
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(