Why You're Very Likely to Become A Millionaire in Data Science or Machine Learning

🧵[1/n]

#datascience #jobs #money #machinelearning
[2/n]

The reasons that you're likely to become a millionaire in data science:

1. salaries are already high in 2021
2. competition for high salaries is weaker than you think
3. salaries are likely to increase in the 2020s

Let's look at each of these.

#datascience #money #jobs
[3/n]

Let's start with current salaries.

According to Kaggle, the median salary for a US Data Scientist in 2021 is close to $200,000.

kaggle.com/kaggle-survey-…

#datascience #data #jobs
[4/n]

The above figure probably does not include stock options and stock grants, which can be very large.
[5/n]

For example, when I worked at Apple, it was common for people to receive RSU grants (restricted stock) that could be worth many multiples of salary.
[6/n]

To be clear: these stock grants typically vest over 4 years ... so you need to stay with the company to get the benefit

But, the point is that an in-demand data scientist could make close to $200,000 in salary, + stock grants that could be worth hundreds of thousands.
[7/n]

If you look at this compensation, it's easy to see how you could earn $1,000,000 as a data scientist within 5 to 10 years.

#datascience #jobs
[8/n]

And frankly, you could get to $1,000,000 in earnings faster if you are reasonably skilled at some niche topics ... particularly more advanced forms of machine learning and AI.

#datascience #machinelearning #jobs
[9/n]

For example, US data scientists in roughly the top 20% can make from $200,000 to $1,000,000 *per year*

#datascience #machinelearning #jobs
[10/n]

So if you are highly skilled ...

And you can get a job at a decent company (especially a FAANG company) ...

Then you have a good shot at making well over $200,000+ per year.
[11/n]

And if you're decent at managing your expenses and investing any extra cash, you should be able to get to $1M net worth within a few years
[12/n]

There are a couple of other things that I should point out:

1. your competition is lower than you think
2. salaries are likely to increase in the 2020s

#datascience #jobs
[13/n]

Concerning competition:

I hate to be an asshole, but most people who are competing for data science jobs are not that skilled.

I see a *lot* of data science code and at least 50% is trash.

#datascience #jobs
[14/n]

Even at "elite" companies, many people are mediocre.

Lots of people that I worked with at Apple were just phoning it in.

(Although, a few were exceptional.)
[15/n]

If you can become "fluent" in data science ... highly skilled and able to perform under time pressure

Then you'll do well against the other people who are competing for high paying jobs.

#datascience #jobs
[16/n]

Concerning salary increases:

Due to some demographic trends (both in the US, and worldwide) ...

We're likely to see salaries increase through the 2020s.

[17/n]

To summarize: the Baby Boomer generation was large compared to more recent generations.

And the vast majority of Boomers will retire by the mid 2020s.

#jobs #economics
[18/n]

As the Boomers retire, there won't be enough people to replace all of them.

This will create a labor shortage.

The labor shortage will be most acute in high-skilled domains and in management.

#Economics #jobs
[19/n]

As the supply of high-skill labor decreases, the price of high-skill labor will increase.

#Economics #jobs
[20/n]

So: the retirement of the Boomers in the 2020s will likely cause salaries to increase even more.
[21/n]

Said differently:

The demographic trends are likely to cause data science salaries to increase even more in the 2020s.
[22/n]

Now to be clear: most of the data scientists that I know are Millennials and GenX.

So the retirement of the Boomers may not directly cause a data scientist shortage.
[23/n]

But, the retirement of the Boomers will create a labor shortage in *management* roles.

#jobs
[24/n]

What that means, is that older Millennials and GenX will be moved to management.
[25/n]

So some of the high-skill data scientists from Millennial and GenX generations will be moved from front-line data science roles to management roles.

In turn, this will decrease the supply of data scientists, and therefore increase salaries.

#DataScience #jobs
[26/30]

So, data science salaries are already high, and demographic trends are likely to push them even higher in the 2020s.

... particularly for the highest-skilled individuals.
[27/30]

Let's recap:

You're likely to become a millionaire in data science because:

1. salaries are already high in 2021
2. competition for high salaries is weaker than you think
3. salaries are likely to increase in the 2020s
[28/30]

If you want to maximize your probability of reaching $1 million in data science, here's what I recommend:

1. Practice data science so you are highly skilled
2. Get into the top 50%, then top 20%, then top 5%
3. Build a network, so you have access to the best jobs
[29/30]

Each of those topics is somewhat complex, so I'll write about them another time.
[30/30]

If you want to master data science and make more money, then follow me here: @Josh_Ebner

Every day, I post threads and advice on how to master data science and machine learning in Python and R.

#DataScience #machinelearning #Python #rstats

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Joshua Ebner

Joshua Ebner Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Josh_Ebner

11 Jan
How to Do a Data Analysis

🧵[1/42]

#datascience #DataAnalytics #Python #rstats
[2/42]

When you do data analysis, you first need to start by clarifying objectives.

Why are you doing the analysis?
What’s the end goal? (e.g., the thing you’re trying to improve, understand, etc)

#datascience #DataAnalytics #data
[3/42]

To do this in a business setting, you’ll typically talk with stakeholders, business partners, and other team members who are familiar with the subject of the analysis.
Read 42 tweets
30 Dec 21
Merging two or more datasets is extremely important in data science.

Here's a quick thread that covers the basics of data merges in Python.

🧵[1/19]

#Python #datascience #DataAnalytics
[2/19]

In Python ...

You can combine two Pandas dataframes using the "merge" function.

You can also use the "join" function (which defaults to joining on the index)

#Python #datascience #DataAnalytics
[3/19]

When you merge dataframes, you'll typically have a so-called "key" variable.

This is the variable upon which you'll join the dataframes.

#Python #datascience
Read 19 tweets
29 Dec 21
In Python ...

You can combine Numpy arrays vertically or horizontally using np.concatenate

#Python #pythoncode #datascience
The first argument to the function is a list (or collection) of arrays that you want to combine.

You can actually combine many arrays ...just put them inside the list.
The axis parameter controls the direction along which you combine the arrays.

For 2D arrays ...

'axis = 0' combines vertically
'axis = 1' combines horizontally
Read 6 tweets
29 Dec 21
In Matplotlib ...

You can get the RGBA representation of a color with the to_rgba() function.

#Python #pythoncode #datavisualization
You'll notice that the output of to_rgba is a tuple with four floats: (%red, %green, %blue, alpha)

#Python
In RGBA, the alpha channel represents the opacity of the color, where:

– 0.0 is fully transparent
– 1.0 is fully opaque
Read 4 tweets
28 Dec 21
In Python, you can visualize images with the Plotly IMshow function.

🧵[1/8]

sharpsightlabs.com/blog/plotly-im…

#Python #pythonlearning #datascience #datavisualization
[2/8]

You can use Plotly IMshow for a few uses.

You can use it to plot heatmaps ...

But you can also use it to plot images.
[3/8]

The syntax for Plotly IMshow is pretty simple.

You call the function as px.imshow and then provide the name of the image file you want to visualize.

(This assumes you've imported Plotly express as px)

#Python #pythoncode Image
Read 8 tweets
28 Dec 21
In Python ...

You can use the Pandas dropna method to drop rows with missing values.

#Python #pythoncode #datascience ImageImageImageImage
As seen above, you can limit dropna to specific columns with the 'subset=' parameter.

With 'subset=', you can specify the columns in which dropna will look for missing values

#Python #pythonlearning #datascience
If it finds missing values in any of those columns, it will drop the row.

But it will ignore missing values in other columns.
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(