Joshua Ebner Profile picture
Feb 23 19 tweets 8 min read
7 Secrets To Help You Master Python Data Science

🧵

#Python #datascience #dataanalytics
1. Focus on Foundations ImageImage
Many people try to start with the coolest, sexiest topics (like machine learning).

You'll do much better if you focus on foundational topics like data wrangling, data visualization, and data analysis.

#datascience #Python #datavisualization
2. Identify the Most Important Techniques ImageImageImage
Next, find the most important, most commonly used techniques.
This is ultimately an application of the 80/20 rule:

Find the 20% of techniques that account for 80% of your data science work.

#datascience #Python
3. Identify the Most Commonly Used Parameters ImageImage
This is sort of another application of the 80/20 rule.

Most parameters are rarely used.

Find the few parameters that are commonly used, and focus on learning those.
4. Break Everything Down into Simple Units Image
Now, you need to break everything down. Break down syntax into function names, method names, parameters, and other keywords.

Break everything down into the "minimal learnable units" (MLUs) that can be practiced.
5. Practice the Minimal Learnable Units ImageImage
Next, you practice. Find a way to quiz yourself on the MLUs to see if you can recall them.

Quiz yourself on the function names and parameters.
6. Repeat Your Practice Over Time ImageImage
Next, you need to repeat your practice over time.

Your memory will strengthen if you repeatedly practice syntax over days, weeks and months.

If you do this right, you'll eventually memorize the syntax.
7. Reintegrate the Pieces into a Coherent Whole ImageImage
Finally, you need to put the pieces back together.

Real data science work is done with multiple functions, methods, and techniques.

You need to learn how to put together small tools in complex ways to accomplish tasks.
So to sum up:

1. Focus on foundations
2. Identify most important techniques
3. Identify most important parameters
4. Break everything down
5. Practice
6. Repeat
7. Reintegrate
You can read the full article on which this thread was based here:

sharpsightlabs.com/blog/7-secrets…

#data #datascience #Python
And to learn more about data science, follow me here: @Josh_Ebner

Every day, I post tutorials and threads about how to learn data science in Python and R.

#datascience #DataAnalytics #Python #rstats

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Joshua Ebner

Joshua Ebner Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @Josh_Ebner

Feb 21
How to Make Small Multiple Charts in Python, with Plotly

🧵

sharpsightlabs.com/blog/plotly-sm…

#datascience #datavisualization #Python #pythonlearning
Remember: small multiple charts break out a visualization by a categorical variable.

We take a simple chart and break it out into panels.

#Datavisualization #datascience
There are a variety of tools to make small multiple charts in Python, including the Seaborn FacetGrid technique.

#Python #Datavisualization #datascience

Read 14 tweets
Jan 12
How to Create Small Multiple Charts in Python, with Plotly

🧵[1/23]

sharpsightlabs.com/blog/plotly-sm…

#python #datascience #pythoncode #datavisualization
[2/]

Before I get into the mechanics of how to create a small multiple charts in Python, let me quickly explain why they are so important.
[3/]

Small multiple charts are one of my favorite chart types.

They are very powerful, and also highly under-used.

#datascience #dataanalytics #datavisualization
Read 23 tweets
Jan 11
How to Do a Data Analysis

🧵[1/42]

#datascience #DataAnalytics #Python #rstats
[2/42]

When you do data analysis, you first need to start by clarifying objectives.

Why are you doing the analysis?
What’s the end goal? (e.g., the thing you’re trying to improve, understand, etc)

#datascience #DataAnalytics #data
[3/42]

To do this in a business setting, you’ll typically talk with stakeholders, business partners, and other team members who are familiar with the subject of the analysis.
Read 42 tweets
Jan 10
Why You're Very Likely to Become A Millionaire in Data Science or Machine Learning

🧵[1/n]

#datascience #jobs #money #machinelearning
[2/n]

The reasons that you're likely to become a millionaire in data science:

1. salaries are already high in 2021
2. competition for high salaries is weaker than you think
3. salaries are likely to increase in the 2020s

Let's look at each of these.

#datascience #money #jobs
[3/n]

Let's start with current salaries.

According to Kaggle, the median salary for a US Data Scientist in 2021 is close to $200,000.

kaggle.com/kaggle-survey-…

#datascience #data #jobs
Read 30 tweets
Dec 30, 2021
Merging two or more datasets is extremely important in data science.

Here's a quick thread that covers the basics of data merges in Python.

🧵[1/19]

#Python #datascience #DataAnalytics
[2/19]

In Python ...

You can combine two Pandas dataframes using the "merge" function.

You can also use the "join" function (which defaults to joining on the index)

#Python #datascience #DataAnalytics
[3/19]

When you merge dataframes, you'll typically have a so-called "key" variable.

This is the variable upon which you'll join the dataframes.

#Python #datascience
Read 19 tweets
Dec 29, 2021
In Python ...

You can combine Numpy arrays vertically or horizontally using np.concatenate

#Python #pythoncode #datascience
The first argument to the function is a list (or collection) of arrays that you want to combine.

You can actually combine many arrays ...just put them inside the list.
The axis parameter controls the direction along which you combine the arrays.

For 2D arrays ...

'axis = 0' combines vertically
'axis = 1' combines horizontally
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(