Joshua Ebner Profile picture
Follow me to learn how to use AI, machine learning, and data to grow your business. Ex @Apple, @Cornell physics, 20 years in the data industry.
Feb 23, 2022 19 tweets 8 min read
7 Secrets To Help You Master Python Data Science

🧵

#Python #datascience #dataanalytics 1. Focus on Foundations ImageImage
Feb 21, 2022 14 tweets 13 min read
How to Make Small Multiple Charts in Python, with Plotly

🧵

sharpsightlabs.com/blog/plotly-sm…

#datascience #datavisualization #Python #pythonlearning Remember: small multiple charts break out a visualization by a categorical variable.

We take a simple chart and break it out into panels.

#Datavisualization #datascience
Jan 12, 2022 23 tweets 22 min read
How to Create Small Multiple Charts in Python, with Plotly

🧵[1/23]

sharpsightlabs.com/blog/plotly-sm…

#python #datascience #pythoncode #datavisualization [2/]

Before I get into the mechanics of how to create a small multiple charts in Python, let me quickly explain why they are so important.
Jan 11, 2022 42 tweets 19 min read
How to Do a Data Analysis

🧵[1/42]

#datascience #DataAnalytics #Python #rstats [2/42]

When you do data analysis, you first need to start by clarifying objectives.

Why are you doing the analysis?
What’s the end goal? (e.g., the thing you’re trying to improve, understand, etc)

#datascience #DataAnalytics #data
Jan 10, 2022 30 tweets 12 min read
Why You're Very Likely to Become A Millionaire in Data Science or Machine Learning

🧵[1/n]

#datascience #jobs #money #machinelearning [2/n]

The reasons that you're likely to become a millionaire in data science:

1. salaries are already high in 2021
2. competition for high salaries is weaker than you think
3. salaries are likely to increase in the 2020s

Let's look at each of these.

#datascience #money #jobs
Dec 30, 2021 19 tweets 8 min read
Merging two or more datasets is extremely important in data science.

Here's a quick thread that covers the basics of data merges in Python.

🧵[1/19]

#Python #datascience #DataAnalytics [2/19]

In Python ...

You can combine two Pandas dataframes using the "merge" function.

You can also use the "join" function (which defaults to joining on the index)

#Python #datascience #DataAnalytics
Dec 29, 2021 6 tweets 3 min read
In Python ...

You can combine Numpy arrays vertically or horizontally using np.concatenate

#Python #pythoncode #datascience The first argument to the function is a list (or collection) of arrays that you want to combine.

You can actually combine many arrays ...just put them inside the list.
Dec 29, 2021 4 tweets 2 min read
In Matplotlib ...

You can get the RGBA representation of a color with the to_rgba() function.

#Python #pythoncode #datavisualization You'll notice that the output of to_rgba is a tuple with four floats: (%red, %green, %blue, alpha)

#Python
Dec 28, 2021 8 tweets 7 min read
In Python, you can visualize images with the Plotly IMshow function.

🧵[1/8]

sharpsightlabs.com/blog/plotly-im…

#Python #pythonlearning #datascience #datavisualization [2/8]

You can use Plotly IMshow for a few uses.

You can use it to plot heatmaps ...

But you can also use it to plot images.
Dec 28, 2021 6 tweets 4 min read
In Python ...

You can use the Pandas dropna method to drop rows with missing values.

#Python #pythoncode #datascience ImageImageImageImage As seen above, you can limit dropna to specific columns with the 'subset=' parameter.

With 'subset=', you can specify the columns in which dropna will look for missing values

#Python #pythonlearning #datascience
Dec 28, 2021 5 tweets 4 min read
In Python ...

You can use Numpy all to test conditions about the properties of a Numpy array.

#Python #pythoncode #datascience ImageImageImageImage ☝️

So for instance, in the example above, I test if all of the values are greater than 2, by column.

#Python
Aug 31, 2021 11 tweets 5 min read
The big thing that I'd change here is the color palette.

This color palette is hard to interpret and frankly, just look a little ugly.

#datascience #DataVisualization

[1/11] [2/11]

The fix here is pretty simple.

The data are sequential in nature. There's a low and a high.

When you have sequential data, you should almost always look at sequential color palettes.

Aug 30, 2021 24 tweets 13 min read
If you want to create great data visualizations, you need to understand color palettes.

Here are a few quick tips:

[1/n]

#datascience #datavisualization #Python #rstats [2/n]

For data that has a sequential ordering (i.e., low to high), you should use sequential color scales.

matplotlib.org/stable/tutoria…

#Python #matplotlib Image
Aug 28, 2021 5 tweets 4 min read
@JoshuaSteinman My understanding based on some research last year and beyond, is that these are deep-water.

... and there's possibly more ports with shallow depth @JoshuaSteinman For example, @PeterZeihan wrote that Texas has "thirteen world-class deepwater ports"

amzn.to/3BihKN6
Aug 28, 2021 5 tweets 3 min read
@JoshuaSteinman Regarding: An American Shenzhen

There's a *lot* of good ports along the Texas coast, and I think much of it under-used.

Great for logistics into the American Heartland, and also into LatAm and Mexico. @JoshuaSteinman The Texas/Mexico combo provides a unique mix of high-skill, medium-skill, and low-skill labor.

High end design and MFG in TX, lower skill MFG and assembly in MX.
Aug 27, 2021 14 tweets 7 min read
If you want to master data science in Python, you need to learn Pandas method chaining.



[thread: 1/14]

#data #datascience #Python [2/14]

Pandas method chains enable you to combine together several individual Pandas techniques in complex ways.
Aug 26, 2021 9 tweets 6 min read
How to Add New Variables to a Python Dataframe

sharpsightlabs.com/blog/pandas-as…

[Thread: 1/9]

#data #datascience #Python Image [2/9]

There are several ways to add a variable to a Python dataframe ...

But my preferred way is the Pandas "assign" method.
May 3, 2020 15 tweets 12 min read
If there's a large migration of talented people from SF and NYC to Austin, Austin has a shot at being the next Silicon Valley.

#SF #NewYork #SiliconValley #technology #Austin #Texas

(Thread) 2/n

Remember: Texas actually has a long tradition of innovation.

For example, the integrated circuit was invented at Texas Instruments.

en.wikipedia.org/wiki/Texas_Ins…

#technology #tech #Texas ImageImage