Levi Profile picture
Oct 21 9 tweets 3 min read Twitter logo Read on Twitter
How to describe distribution?

Skewness 🧵 https://www.allaboutcircuits.com/technical-articles/understanding-the-normal-distribution-parametric-tests-skewness-and-kurtosis/
Skewness is a measure of the asymmetry of a distribution.

Distribution is symmetric if it looks the same to the left and right of the center point.

Skewness differentiates extreme values in one versus the other tail.

1/8
The formula to get the skewness:

2/8 Image
Left skew / Negative skew:

Negative values for the skewness indicate data that are skewed left - the left tail is long relative to the right tail.

The mean of a left-skewed distribution is almost always less than its median.

3/8 Image
Right skew / Positive skew:

Positive values for the skewness indicate data that are skewed right - the right tail is long relative to the left tail.

Here the mean is almost always greater than the median.

Because extreme values affect the mean more than the median.

4/8 Image
Zero skew:

When a distribution has zero skew, it is symmetrical. Its left and right sides are mirror images.

The skewness for a normal distribution is zero.

The mean = median

5/8 Image
Pandas has a built-in method to calculate the skewness of the data.

Since the value is negative, the data is skewed to the left - the left tail is slightly longer than the right tail.

6/8 Image
That's it for today.

I hope you've found this thread helpful.

Follow me @levikul09 for more.

Like/Retweet the first tweet below for support, Thanks 😉

7/8
If you haven't already, join our newsletter DSBoost.

We share:

• Interviews

• Podcast notes

• Learning resources

• Interesting collections of content



8/8dsboost.dev

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Levi

Levi Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @levikul09

Oct 18
Backpropagation is fundamental in ANNs

But how does it work?

I will explain it now visually.

1/9 Image
Consider this:

We have a model that you try to fit to the points. This model is a result of a NN.

It is clear that this NN provides a bad fit, so we need to adjust the parameters.

How can we do that?

Let's see step by step.

2/9 Image
1. We calculate the errors.

Comparing the observed values with the predicted values will give us the errors.

If we sum the errors we can plot them 🔽

(Note: this is not a thread on error metrics you can think of any of them.)

3/9 Image
Read 9 tweets
Oct 15
6 Statistical and Machine Learning pitfalls.

Avoid these traps to be a better data person.

1/9 Gutman, Alex J., and Jordan Goldmeier. Becoming a Data Head : How to Think, Speak, and Understand Data Science, Statistics, and Machine Learning. Indianapolis, Indiana, John Wiley & Sons Inc., 2021.
1️⃣ Correlation = Causation

They are related, it is crucial to understand that correlation does not imply causation!

We cannot measure causation statistically!

Resist the temptation to build a causal narrative around correlated variables.

2/9
2️⃣ P-hacking

Statistically significant results do not always imply real-life significance.

Studies can manipulate or selectively analyze data in order to obtain statistically significant results.

It is important to follow transparent and rigorous research practices.

3/9
Read 9 tweets
Oct 14
Regression to the mean is often misunderstood.

It refers to extreme events followed by average outcomes.

But it's not just randomness.

Let me explain

🧵 Image
Do you know what the "Madden Curse" is?

Madden is an American football video game.

The game usually features the best players from the NFL on the cover.

For most players being on the cover means the next season will be worse.

Why?

1/10 Image
Of course, there is no curse at all.

It is because of the regression to the mean.

When variables are significantly higher or lower than average on the first measurement, they move closer to the average on the second measurement.

2/10
Read 11 tweets
Oct 13
Weights and Biases are the engines in Neural Networks.

I will explain how they work.

1/8 Image
When data is flowing between different neurons or layers, it is not just going from A to B.

Different transformations happen to them.

These transformations are described with Weights and Biases.

Let's discuss each 🔽

2/8
1️⃣ Weight

Weights determine how important each factor is in the overall prediction.

This value will determine the influence input data has on the output product.

They work similarly as in weighted means: The input is multiplied by the weights.

3/8 Image
Read 8 tweets
Oct 12
Free Datasets from tech giants.

Microsoft, Google, IBM, Amazon...

Their valuable data is just a few clicks away.

Find the links below 👇 Image
1️⃣ Registry of Open Data on AWS

This registry exists to help people discover and share datasets that are available via AWS resources.

registry.opendata.aws
Image
2️⃣ Data Asset eXchange by IBM

developer.ibm.com/exchanges/data/
Image
Read 7 tweets
Oct 11
When you start learning Statistics you may feel there are a million things to memorize.

You're wrong.

What you need is 5-6 basic concepts. Everything else is built around them.

Here are 5+1 Statistical principles to begin with:

🧵 Image
You will never stop learning if you enjoy the journey.

I partnered with @brilliantorg for this reason. They introduced the joy of studying Data Science.

The best part?

They cover all these topics in their intuitive lessons.

Check the link in the last tweet and start learning!
1. Central tendency

Mean, mode, and median are measures that offer quick insights into the 'center' of the data.

It helps to see what a "normal" data point looks like in a group.

We hear about these measures a lot, so it's good to know them well.
Read 9 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(