Sachin Kumar
Apr 15 · 11 tweets · 3 min read
Day 32 of #100dayswithmachinelearning

Topic - Encode Numerical Features ( Binning & Binarization )

A Thread 🧵
Discretization: the process of transforming a continuous variable into a categorical variable by creating a set of contiguous intervals that span the range of the variable’s values. It is also known as “binning”, where a bin is just another name for an interval.
Benefits of Discretization or Binning:

1⃣ Handles outliers better, since an extreme value simply falls into the first or last bin.
2⃣ Improves the value spread.
3⃣ Minimizes the effects of small observation errors.
Types of Binning:
(a) Unsupervised Binning:

1⃣ Equal width binning: Also known as “uniform binning”, since all intervals have the same width. The algorithm divides the range of the data into N intervals of equal width.
2⃣ Equal frequency binning: Also known as “quantile binning”. The algorithm divides the data into N groups, where each group contains approximately the same number of values.
3⃣ K-means binning: This technique uses the K-Means clustering algorithm to place the bins.

It is mostly used when the data naturally forms clusters.
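To make the difference between the three unsupervised strategies concrete, here is a minimal sketch in plain Python. The function names, the sample data, and the convention that a value equal to a cut point goes into the upper bin are all assumptions made for illustration. The k-means part is a simple 1-D version started from evenly spaced centers, not a production implementation. In practice, scikit-learn's `KBinsDiscretizer` offers comparable `uniform`, `quantile`, and `kmeans` strategies.

```python
from bisect import bisect_right

def equal_width_edges(values, n_bins):
    """Interior cut points splitting [min, max] into n_bins equal-width intervals."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(1, n_bins)]

def equal_frequency_edges(values, n_bins):
    """Interior cut points chosen so each bin holds roughly the same number of values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // n_bins] for i in range(1, n_bins)]

def kmeans_edges(values, n_bins, n_iter=100):
    """Interior cut points from 1-D k-means: midpoints between adjacent centers."""
    lo, hi = min(values), max(values)
    # Start with centers evenly spaced over the range (an illustrative choice).
    centers = [lo + (i + 0.5) * (hi - lo) / n_bins for i in range(n_bins)]
    for _ in range(n_iter):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(n_bins), key=lambda k: abs(v - centers[k]))
            clusters[nearest].append(v)
        # An empty cluster keeps its previous center.
        new_centers = [
            sum(c) / len(c) if c else centers[k] for k, c in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    centers.sort()
    return [(a + b) / 2 for a, b in zip(centers, centers[1:])]

def assign_bins(values, edges):
    """Map each value to a bin index; a value equal to a cut point goes to the upper bin."""
    return [bisect_right(edges, v) for v in values]

data = [1, 2, 3, 4, 10, 11, 12, 13, 14, 15, 40, 41]

width_bins = assign_bins(data, equal_width_edges(data, 3))
freq_bins = assign_bins(data, equal_frequency_edges(data, 3))
kmeans_bins = assign_bins(data, kmeans_edges(data, 3))

print(width_bins)   # [0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 2, 2]
print(freq_bins)    # [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]
print(kmeans_bins)  # [0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 2, 2]
```

On this sample, equal width puts most values into the first bin because the two large values stretch the range, equal frequency forces four values into every bin, and k-means follows the three visible groups in the data.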
(b) Custom binning: Also known as “domain-based” binning. Here you use your domain knowledge of the business problem to choose the bin boundaries yourself instead of letting an algorithm derive them from the data.
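As a small illustration of custom binning, the sketch below maps ages to hand-picked groups. The cut points and labels are hypothetical and only stand in for whatever boundaries your own domain knowledge suggests.

```python
from bisect import bisect_right

# Hypothetical, domain-chosen cut points and labels (not derived from the data).
AGE_EDGES = [13, 20, 65]
AGE_LABELS = ["child", "teen", "adult", "senior"]

def age_group(age):
    """Return the label of the bin the age falls into; a cut point belongs to the upper bin."""
    return AGE_LABELS[bisect_right(AGE_EDGES, age)]

ages = [5, 13, 19, 20, 42, 65, 80]
groups = [age_group(a) for a in ages]
print(groups)
# ['child', 'teen', 'teen', 'adult', 'adult', 'senior', 'senior']
```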
▶️ Binarization: A special case of binning with only two bins. Each continuous value is converted to a binary value, either 0 or 1, usually by comparing it against a threshold.

Very useful in image processing, for example to convert a colored image (typically via its grayscale intensities) into a pure black and white image.
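A minimal sketch of binarization, assuming a threshold of 127 and a tiny grayscale "image" stored as nested lists of 0 to 255 intensities (both are illustrative choices). Values strictly above the threshold become 1 and everything else becomes 0, which is also the convention used by scikit-learn's `Binarizer`.

```python
def binarize(values, threshold):
    """Map each value to 1 if it is strictly above the threshold, else 0."""
    return [1 if v > threshold else 0 for v in values]

def binarize_image(pixels, threshold=127):
    """Apply the same threshold to every row of a grayscale image."""
    return [binarize(row, threshold) for row in pixels]

# Hypothetical 2x3 grayscale image with intensities in 0..255.
gray = [
    [0, 64, 128],
    [200, 127, 255],
]

bw = binarize_image(gray)
print(bw)  # [[0, 0, 1], [1, 0, 1]]
```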
📷 If this thread was helpful to you:

1. Follow me @Sachintukumar for daily content like this

2. Connect with me on LinkedIn: linkedin.com/in/sachintukum…

3. RT the tweet below to share it with your friends
