Sukriti Macker
Aug 6, 2021 • 15 tweets • 4 min read
What is Accuracy in Machine Learning? Why shouldn't you use accuracy as a Performance Parameter? Is there a drawback? 🤔

A beginner-friendly explanation! 🧵👇
A compiled version of the thread: diveintodata.in/accuracy-perfo…
☑️ What are Performance Metrics?
To study the effectiveness of a product or device in any field, we use performance metrics. Additionally, we can use performance metrics to measure how a machine learning model performs. The performance metrics provide the results in numbers.
Therefore, it becomes pretty easy to understand and evaluate the performance of most machine learning models.

Let's say I tell you that a movie is rated 3 out of 5. You can directly understand that the film is okay and not great.
Therefore, when someone QUANTIFIES or puts a number value to the performance of a device, product or machine learning model, you can quickly understand whether or not it performs well. In other words, you can evaluate the product quickly by seeing a number score.
☑️ ACCURACY
Accuracy is a performance metric. It is calculated as the number of correct predictions divided by the total number of predictions.

See the formula below:-
Accuracy = Number of correct predictions / Total number of predictions
Let's say:-
There are 700 images of dogs and cats.
Out of 700, 350 images are of dogs, and 350 images are of cats.

Out of 350 dog images, the model correctly identifies 320 images
Out of 350 cat images, the model correctly identifies 310 images
Therefore, the model correctly identifies 320 as dog images and 310 as cat images. Thus, the model correctly identifies 630 images.

Now, according to the formula, the accuracy of the model is 630 / 700 = 0.9, or 90%.
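The arithmetic above can be checked with a few lines of Python. This is only a minimal sketch: the function name `accuracy` and the variable names are made up for illustration and do not come from any library.

```python
def accuracy(correct_predictions, total_predictions):
    """Accuracy = correct predictions / total predictions."""
    if total_predictions <= 0:
        raise ValueError("total_predictions must be positive")
    return correct_predictions / total_predictions


# Numbers from the dog/cat example in the thread
correct_dogs = 320   # dog images identified correctly (out of 350)
correct_cats = 310   # cat images identified correctly (out of 350)
total_images = 700

score = accuracy(correct_dogs + correct_cats, total_images)
print(f"Accuracy: {score:.2f} ({score:.0%})")
```

The correct predictions for both classes are added first (320 + 310 = 630) and then divided by all 700 images.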
IMPORTANT OBSERVATION:-
The accuracy metric works best when the dataset is balanced. It is not reliable for an imbalanced dataset.
☑️ Balanced vs Imbalanced Dataset

-> Balanced Dataset:-
The dataset is balanced when the target labels are nearly equal or equal in number in a classification problem. For example, out of the 700 images, if 350 are dog images and 350 are cat images, this is a balanced dataset.
-> Imbalanced Dataset
We say the dataset is imbalanced when the target labels in a classification problem are far from equal in number.
In other words, one class has many more examples than the other.
For example, out of 700 dog and cat images, there are only 10 dog images and 690 cat images. This is an imbalanced dataset.
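One rough way to see whether a dataset is balanced is to count how often each label appears. The sketch below assumes the labels are held in a plain Python list; the helper name `class_shares` is hypothetical, chosen only for this example.

```python
from collections import Counter


def class_shares(labels):
    """Return the fraction of the dataset that belongs to each class."""
    counts = Counter(labels)
    total = sum(counts.values())
    return {label: count / total for label, count in counts.items()}


# The two datasets described in the thread
balanced = ["dog"] * 350 + ["cat"] * 350
imbalanced = ["dog"] * 10 + ["cat"] * 690

print(class_shares(balanced))    # both classes take an equal share
print(class_shares(imbalanced))  # dogs are only a tiny share
```

In the balanced dataset each class makes up half of the images, while in the imbalanced one the dog class is 10 out of 700 images.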
☑️ DRAWBACK - Why shouldn't you use accuracy as a performance metric?
Let us take the same imbalanced dataset example as above: 700 images of dogs and cats, of which 10 are photos of dogs and 690 are photos of cats.
Assume that our model correctly identifies 630 cat images but fails to identify any dog images. Let us calculate the accuracy for this scenario: (630 + 0) / 700 = 0.9.
Wow! We still got the same accuracy. It is 90%.
But do you think it should be 90% when our model could not identify any dog image correctly? No, right?
Therefore, when the dataset is imbalanced or skewed, we must not use accuracy to determine our model's performance.
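To make the problem visible, here is a small sketch that computes the overall accuracy next to the fraction of each class the model got right, using the numbers from this scenario. The dictionary names are illustrative assumptions, and the per-class breakdown is added here only to illustrate the point; it is not part of the original thread.

```python
# Imbalanced scenario from the thread
totals = {"dog": 10, "cat": 690}    # images per class
correct = {"dog": 0, "cat": 630}    # images identified correctly per class

# Overall accuracy: all correct predictions / all predictions
overall = sum(correct.values()) / sum(totals.values())

# Fraction of each class identified correctly
per_class = {label: correct[label] / totals[label] for label in totals}

print(f"Overall accuracy: {overall:.0%}")
for label, share in per_class.items():
    print(f"{label}: {share:.0%} identified correctly")
```

The overall number stays at 90% even though not a single dog image is identified, which is why accuracy alone can be misleading on a skewed dataset.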
