Selçuk Korkmaz, PhD Profile picture
May 6 10 tweets 4 min read Twitter logo Read on Twitter
🧵1/10: Systematic vs. Random Error 🎯

Welcome to this thread where we'll explore the differences between systematic and random error, two types of error that can impact the accuracy and precision of your data. Let's dive in! #Statistics #DataScience Image
🧵2/10: Error in Measurements 📏

In any measurement process, there's a possibility of errors occurring. Understanding the types of errors that can arise helps us to design experiments that minimize their impact and improve the quality of our results. #DataScience
🧵3/10: Systematic Error 📐

Systematic errors, or biases, are consistent and reproducible inaccuracies that occur in the same direction every time. These errors can be due to faulty equipment, incorrect calibration, or even observer bias. #DataScience
🧵4/10: Identifying Systematic Error 🔍

To identify systematic errors, compare your measurements to known values or perform repeated measurements. If the error consistently occurs in the same direction, it's likely a systematic error. #DataScience
🧵5/10: Correcting Systematic Error ⚙️

To correct systematic errors, identify the source of the error and take appropriate action, such as calibrating your equipment, using a different measuring technique, or providing better training for observers. #DataScience
🧵6/10: Random Error 🎲

Random errors are unpredictable fluctuations that occur in both directions (positive or negative) and vary between measurements. They can be due to factors such as instrument noise, environmental conditions, or human error. #DataScience
🧵7/10: Identifying Random Error 🔍

Random errors can be identified by analyzing the spread or dispersion of your data. If the data is scattered and there's no consistent pattern or direction, the errors are likely random. #DataScience
🧵8/10: Minimizing Random Error 🛡️

To minimize random errors, increase your sample size or take multiple measurements and average the results. This will help to cancel out the random fluctuations and improve the precision of your data. #DataScience
🧵9/10: Accuracy & Precision 🌟

•Accuracy refers to how close a measurement is to the true value.
•Precision refers to how close repeated measurements are to each other.
•Systematic errors impact accuracy, while random errors impact precision. #DataScience
🧵10/10: Understanding the differences between systematic & random errors helps improve the quality of your data. Minimize systematic errors by identifying and correcting their sources, and reduce random errors by increasing sample size or averaging multiple measurements.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Selçuk Korkmaz, PhD

Selçuk Korkmaz, PhD Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @selcukorkmaz

May 7
1/ 📊📈 Let's dive into the fascinating world of #statistics and explore two key concepts: Odds Ratio and Relative Risk! Understanding the differences and applications of these two measures is crucial for interpreting study results and making informed decisions. #DataScience Image
2/ 🎲 Odds Ratio (OR): The Odds Ratio is a measure of association between an exposure and an outcome. It represents the odds of an event occurring in one group compared to the odds in another group. OR is particularly useful in case-control studies. #DataScience
3/ 🌡️ Relative Risk (RR): Also known as Risk Ratio, RR is the ratio of the probability of an event occurring in the exposed group to the probability of the event occurring in the non-exposed group. RR is often used in cohort studies to assess risk. #DataScience
Read 8 tweets
May 7
1/ 📊📏 Let's dive into the world of #statistics & explore the Levels of Measurement! Understanding these levels is crucial for choosing the right statistical methods for data analysis. Today, we'll cover the 4 main levels: Nominal, Ordinal, Interval, and Ratio. #DataScience Image
2/ 🏷️ Nominal Level: At this level, data is purely qualitative and categorical. There's no inherent order or ranking involved. Examples include colors, genders, or nationalities. It's important to note that mathematical operations like addition or subtraction don't apply here.
3/ 🥇🥈🥉 Ordinal Level: This level involves data that has an inherent order or ranking, but the difference between categories is not uniform. Examples include survey responses (Strongly Disagree to Strongly Agree) or educational levels (elementary, high school, college).
Read 9 tweets
May 5
🧵1/8 🎲 Ever heard of the Monte Carlo Simulation?

It's a powerful mathematical technique used to model complex systems, make predictions, and optimize decision-making. Let's dive into this fascinating world! #MonteCarloSimulation #Statistics #DataScience
🧵2/8 How does it work? 🤔

Monte Carlo Simulation uses random sampling and statistical models to estimate unknown values. It simulates a system multiple times with different random inputs and aggregates the results to produce predictions. #RandomSampling #DataScience
🧵3/8 Applications 💼

From finance to engineering, Monte Carlo Simulation is used across many fields. It helps with risk analysis, portfolio optimization, and even predicting the weather. The versatility of this method is truly remarkable. #DataScience
Read 8 tweets
Apr 28
1/10: 🧪📊 Introducing Generalized Linear Models (GLMs) and how to perform them using R! A thread. #GLM #Rstats #DataScience Source: https://towardsdata...
2/10: 💡GLMs are a general class of regression models that extend linear regression, allowing for a variety of response distributions & link functions. They're used for modeling relationships between a response variable & one or more explanatory variables. #RStats #DataScience
3/10: 📐The main components of a GLM are:

Random Component: The response variable's distribution (e.g., Gaussian, Poisson, Binomial)
Systematic Component: Linear predictor (linear combo of explanatory variables)
Link Function: Connects the two components. #Rstats #DataScience
Read 10 tweets
Apr 27
🧵1/10 Multivariate Normality: A Guide to Controlling It Using the MVN Package in R 📊
Multivariate normality is a key assumption in many statistical techniques. Let’s explore how to test and control for MVN using the MVN package in R. #rstats #DataScience
cran.r-project.org/web/packages/M…
🧵2/10 Installing the MVN Package 📦
First, let's install and load the MVN package in R:

install.packages("MVN")
library(MVN)

This package offers a range of functions to assess and visualize multivariate normality. #rstats #DataScience
🧵3/10 Testing for MVN 🔎
To test your dataset for MVN, use the mvn() function. The function takes a data frame or a matrix as input and returns a list of tests, including Mardia's, Henze-Zirkler's, and Royston's tests.

# setosa subset of the Iris data
setosa <- iris[1:50, 1:4]
Read 10 tweets
Apr 27
🧵1/16 🚀 Package Exploration! 🌌 We all know popular #RStats packages like ggplot2, dplyr, and shiny, but there are tons of hidden gems 💎 in the CRAN universe waiting to be discovered! Let's explore some lesser-known packages that can supercharge your #DataScience journey! Image
🧵2/16 🌈 colorfindr: This nifty package extracts the most common colors from your images! Whether you're working with visualizations, web design, or marketing materials, colorfindr has you covered. Check it out: cran.r-project.org/package=colorf… #RStats #DataScience
🧵3/16 📦 pacman: Tired of typing install.packages() for each new package you need? pacman is here to save the day! It's a Swiss Army knife 🛠️ for package management, making installing, loading, and updating packages a breeze! cran.r-project.org/package=pacman #RStats #DataScience
Read 16 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(