Discover and read the best of Twitter Threads about #dataquality

Most recents (4)

12 Best #SQL Online Course Certificate Programs for #DataScience in 2023 — compiled by @tut_ml
————
#Databases #BigData #DataScientists
mltut.com/best-sql-onlin…
7 Best Advanced #SQL Courses & Training Online You Must Know in 2023 — compiled by @tut_ml
————
#BigData #DataScience #DataScientists #Coding #Database #Analytics
mltut.com/best-advanced-…
Read 3 tweets
Hello Data Analysts,

Let’s talk about data quality.

Bad data can have a big impact on a company's bottom line.

Poor-quality data is frequently blamed for operational blunders, incorrect analytics, and poorly thought-out company initiatives.

What should be reviewed?
By measuring data quality levels, organizations can detect data errors that need to be fixed and determine whether the data in their IT systems is fit for its intended use.

1. Check for completeness and uniqueness. Is any data missing? Are any entries duplicated?

2. Check for accuracy and consistency. Are the formulas correct and consistent? Is the entered data accurate?

3. Check for conformance and validity. Does the data meet the required specifications?

4. Check for timeliness. Is the data up to date? Is it readily available?
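
The four checks above can be sketched in a few lines of plain Python. This is only an illustration, not a method from the thread: the function name `quality_report`, the field names (`id`, `email`, `qty`, `price`, `total`, `updated`), the sample rows, and the 30-day freshness window are all assumptions made up for the example. Accuracy in the strict sense needs a trusted reference to compare against, so the sketch only covers the consistency half of check 2.

```python
from datetime import date, timedelta


def quality_report(rows, required, key, validators, rules, date_field,
                   max_age_days, today):
    """Count problems per check. All parameter names are hypothetical."""
    # 1a. Completeness: required fields that are missing or empty.
    missing = sum(1 for r in rows for f in required
                  if r.get(f) in (None, ""))

    # 1b. Uniqueness: rows whose key value was already seen.
    seen, duplicates = set(), 0
    for r in rows:
        if r.get(key) in seen:
            duplicates += 1
        seen.add(r.get(key))

    # 2. Consistency: row-level rules such as total == qty * price.
    inconsistent = sum(1 for r in rows for rule in rules if not rule(r))

    # 3. Validity: non-empty values that fail their format check.
    invalid = sum(1 for r in rows for f, ok in validators.items()
                  if r.get(f) not in (None, "") and not ok(r[f]))

    # 4. Timeliness: rows last updated before the cutoff date.
    cutoff = today - timedelta(days=max_age_days)
    stale = sum(1 for r in rows
                if r.get(date_field) is not None and r[date_field] < cutoff)

    return {"missing": missing, "duplicates": duplicates,
            "inconsistent": inconsistent, "invalid": invalid, "stale": stale}


# Made-up sample data with one problem of each kind.
rows = [
    {"id": 1, "email": "a@example.com", "qty": 2, "price": 5.0,
     "total": 10.0, "updated": date(2023, 1, 10)},
    {"id": 2, "email": "", "qty": 1, "price": 3.0,
     "total": 3.0, "updated": date(2023, 1, 12)},
    {"id": 2, "email": "not-an-email", "qty": 3, "price": 2.0,
     "total": 7.0, "updated": date(2022, 6, 1)},
]

report = quality_report(
    rows,
    required=["id", "email"],
    key="id",
    validators={"email": lambda v: "@" in v},  # deliberately crude check
    rules=[lambda r: r["total"] == r["qty"] * r["price"]],
    date_field="updated",
    max_age_days=30,
    today=date(2023, 1, 15),
)
print(report)
```

Returning counts rather than raising on the first problem keeps the report usable as a quality metric that can be tracked over time.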
Read 5 tweets
Understanding data quality is crucial for reliable ML. In our #ICML2022 paper, @NabeelSeedat01, @JonathanICrabbe & @MihaelaVDS present a Data-Centric framework for the understudied problem of identifying incongruous examples of in-distribution data.

🧵1/10
TLDR.
* Do you want to know which examples will be reliably predicted, independent of the downstream predictive model?

* Do you want to get insights into your data to understand possible limitations?

If so, Data-SUITE, our new #DataCentricAI framework, is for you!

2/10
There has been a significant focus on out-of-distribution (OOD) data for reliable ML.

However, in Data-SUITE we tackle an equally important yet understudied problem.

How do we assess in-distribution data with feature-space heterogeneity?

3/10
Read 10 tweets
Tweet 🧵

1/15
Screening People with Tuberculosis for High Risk of Severe Illness at Notification: Programmatic Experience from Karnataka, India mdpi.com/1151162 in journal #TMID via @MDPIOpenAccess

#OperationalResearch led by @SharathBN and me
2/15
Before #COVID19, #TB was the leading infectious disease killer

People with TB r not systematically screened for severe illness @ diagnosis. Something we do in COVID19

To #EndTBDeaths, @TbDivision recommends assessment of severity @ diagnosis n referral 4 inpatient care
3/15
Existing #TBProgramme guidance to assess severity among people with TB requires clinical capacity and diagnostic and radiology infrastructure

This is usually not present in peripheral health institutes where patients are diagnosed
Read 15 tweets
