FREE data cleaning dataset for beginners in Data Analytics:
1. Invoices Data With Merged Categories and Merged Amounts.
Because a single transaction (identified by an order ID) includes multiple items, the data was initially captured with one row per order, combining items and amounts into two fields.
However, it is better to list each item and its amount on separate rows, repeating the order ID as needed.
This separation allows for more effective analysis of items and amounts.
How to get the Data:
Click the ALT on the image below and copy the link and paste in your browser to download it.
2. Hospital Data With Mixed Numbers and Characters.
Non-professionals collected this data, occasionally using letters instead of numbers (e.g., 'S' for '5').
Ensure your cleaned data matches the totals in the provided clean dataset.