Rob
Aug 5, 2022 · 10 tweets · 6 min read
Learning data science is fun, so why do we always use the same boring datasets? It's common to see projects using the iris, cars, or titanic data. Stand out! Check out these 9 datasets I created on #kaggle, perfect for a unique portfolio project. #datascience #datasets🧵👇
1. MrBeast YouTube Stats

Includes metadata for every MrBeast YouTube video: title, description, view counts, comment counts, likes, AND thumbnails. Updated daily so you can track this viral sensation’s video trends over time.

🔗 kaggle.com/datasets/robik…
2. Workplace Injury Data

A dataset of over 200k OSHA-reportable injuries spanning 5 years. Do some investigative data science to see which industries produce the most injuries and which companies keep their employees safe.

🔗 kaggle.com/datasets/robik…
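
A minimal sketch of that kind of tally, assuming a hypothetical CSV with an `industry` column (the real file's column names may differ, so check the dataset page first). The three inline rows are made-up placeholders, not values from the dataset.

```python
import csv
import io
from collections import Counter

# Placeholder rows standing in for the real CSV; column names are assumptions.
SAMPLE_CSV = """industry,company
Construction,Acme Builders
Warehousing,Example Logistics
Construction,Sample Contracting
"""

def injuries_by_industry(csv_file, column="industry"):
    """Count how many injury records fall under each industry."""
    reader = csv.DictReader(csv_file)
    return Counter(row[column] for row in reader)

counts = injuries_by_industry(io.StringIO(SAMPLE_CSV))
for industry, n in counts.most_common():
    print(industry, n)
```

To run it on the downloaded file, pass an open file handle instead of the `io.StringIO` placeholder and set `column` to whatever the real header is called.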
3. Roller Coaster Metadataset

This dataset contains metrics for ~1,000 different roller coasters from around the world. Includes tons of metrics like top speed, number of flips, year built, and even lat/lon locations so you can plot them on a map!

🔗 kaggle.com/datasets/robik…
4. TextOCR Dataset

Want to beef up your computer vision skills? This is the perfect dataset, with roughly 1M high-quality word annotations on images. Train a custom model capable of OCR text extraction.

🔗 kaggle.com/datasets/robik…
5. Eye State Classification Dataset

This is the perfect dataset to learn binary classification. See if you can create an algorithm that uses EEG measurements to tell if the subject’s eyes are open or closed.

🔗 kaggle.com/datasets/robik…
6. PGA Tour Golf Data

This dataset contains results from all major PGA events going back to 2015. Once you give it a try, you won’t be able to stop yourself from yelling “FORE!”

🔗 kaggle.com/datasets/robik…
7. Monthly measurements of Zillow’s home values for each US state going back to 2000. This small dataset is perfect for any beginner interested in working with time series data.

🔗 kaggle.com/datasets/robik…
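
One common first step with monthly data like this is a year-over-year change. Here is a rough sketch in plain Python, assuming you have already pulled one state's values into a chronologically ordered list. The function name and the sample series are hypothetical, and the numbers are placeholders rather than real Zillow values.

```python
def year_over_year_change(monthly_values):
    """Return the percent change versus the same month one year earlier.

    monthly_values: numbers in chronological order, one per month.
    The first 12 entries have no prior-year value, so they map to None.
    Assumes no value is zero (true for home prices).
    """
    changes = []
    for i, value in enumerate(monthly_values):
        if i < 12:
            changes.append(None)
        else:
            previous = monthly_values[i - 12]
            changes.append((value - previous) / previous * 100)
    return changes

# Made-up placeholder series (13 months), not real Zillow numbers.
example = [100.0] * 12 + [110.0]
yoy = year_over_year_change(example)
print(yoy[-1])
```

Comparing each month to the same month a year earlier sidesteps seasonal swings, which is why it tends to be a gentler starting point than month-over-month changes.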
8. Annotated Car Driving Footage

Multi-object tracking is one of the most cutting-edge fields in computer vision. This dataset provides video footage of cars driving through cities, with labels for every car, pedestrian, and stop light.

🔗 kaggle.com/datasets/robik…
9. Historic Global Exchange Rates

This dataset is updated daily with exchange rates from around the world. Do some data exploration to see if you can find unique trends related to world events.

🔗 kaggle.com/datasets/robik…
