πŸ”₯ Matt Dancho (Business Science) πŸ”₯ Profile picture
May 10, 2023 β€’ 8 tweets β€’ 7 min read β€’ Read on X
Learning data science on your own is tough...

...(ahem, it took me 6 years)

So here's some help.

5 Free Books to Cut Your Time In HALF.

Let's go! 🧡

#datascience #rstats #R Image
1. Mastering #Spark with #R

This book solves an important problem- what happens when your data gets too big?

For example, analyzing 100,000,000 time series.

You can do it in R with the tools covered in this book.

Website: therinspark.com Image
2. Geocomputation with #R

Interested in #Geospatial Analysis?

This book is my go-to resource for all things geospatial.

This book covers:
-Making Maps
-Working with Spatial Data
-Applications (Transportation, Geomarketing)

Website: r.geocompx.org Image
3. Tidy Finance with #R

What tools exist in R for #Finance?
And how do I use them?

Answers to these questions are covered in this book!

P.S.- This book uses my R package, #tidyquant

Website: tidy-finance.org Image
4. Text Mining with R

This is a fantastic introduction to text analysis and text mining with the #tidytext R package.

This book singlehandedly made me MORE CONFIDENT with text analysis.

Website: tidytextmining.com Image
5. #Forecasting Principles and Practice

This is the best β€œtheory” book on #timeseries analysis and forecasting.

Topics Covered:
- ARIMA,
- Exponential Smoothing,
- TimeSeries Decomposition
- A lot more!

Website: otexts.com/fpp3/ Image
1-Dollar Bonus Book:

This is a massive value- Gives you a complete plan for EVERYTHING you need to know about learning data science.

It's only a buck.

And it will cut 2-3 years off your journey.

Website: learn.business-science.io/if-i-had-to-le… Image
Want even more help becoming a 6-figure data scientist?

I have a free workshop that will help you become a $100K+ earner as a #DataScientist even in a Recession.

πŸ‘‰Register Here: us02web.zoom.us/webinar/regist… Image

β€’ β€’ β€’

Missing some Tweet in this thread? You can try to force a refresh
γ€€

Keep Current with πŸ”₯ Matt Dancho (Business Science) πŸ”₯

πŸ”₯ Matt Dancho (Business Science) πŸ”₯ Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @mdancho84

May 15
Understanding P-Values is essential for improving regression models.

In 2 minutes, I'll crush your confusion.

Let's go: Image
1. The p-value:

A p-value in statistics is a measure used to assess the strength of the evidence against a null hypothesis. Image
2. Null Hypothesis (Hβ‚€):

The null hypothesis is the default position that there is no relationship between two measured phenomena or no association among groups. For example, under Hβ‚€, the regressor does not affect the outcome. Image
Read 15 tweets
May 14
Tableau is about to die.

Introducing PandasAI, a free alternative for fast Business Intelligence.

Let dive in: Image
1. PandasAI

PandaAI transforms your natural language questions into actionable insights β€” fast, smartly, and effortlessly.
2. Powerful dashboards in seconds

The problem with Tableau? Analysts have to build them from scratch.

PandasAI solves this problem making it lightning fast to create dashboards from multiple sources. Image
Read 9 tweets
May 13
🚨 BREAKING: Microsoft launches a free Python library that converts ANY document to Markdown

Introducing Markitdown. Let me explain. 🧡 Image
1. Document Parsing Pipelines

MarkItDown is a lightweight Python utility for converting various files to Markdown for use with LLMs and related text analysis pipelines. Image
2. Supported Documents

MarkItDown supports:

- PDF
- PowerPoint
- Word
- Excel
- Images (EXIF metadata and OCR)
- Audio (EXIF metadata and speech transcription)
- HTML
- Text-based formats (CSV, JSON, XML)
- ZIP files (iterates over contents)
- Youtube URLs
- EPubs Image
Read 10 tweets
May 11
The 10 types of clustering that all data scientists need to know.

Let's dive in: Image
1. K-Means Clustering:

This is a centroid-based algorithm, where the goal is to minimize the sum of distances between points and their respective cluster centroid. Image
2. Hierarchical Clustering:

This method creates a tree of clusters. It is subdivided into Agglomerative (bottom-up approach) and Divisive (top-down approach). Image
Read 14 tweets
May 9
RIP Tableau and PowerBI.

Enter Julius AI.

This is what Julius can do: Image
1. The $10 Billion problem with Tableau and PowerBI?

Dashboards are static.

But businesses are dynamic.

That's why I'm so excited about this new tool: Julius AI Image
2. Julius AI is for Data Analytics

Julius AI is built to analyze any database, PDF, or spreadsheet, and combine results into summarized business intelligence in seconds. Image
Read 11 tweets
May 9
Principal Component Analysis (PCA) is the gold standard in dimensionality reduction.

But almost every beginner struggles understanding how it works (and why to use it).

In 3 minutes, I'll demolish your confusion: Image
1. What is PCA?

PCA is a statistical technique used in data analysis, mainly for dimensionality reduction. It's beneficial when dealing with large datasets with many variables, and it helps simplify the data's complexity while retaining as much variability as possible. Image
2. How PCA Works:

PCA has 5 steps; Standardization, Covariance Matrix Computation, Eigen Vector Calculation, Choosing Principal Components, and Transforming the data.
Read 13 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(