Learning data science on your own is tough...

...(ahem, it took me 6 years)

So here's some help.

5 Free Books to Cut Your Time In HALF.

Let's go! 🧵

#datascience #rstats #R
1. Mastering #Spark with #R

This book solves an important problem: what happens when your data gets too big?

For example, analyzing 100,000,000 time series.

You can do it in R with the tools covered in this book.

Website: therinspark.com
2. Geocomputation with #R

Interested in #Geospatial Analysis?

This book is my go-to resource for all things geospatial.

This book covers:
- Making Maps
- Working with Spatial Data
- Applications (Transportation, Geomarketing)

Website: r.geocompx.org
3. Tidy Finance with #R

What tools exist in R for #Finance?
And how do I use them?

Answers to these questions are covered in this book!

P.S. This book uses my R package, #tidyquant

Website: tidy-finance.org
4. Text Mining with R

This is a fantastic introduction to text analysis and text mining with the #tidytext R package.

This book single-handedly made me MORE CONFIDENT with text analysis.

Website: tidytextmining.com
5. #Forecasting Principles and Practice

This is the best “theory” book on #timeseries analysis and forecasting.

Topics Covered:
- ARIMA
- Exponential Smoothing
- Time Series Decomposition
- A lot more!

Website: otexts.com/fpp3/
1-Dollar Bonus Book:

This is massive value: it gives you a complete plan for EVERYTHING you need to know about learning data science.

It's only a buck.

And it will cut 2-3 years off your journey.

Website: learn.business-science.io/if-i-had-to-le…
Want even more help becoming a 6-figure data scientist?

I have a free workshop that will help you become a $100K+ earner as a #DataScientist, even in a recession.

👉Register Here: us02web.zoom.us/webinar/regist…
