Chelsea Parlett-Pelleriti Profile picture
Aug 9, 2023 14 tweets 6 min read Read on X
📚 A Few of My Favorite Data Science Related Textbooks

(I use the term textbook loosely)

Here are some (non-exhaustive) of my favorites: https://t.co/BCgfQadCmN
📕 Elements of Statistical Learning

This book was the BIBLE in Grad School. It’s incredibly in depth and dense, but not so much that you can’t get through it. It’s comprehensive, well written, and is my go to reference to understand a ML algo more deeply😍 The book “the Elements of Statistical Learning”
📗Introduction to Statistical Learning with Applications in R

A gentler cousin of ESL, ISLR (and now ISLP!) is an great intro to ML algos. This book can be appreciated by undergrads, grads, and industry workers alike. The code examples are incredibly useful, and text is clear The book “an introduction to Statistical Learning”
(I dont currently assign a required textbook in my course, but if I ever do, it’s ISLP 100%)
📘Deep Learning

Not for the faint of 🫀! Like ESL, DL is a DENSE BOOK. There are chapters I’ve read half a dozen times and I’m still learning more each time. This book saved my 🍑 in grad school over and over. And I go to it often when prepping my deep learning course. The book “Deep Learning”
📙Pattern Recognition and Machine Learning

This book DIGS INTO the math, and yet remains clear and has a great flow. ML Math can be intimidating but this book takes you by the hand and says “look, you have to learn this but I’m going to help” The book Pattern Recognition and Machine Learning
📕Generative Deep Learning

Unlike the previous entries, this book isn’t a comprehensive reference book, it’s a gentle and clear intro to generative deep learning models. I read it to recommend to my undergrads and it’s perfect for them. It’s clear and has great figures. The book Generative Deep Learning (from O’Reilly)
📗Tidy Modeling With R

Julia Silge and @topepos have DONE IT AGAIN. I had the honor of tech reviewing this book and it is gold cover to cover. If you’re doing ML in R, you must read this book. They explain things so well, and their graph game is unmatched. Just, gorgeous content The book Tidy Modeling With R
@topepos 📘Statistical Rethinking

My emotional support textbook 🥹 @rlmcelreath has a way of writing that makes you EXCITED about what he’s saying. From basic causality to Bayesian stuff, he makes you go “i can’t wait to use this tool”. Bayesians and non-Bayesians alike will enjoy. The book Statistical rethinking
@topepos @rlmcelreath 📙Regression and Other Stories

This book is such a joy to read. You want to deeply understand regression? This is the book. I often go to this when consulting because it helps me explain models to my clients😍 these are the ideas and models that social scientists NEED to know The book Regression and Other Stories
@topepos @rlmcelreath 📕Regression Modeling Strategies

@f2harrell is the G.O.A.T. This book is like an encyclopedia for all the regression models you’re dying to use in your work. From ordinal models, to survival analysis…this book has it all. And endless case studies to see them in action. The book “Regression Modeling Stratégies”
@topepos @rlmcelreath @f2harrell 📕How to Ace Calculus: the Streetwise Guide

This book saved me in college calculs *mumbles* years ago. It’s silly, it’s clear, it makes learning calculus few FUN. This type of book really molded my scicomm style🥹 The book How to Ace Calculus the Streetwise guide
@topepos @rlmcelreath @f2harrell 📗Bayesian Data Analysis

This book is Jam packed with Bayesian goodness. It’s not the first book I’d give to a newbie, but it’s the book I think every practitioner should have on their desk if they’re going to use Bayesian Models. It has everything you want to know and 10% more Image
@topepos @rlmcelreath @f2harrell 📘Think Like a Progammer

If you’re learning C++ but are already familiar with a higher level language like python or R, this will help you wrap your brain around some lower level concepts (wtf is a pointer and how does it relate to an asterisk?!?) The book Think Like a Programmer

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Chelsea Parlett-Pelleriti

Chelsea Parlett-Pelleriti Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ChelseaParlett

Apr 10
🪧Solution Thread🪧

We want a confidence interval for a two-sample z test! We're interested in the difference between replication rates for projects with and without open data.

This question has answer choices, so let's talk about my FAVORITE WAY to save time on these Qs!

2 sample proportion  James is interested in whether projects on OSF (an open science platform) with open data (data that is available for download) are more replicable than projects without open data. He collects separate random samples of projects from OSF. Here are the results:  (a 2x2 table is show with counts of replication and whether or not a paper had open data. The patterns show that open data papers tend to replicate more often).   James wants to use these results to construct a 95% confidence interval to estimate the difference in the proportion of projects that reproduce (open - ...
Answer choices. The answer choices vary in their critical z value, and the way the standard error is calculated. The correct choice is A.
FIRST: take a look at what's different between the answer choices. Here, the differences are:

- the critical z value
- the formula for calculating the standard error

so we don't even need to look at the other parts, they're all the same!
Let's start with the critical z value. The question asks for a 95% confidence interval. Either using our memory, or the z table given to us on the AP test, we can figure out that the *correct* critical z value is: 1.96.

So we can cross off answer choices B and D.
Read 7 tweets
Jul 15, 2022
United healthcare’s student health insurance was a HUGE stressor during grad school🙃

I had to float large sums of money waiting for reimbursement (which they messed up ~40% of the time at first), spend hours on the phone with them, and had a hard time finding a therapist😤
But I’m glad they made SOOOOO MUCH money 🙄

Healthcare is a right, and shouldn’t be tied to school/employment. It should be accessible, largely free, and it should focus on helping people NOT MAKING PROFITS.

This stuff pisses me off.
And while I’m on the subject, MAKE MENTAL HEALTHCARE MORE ACCESSIBLE.

Therapists deserve to make a living wage, AND clients who can’t afford it should have to spend $100s of dollars a month for therapy/meds.
Read 5 tweets
Jul 31, 2021
Another #ChelseaExplains 🧵 (trying to start with simpler topics).

Today that's 💫Conjugate Priors💫
First, PRIORS. In Bayesian Statistics, we use probability distributions (like a normal, Cauchy, beta...) to represent uncertainty about the value of parameters.

Instead of choosing ONE number for a param, a distribution describes how likely a range of values are A distribution with the x-axis label "Possible Paramete
Bayesian Stats works by taking previously known, PRIOR information (this can be from prior data, domain expertise, regularization...) about the parameter

and combining it with data to make the POSTERIOR (the distribution of parameter values AFTER including data)
Read 11 tweets
Jul 30, 2021
True to my word, the best statistical model, a Thread🧵

As an applied statistician and freelance consultant, I work a LOT with people trying to figure out the best model. Here are things I consider and ask.
1. Does the model ACTUALLY answer a question you have?

If you don’t have a question THEN THE BEST INFERENTIAL/PREDICTIVE MODEL IS NO MODEL. Do some EDA first!

Stop doing hypothesis testing if you don’t have a hypothesis to test. ✋
2. Is the model realistic?

We love prior predictive checks bc they allow u to generate data based on ur priors😻 but also ask whether ur leaving out important effects, whether the variable u use actually measures what you want, or if linear relationships are realistic…
Read 7 tweets
Jul 6, 2021
SINCE @kierisi has threatened to sarcastically/chaotically say incorrect things about p-values during #sliced tonight 😱 just to annoy people 😉,

I thought I’d do a quick thread on what a 🚨p-value🚨actually is.

🧵
(1/n)
Computationally, a p-value is p(data as extreme as ours | null). Imagine a 🌎 where the null hypothesis is true (e.g. there is no difference in cat fur shininess for cats eating food A vs. food B for 2 weeks), and see how extreme your observed data would be in that 🌎 (2/n)
So, you can think of the p-value as representing our data’s compatibility with the null hypothesis. Low p-values mean our data is not very likely in a world where the null is true. High p-values mean it is relatively likely. (3/n)
Read 20 tweets
Nov 20, 2020
⚠️ SO YOU WANT TO BE A BAYESIAN⚠️ :

(since I compiled a quick list of Bayesian resources today, I figured I should share; these are just my opinion!)
📚 Beginner book (R): Bayesian Statistics the fun way —amazon.com/dp/B07J461Q2K/…

📚 Beginner-Intermediate book (R + JAGS + Stan): Doing Bayesian Data Analysis: amazon.com/Doing-Bayesian…
📚 Intermediate book (R + Stan) : Statistical Rethinking—
amazon.com/Statistical-Re…

📚 Intermediate/Advanced Book (R + Stan) : Bayesian Data Analysis (amazon.com/Bayesian-Analy…)
Read 9 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(