Still responding to folks re: my transition to data science post! I'll get to everyone, promise!

Given the interest I thought people might want to know the (almost all free/low cost!) resources I used to train myself for a data science role

A (hopefully helpful) 🧵
R, Part I

My first real #rstats learning experience was using swirl. I loved that I could use it inside of R (rather than having to go back and forth between the resource and the RStudio console)

swirlstats.com/students.html
R, Part II

A cliche rec, but it's cliche for a reason. R for Data Science by @hadleywickham & @StatGarrett transitioned me from "kind of messing around" to "wow, I did that cool thing" in R. It's absolutely a steal that it's available for free

r4ds.had.co.nz
R, Part III

If you want to do machine learning at all, Tidy Modeling with R by @topepos and @juliasilge is already amazing (and the book isn't even technically done!) The tidymodels ecosystem is also 🔥

tmwr.org
R (with Git/Github), Part IV

Happy Git with R by @JennyBryan has changed my entire workflow for the better times a million. It's *the* resource for version-controlling R code effectively

happygitwithr.com
Statistical (Re)Thinking/Causal Inference

Statistical Rethinking by @rlmcelreath is hands down the best advanced stats book I've ever read. There's also a glorious amount of free resources available around it (including an entire series of lectures)

xcelab.net/rm/statistical…
Causal Inference, Part II

This blog post by @dingding_peng is still the best explanation I've ever seen of conditioning on a collider (Even if, like me until recently, you don't know what any of those words are, you care about this)

the100.ci/2017/03/14/tha…
The Data Science Field, Part I

Build a Career in Data Science podcast (free!) and book (well worth the money!) by @robinson_es and @skyetetra made me actually believe I could transition into a data science role!

I also learned a bunch + they're hilarious (links in next tweet)
The Build a Career in Data Science

Podcast: open.spotify.com/show/78Nft51Tu…

Book: manning.com/books/build-a-…
The Data Science Field, Part II

The nightly (Mon-Thurs) data science Twitch stream with @nickwan and the #SLICED data competition hosted by him and @MeganRisdal

Hands on, helpful stuff, solving real DS problems

Twitch: twitch.tv/nickwan_datasci
Sliced: notion.so/Sliced-Show-c7…
The Data Science Field, Part III

Following lots of cool people on Twitter! I particularly appreciate @asmae_toumi and @kierisi, plus their "following" lists are great for finding other awesome folks
Python, Part I

Lots of great resources here (and I admittedly know much less than I know about R)

I've used Codecademy's modules and found them really helpful, especially as someone who already knows R pretty well

codecademy.com
Python, Part II

I've found RStudio's resources on running Python code inside RMarkdown super useful!

rstudio.github.io/reticulate/art…
Ok, I might add to this list over time, but I think I'll stop here for now!

Feel free to reply with other resources you've found super useful on your data science journey

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Dr. Michael Mullarkey

Dr. Michael Mullarkey Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @mcmullarkey

28 Feb
I just found out a paper we first submitted ~3 years ago was accepted! We used an N > 1,000 sample, open data/code, and robust methods

I'm proud of this paper, and it also helped radicalize me against a lot of the stories we tell ourselves about peer review

A 🧵
The many reviews we received were almost uniformly hostile, confused, non-constructive, or some combination
The paper definitely got better throughout the process, and that had ~0 to do with the reviews

Real reason #1: A wonderful, ongoing collaboration with a stellar biostatistician/many other great collaborators

Real reason #2: I got better at coding/new tools became available
Read 22 tweets
5 Sep 19
Trying to balance:
- Having genuine empathy for people who are staring down the barrel of their life's work not replicating
- Not reinforcing power structures and practices that led to a world where those barrels are all too common
Hearing @minzlicht talk about this on the "Replication Crisis Gets Personal" @fourbeerspod episode brought home to me how lucky I am to be early in my career now as opposed to 20 or even 10 years ago
But his example* reminds me people in power have a choice when confronted with a much messier literature than initially described

They can double down, or they can engage meaningfully with a more complicated world

*And many others, my mentions aren't ever comprehensive!
Read 12 tweets
24 May 19
About to live tweet "Recent Advances in the Use of Modeling to Explain and Predict Psychological Phenomena From Nomothetic & Idiographic Perspectives" with @EikoFried @talyarkoni @DepressionLab @aaronjfisher #aps19dc

It's already won the award for longest title, so good start!
@EikoFried @talyarkoni @DepressionLab @aaronjfisher Twitter-less (I think!) Don Robinaugh and Jonas Dalege are also presenting
@EikoFried @talyarkoni @DepressionLab @aaronjfisher .@EikoFried starts us off by reminding us that psychological modeling are complex, multicausal constructs and our approaches to these constructs often don't match that complexity
Read 61 tweets
6 Feb 19
New preprint from @JSchleiderPhD & me: Emotion and anxiety mindsets share little unique variance with internalizing problems in adults once you account for hopelessness (Ns = 200, 430)

Open code & data + interpretations in this thread!

psyarxiv.com/qtrxs/
We used commonality analysis (CA), which allows us to directly examine how much predictive variance is unique & shared among predictors

This technique can help us identify important individual predictors even when they're highly correlated (A no-no in traditional linear models)
If you want to try it out for yourself, the code and the data are part of this OSF project!

osf.io/wrc2m/

You can also apply the code to your own data. Would be great to see more CA papers out there given how often relevant predictors are highly correlated
Read 11 tweets
9 Dec 18
Two other researchers and I just went from a partial draft of a Methods section to a full draft of Introduction, Methods, and Results in less than a day

How? Let's talk how we approached our Paper In a Day (Trademark @JnfrLTackett @cmbrandes @kathleenwade @allisonshieldsy)
A vast majority of the legwork was done and it took much longer than a day!

This paper is a systematic review, so we spent months meeting once a week to eat queso, drink beer, and code articles

But momentum had slowed down, and we didn't want that effort to go to waste!
We all worked on the same document in Google Docs simultaneously using Paperpile

Its add-on for Google Docs is free and makes it super easy to write/cite in parallel rather than having to bounce drafts back and forth
Read 11 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(