We are R-Ladies Profile picture
Rotating Curator for the @RLadiesGlobal community #RLadies. This week’s curator: @ChilomboCareers
Episurgeon Profile picture Claire Narraway Profile picture Sophie Carruthers Profile picture 3 added to My Authors
Dec 10, 2021 10 tweets 12 min read
After that inspiring panel of women in Data Science, I want to give a spotlight and appreciation round to all those #RLadies that are active on #rspatial and that have definitely inspired me to also try and be part of the community 🤩 I invite you all to give a shoutout to those amazing #RSpatialLadies that have crossed your path!
Dec 10, 2021 14 tweets 13 min read
Very excited about this panel coming up! 👇
#WomenInTech #RLadies Loved this line from @McconnellKyla:
"I realized I was becoming a copy/paste data scientist and decided that I should invest more time in this"
Dec 10, 2021 13 tweets 8 min read
Anyone wants to get started or has used #rspatial and learned from tutorials/courses using packages such as {rgdal} {rgeos} {sp} {raster}? Are you getting used to them already? You might want to reconsider 🤔
🧵 1/n For the past few months the R Spatial community of developers and active users has been dealing with the news of the retirement of {rgdal} & {rgeos}
🧵 2/n
Dec 9, 2021 6 tweets 4 min read
Hi all 👋 today I would like to talk about #CV tips. Did you know you can create your CV in #rstats? There are so many cool 📦 out there!
I personally use {vitae} with a good range of eye-catching templates to choose from, I highly recommend it! What I like the most about building my CV in R is that I can organize everything in an R-project, I push to GitHub which gives me track changes and I can use the great advantages of #rmarkdown and #latex. Here is the repo of my #vitae CV github.com/loreabad6/R-CV
Dec 8, 2021 7 tweets 3 min read
🙌 Today was a great example of how the #rstats community can help getting learning resources! Thank you for all the amazing material about #deeplearning & #reproducibility with R 🙏 This feels like a good preamble to the remaining poll results about learning strategies 👇 Results to poll about learn... With all these materials out there I am now wondering when will I have time to read it all, same with practicing code and the new skills I will learn after going through them!
Dec 8, 2021 4 tweets 4 min read
Morning! Let's continue with the polling results, today and get some #learningtips. Here is what people voted for regarding learnings strategies! Not surprisingly, the big majority likes some hands-on exercises, although a mix between theory and practice is also welcome 😎 Results to poll about learn... 📚 Very few people voted for learning from books about coding, but have you already seen all the resources out there? #rstats has a big pile, and the best way to browse them all is with the Big Book of R #rmarkdown #bookdown
bigbookofr.com/index.html
Dec 7, 2021 12 tweets 4 min read
📢 Results for yesterday's poll are in everyone! Thank you all for your participation! 🙏There is so much to talk about, so today, let's start discussing the first question: why have you not tried programming yet? 🧐
🧵 1/6 Results to poll about reaso... Most often (43%), the reason to not learn coding is lack of time. Uff! Having no time is always an issue, trust me! There will never be enough time to do all the things we want
🧵 2/6
Feb 5, 2020 17 tweets 37 min read
The @RLadiesMelb community really wanted to learn advanced #rstats topics, so in this thread I will highlight some of the useful resources and tips from our events incl. #reproducibleresearch, #rmarkdown, #bookdown, #nlp, #randomforest, #webscraping, #mixOmics 📦, #rshiny & more @RLadiesMelb For a great overview of the "evolution" of #reproducibleresearch, best practices for #rmarkdown, an introduction to #workflowr and how to work with #bigdata in R via #AWS check out the presentation by @MrsLaviniaG here: mrslaviniag.github.io/xaringan_prese…
Feb 2, 2020 19 tweets 6 min read
I thought it could be helpful to have a thread on ANOVA in R. As a statistical consultant, this is the most frequent FAQ I get from clients - how to run a linear model on their data, conduct hypothesis tests, extract predicted means and perform contrasts. To put is plainly - running anova in R is a hot mess. It is a patchwork of functions from different packages and different authors with differing philosophies.
Dec 10, 2019 8 tweets 4 min read
The great thing about #rstats is that there are so many open source learning resources, it is tough to know where to start. Sometimes resources designed by beginners (or people who have just learned the thing you are trying to learn) are best. 1/n Why? People who have only just learned something don’t yet suffer from the curse of knowledge. They still know what it feels like to be totally overwhelmed and make fewer assumptions about what you might already know. 2/n

Dec 10, 2019 7 tweets 4 min read
So you want to learn #rstats? I’ll tell you about my favourite learning resources in a minute, but first… some tips and tricks. 1/n Tip 1: Find a reason. It is tough to get motivated to learn something new when your old workflow, while inefficient and not reproducible, gets the job done. If you try to learn #rstats in service of a real project, you are more likely to persevere. 2/n
Oct 5, 2019 7 tweets 9 min read
So, I love the amazing variety of colour palettes we have available in R! 🎨 🖌️

I'll share some of my favorites! #RLadies #dataviz

First up: Wes Anderson by @_inundata

🔗: github.com/karthik/wesand… @_inundata Next: FISH 🐟🐠🍥

I discovered this one by chance today, and can't wait to try it out!

#fishualize by @NinaSchiett 🌊

🔗: nschiett.github.io/fishualize/ind…
Aug 6, 2019 6 tweets 3 min read
Morning! So here's a few more of my tricks for emulating the amazing coding I've seen from people using R and RStudio!
For example did you know you can hit Tab between "" & it shows you the directories + files in your current workspace? Do you know the name of the file you're looking for but no idea where in the folder structure it is? No problem, type a few chars of your file name btw "" and hit Tab -> the chars you typed are replaced with the path to the file you're looking for.
Jul 4, 2019 5 tweets 5 min read
Happy 4th of July!!
One area of intergration of ML and econometrics is providing inference after variable selection (Post selection Inference) #rstats #econtwitter #Rladies #ML #econometrics 1/n Most popular technique in economics is the 'Double LASSO' which provides inference on the treatment effect after variable selection using LASSO. Check out the R package 'hdm'. cran.r-project.org/web/packages/h…
#rstats #econtwitter 2/n
Jul 2, 2019 6 tweets 2 min read
First up: The LASSO family

Perhaps the most popular ML technique in economics is LASSO - a variable selection technique. The R package 'glmnet' gives users a range of distributions of the response variable to choose from: normal, binomial, poisson, multinomial,cox and others! Interaction terms in the model are common in econ (& other soc. sciences). Want to select interactions along with the main effects? R package 'hierNet' implements Hierarchical LASSO. Users can choose the kind of hierarchy condition based on the research Q!
May 10, 2019 11 tweets 8 min read
Let's chat about building a R community within your office. I was the only R user in the office and I also wanted people I could go to for help and just thought partner to solve problems. So, here are some of my lessons learned from building a R culture in the office!
👇👇 📌Lesson 1: Don't tell or lecture about the benefits of #rstats, instead show them a demonstration!

People generally are motivated when they see how easy it is to start something small and with a specific example!

my ex: being able to work with multiple dataframes concurrently.
May 8, 2019 6 tweets 3 min read
I cannot curate this week without mentioning the amazing janitor package by @samfirke! A new version was recently released! sfirke.github.io/janitor/news/i…

📌 clean_names: cleans columns names by removing white space, renaming % to percent, etc. You can use it with %>% or %<>%! @samfirke 📌get_dupes is another function to identify duplicates by one or multiple columns.

df%<>%
get_dupes()
Jan 28, 2019 5 tweets 2 min read
So - #rspatial work in insurance! Other than natural perils (which I will talk about at some point) it’s also used to identify the features of individual dwellings that might impact their risk of things like burglary or malicious damage. For example, I found every house in Australia that’s in a culdesac (Less people going through might reduce theft, but access for fire engines is harder and increases fire risk). Here’s a quick screenshot of visual checks I did using the package leaflet to make sure it was working
Nov 28, 2018 31 tweets 8 min read
I'm sitting down to start a task in R.

I don't entirely know how to complete it.

I'm going to try to document my process in this thread in real time. The rubric for my students' projects includes "Consistency of effort" which is partly assessed by examining their github commit history over the weeks since they started.