Discover and read the best of Twitter Threads about #ggplot2

Most recent (24)

If you're starting out in data science (or if you're wondering what you need to learn), don't believe everything you read. 🧵

Spot BS and focus on these 4 steps to grow your career.

#datascience #rstats #career
My friend Rafael Nicolas Fermin Cota (Nico) pointed me to this modified graphic from a Harvard Business Review Article on "Prioritizing Which Data Science Skills Your Company Needs".
With ChatGPT, AI, and the "trendiness" of buzzwords, this graphic becomes even more dangerous. hbr.org/2018/10/priori…
Read 13 tweets
1/ 🌟 R's Hidden Gems: Lesser-Known Functions and Packages You Need to Know! 🎉 Discover powerful functions and packages that can help you become an R power user. Let's dive in and learn together! #rstats #AdvancedR #DataScience
2/ 🧙‍♂️ Functions: R has many built-in functions that often go unnoticed. Here are a few:
• with(): Evaluate an expression inside a data frame without repeating the $ operator
• switch(): Simplify conditional expressions
• do.call(): Call a function with arguments in a list
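The three base R functions above can be sketched in a few lines. This is a minimal illustration, not code from the thread; the `scores` data frame and the `summarise_by()` helper are made up for the example.

```r
# Hypothetical toy data
scores <- data.frame(name = c("a", "b", "c"), value = c(1, 5, 9))

# with(): evaluate an expression using the columns as variables
mean_value <- with(scores, mean(value))

# switch(): pick a branch by name instead of chaining if/else
summarise_by <- function(x, how) {
  switch(how,
    mean = mean(x),
    median = median(x),
    stop("unknown summary: ", how)
  )
}
median_value <- summarise_by(scores$value, "median")

# do.call(): call a function with its arguments held in a list
args <- list(1, 5, 9, sep = "-")
joined <- do.call(paste, args)
```

`do.call()` is most useful when the argument list is built at run time, for example when combining a list of data frames with `do.call(rbind, list_of_frames)`.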
3/ 📦 janitor: Clean and preprocess your data with ease. Functions like clean_names() and remove_empty() make your data tidier and more manageable. Say goodbye to messy data frames! #rstats #datascience
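A small sketch of the two janitor functions named above, assuming the janitor package is installed. The `raw` data frame is invented for illustration.

```r
library(janitor)

# Hypothetical messy input: awkward column names, an all-NA row and column
raw <- data.frame(
  "First Name" = c("Ada", "Grace", NA),
  "Total Sales" = c(10, 20, NA),
  "Notes" = c(NA, NA, NA),
  check.names = FALSE
)

# clean_names() converts column names to snake_case
cleaned <- clean_names(raw)

# remove_empty() drops rows and columns that contain only NA
cleaned <- remove_empty(cleaned, which = c("rows", "cols"))
```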
Read 10 tweets
As the #rstats course material is not public (yet?) or available as online training (yet?), I thought I'd share some slides from the deck.

The course covers all steps of the #DataScience workflow as featured in @hadleywickham's fantastic #R4DS 📕 r4ds.had.co.nz/index.html
Let's start with session 1:
"Introduction to #rstats and #rstudio" ®️
The fundamentals of R include:

* values
* assignments and objects
* functions
* data types
* unknown values
* vectors
* factors
* packages
* tabular data
* data generation
* data import
Read 16 tweets
🌎 I just found a series of incredible maps made with #RStats! From interactive maps to 3D designs, there is something for every spatial data enthusiast.
👇 8 accounts that are definitely worth a look! 🧵
#dataviz #maps #geospatial #gis
✅ Tyler Morgan-Wall @tylermorganwall

🔗 buff.ly/3ZBWNKe
Rotating 3D map with points of light (previous map)

🔗 buff.ly/3QDfXLx
Earth's submarine fiber-optic cable network.

Uses #rayshader #rayrender #rayverse
#dataviz #maps #geospatial #gis
✅ Milos Popovic @milos_agathon
🔗 buff.ly/3CJM6LN
Map of the % of employees in manufacturing, Eurostat data.

#dataviz #maps #geospatial #gis #rstats #DataVisualization #stats #DataScience
Read 9 tweets
Organised another workshop on #DataViz in R using #ggplot2 within our Forest Entomology group at @WSL_research. Managed to cover most of the basics, looking forward to the "Advanced DataViz with ggplot2" workshop next week! A few issues / special cases we discussed today: 1/n
When plotting raw data on top of e.g. boxplots, to increase visibility, use geom_jitter() to "jitter" the datapoints around the center of the boxplot, along the x-axis. However, make sure you include the argument height = 0, otherwise, the datapoints will be jittered 2/n
along the y-axis, which will change their actual value! Easy to spot when the min / max don't align with the "outliers" (given in black). Left: height = 0, right: height not set to 0, the datapoints are therefore jittered along the y-axis. 3/n
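The `height = 0` advice can be sketched as follows. This is an illustrative example using the built-in `iris` data, not the workshop's own code; the `width` value is an arbitrary choice.

```r
library(ggplot2)

set.seed(123)  # jittering is random; fix the seed for a reproducible plot

p <- ggplot(iris, aes(x = Species, y = Sepal.Length)) +
  geom_boxplot() +
  # width spreads the points horizontally within each box;
  # height = 0 leaves the y values (the actual measurements) untouched
  geom_jitter(width = 0.2, height = 0, alpha = 0.5)
```

Printing `p` draws the plot. Leaving out `height = 0` lets `geom_jitter()` add vertical noise as well, which is what misplaces the points relative to the boxplot whiskers and outliers.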
Read 13 tweets
Are you interested in getting started in #bioinformatics but not sure where to begin? Here are some tips to help you get started on your journey. A THREAD🧵🧵:
Start by learning a high-level #programming language, such as #Python or #R, and familiarizing yourself with data structures and #algorithms commonly used in #bioinformatics. The #BioPython and #Bioconductor libraries are great resources for this.
Next, learn about #genomic data formats and standards, such as #FASTA, #FASTQ, and #GFF. This will allow you to effectively manipulate and analyze large-scale #genomic #datasets. The #NCBI SRA and #EBI ENA databases are great places to find real-world data to work with.
Read 12 tweets
(1/n) A fun and practical idea for an R package: a #ggplot2 extension to modify plots so they adhere to an organization’s visual identity 📊📈📉⬇️
(2/n) A visual identity allows an entity to be instantly recognized. Made up of their logo, typography, and colors, it can be applied when creating external or internal communications. For example, here are the visual building blocks of my alma mater @UBC: bit.ly/3EN0oNs
(3/n) My first crack at package development was during my master's studies with @UBCMDS. My classmates and I created a #ggplot2 extension to allow users to change the appearance of plots based on the visual identities of different Canadian universities: bit.ly/3VHssIe
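A rough sketch of what such an extension might contain: a named palette, a fill scale, and a theme. Everything here is hypothetical, including the colours and the names `brand_colours`, `scale_fill_brand()`, and `theme_brand()`; it is not the API of the package mentioned in the thread.

```r
library(ggplot2)

# Hypothetical brand colours; a real package would take these from the
# organization's style guide
brand_colours <- c(primary = "#1B365D", secondary = "#4F9D9D", accent = "#F2A900")

# Discrete fill scale that applies the brand palette
scale_fill_brand <- function(...) {
  scale_fill_manual(values = unname(brand_colours), ...)
}

# Theme that layers brand tweaks on top of theme_minimal()
theme_brand <- function(base_size = 12) {
  theme_minimal(base_size = base_size) +
    theme(
      plot.title = element_text(face = "bold", colour = brand_colours[["primary"]]),
      panel.grid.minor = element_blank()
    )
}

p <- ggplot(mpg, aes(x = drv, fill = drv)) +
  geom_bar() +
  scale_fill_brand() +
  labs(title = "Cars by drive type") +
  theme_brand()
```

Wrapping the palette and theme in functions means users add two lines to an existing plot rather than restyling it by hand.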
Read 7 tweets
(1/n) I love telling stories with data and I have recently discovered #gganimate as a way to take my #dataviz to the next level.

gganimate.com
Here is some sample code to generate a random walk, create a line plot, and layer in the animation. I love how it's simple to layer within the #ggplot2 framework.
(3/n) Here is the resulting plot! Using `transition_reveal()` keeps the previous points on the graph, but it is highly configurable. For example, I could use `transition_time()` or `transition_states()` to only show the point being added.
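The sample code in the thread was posted as an image, so the following is only a guess at its shape: a minimal random walk animated with `transition_reveal()`, assuming ggplot2 and gganimate are installed. The step count and seed are arbitrary.

```r
library(ggplot2)
library(gganimate)

set.seed(42)
n_steps <- 100

# Random walk: cumulative sum of +1 / -1 steps
walk <- data.frame(
  step = seq_len(n_steps),
  position = cumsum(sample(c(-1, 1), n_steps, replace = TRUE))
)

# An ordinary line plot with the animation added as one more layer
p <- ggplot(walk, aes(x = step, y = position)) +
  geom_line() +
  geom_point() +
  transition_reveal(step)  # reveal the data gradually along `step`

# animate(p) renders the animation; it needs a renderer such as gifski
```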
Read 7 tweets
👩🏼‍💻 How do you set up your own #ShinyApp?

It's easy in #rstats! Start a new #Rproject and select "Shiny Application". It will create a project with an "app.R" file for you ✨
Once it's open, you can replace the code that is already in the "app.R" file with this code snippet below👇 It does all the magic and shows how you can build a simple #ShinyApp 🔮

(it's here for you to try: bit.ly/shinyapp-test)
What the ShinyApp does:

You have checkboxes on the left side that let you choose countries (it's the ISO3 abbreviation, so "RWA" stands for Rwanda) and, depending on what you selected, your #ShinyApp will show a (non-realistic) population size for each country in a new plot.
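The thread's snippet is not reproduced here, so the following is a hedged reconstruction of an app with that behaviour, assuming the shiny package is installed. The country list, the population numbers, and the input and output IDs are all invented.

```r
library(shiny)

# Made-up population sizes, purely illustrative
population <- data.frame(
  country = c("RWA", "DEU", "BRA"),
  size = c(120, 340, 560)
)

ui <- fluidPage(
  sidebarLayout(
    sidebarPanel(
      # Checkboxes on the left to choose countries by ISO3 code
      checkboxGroupInput("countries", "Countries",
                         choices = population$country, selected = "RWA")
    ),
    mainPanel(plotOutput("pop_plot"))
  )
)

server <- function(input, output) {
  output$pop_plot <- renderPlot({
    req(input$countries)  # draw nothing until at least one box is ticked
    selected <- population[population$country %in% input$countries, ]
    barplot(selected$size, names.arg = selected$country,
            ylab = "Population (made up)")
  })
}

# Creates the app object; printing it (or calling runApp(app)) launches it
app <- shinyApp(ui, server)
```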
Read 24 tweets
Hello everyone 👋 my name is Cosima and I usually tweet at @cosima_meyer.
In my professional life, I work as a data scientist in the field of AI. I love my job because it allows me to work with fantastic teams to solve puzzles👩🏼‍💻
Before working in the industry, I explored my passion for research in academia. I hold a PhD from the University of Mannheim in #politicalscience (my dissertation dealt with post-civil war stability 🕊).
While working at @MZESUniMannheim, I co-founded a #datascience blog “Methods Bites” (@mzes_ssdl) which provides several hands-on tutorials on #NLP, scraping #Twitter data, #dataviz, or how to publish your package on #CRAN.
Read 11 tweets
Starting out in #R can be tough.

Here are the 7 packages that have helped me tremendously.

#rstats
1. #dplyr for data wrangling

github.com/tidyverse/dplyr
2. #tidyr for tidying, wrangling and pivoting

github.com/tidyverse/tidyr
Read 10 tweets
"Graphic Design with #ggplot2" 👨‍💼👩‍💻🧑‍💻

Do you want to recap the 2-day workshop at #rstudioconf? Or do you feel sad you've missed it?

🔥 All course material incl. latest updates can be found on the workshop webpage—9 sessions, 760 slides, 314 ggplots!

👉 rstudio-conf-2022.github.io/ggplot2-graphi…
The session pages contain not only the slides but
🔵 hands-on #rstats codes
🔵 recap notes
🔵 exercises incl.
🔵 prepared scripts, either as #quarto or #rmarkdown
🔵 step-by-step solutions

➕ all source codes on GitHub:
Some exemplary #dataviz from the workshop, 💯 done in #rstats thanks to #ggplot2
Read 4 tweets
Forget regular heat maps. Use bubbles on a grid instead 🔵 🟢 🤯

A short #dataviz thread 🧵 🧵

#rstats #ggplot2

1/8
Regular heat maps have the crucial flaw of not showing how many samples were used. 🤔

That's totally fine when the different sizes are shown (e.g. with colors). It's what I did with my calendar plot a while back (special heat map)


2/8
But look what happens when I use a color gradient for a summary statistic.

Here, I try to show a relationship between sale price of a house and a property's size + location.

This looks right. But it isn't. Some medians were estimated with ridiculously small samples. 😱 💔

3/8
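A sketch of the bubble-grid idea: keep the colour for the summary statistic, but map the sample count to point size so thin cells are visibly small. The `houses` data below is simulated and the column names are made up; it is not the data set used in the thread.

```r
library(ggplot2)

set.seed(1)

# Hypothetical housing data with deliberately unbalanced locations
houses <- data.frame(
  location = sample(c("North", "South", "East", "West"), 300, replace = TRUE,
                    prob = c(0.5, 0.3, 0.15, 0.05)),
  size_class = sample(c("small", "medium", "large"), 300, replace = TRUE),
  price = rlnorm(300, meanlog = 12, sdlog = 0.4)
)

# One row per grid cell: median price and the number of sales behind it
cells <- aggregate(price ~ location + size_class, data = houses,
                   FUN = function(x) c(median = median(x), n = length(x)))
cells <- do.call(data.frame, cells)  # flatten to price.median and price.n

# Bubbles on a grid: colour = median price, area = sample size
p <- ggplot(cells, aes(x = size_class, y = location)) +
  geom_point(aes(size = price.n, colour = price.median)) +
  scale_size_area(max_size = 15) +
  labs(size = "Sales", colour = "Median price")
```

`scale_size_area()` makes bubble area proportional to the count, so a median backed by a handful of sales shrinks to a small dot instead of filling a full tile.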
Read 9 tweets
We've been exploring the new #T2T human genome assembly compared to previous versions (our news and views article is out today, link below). Some observations:
(1) It's pretty amazing what the human genome project was able to do in 2004: large-scale gene order is totally conserved
(2) Some of the chromosomes are nearly identical between the 2013 GRCh38 assembly and T2T. For example, here is a zoom in on Chr10 - just a few gaps in the pericentromere that were filled in with tandem repeats in T2T.
(3) But other chromosomes have been dramatically improved. For example, here are Chr9 and acrocentric Chr14.
Plot legend: sliding window proportion of sequence in genes (dark orange = exons) and repeats (light blue = #repeatMasker, dark blue = #trf); red segments = GRCh38 gaps.
Read 8 tweets
From standard output to powerful storytelling.

Here's how you can recreate the storytelling with data look in #ggplot2.

A detailed thread 🧵⬇️
#rstats #dataviz
This thread is based on my newest blog post. Find it at albert-rapp.de/post/2022-03-2…

There, you will find more details and resources than I can offer in this thread.

The blog post was inspired by a makeover video by storytelling with data. You can find it at
This is our original dataviz. We will improve it to tell a compelling story.
Read 17 tweets
@waitbutwhy uses powerful visuals to illustrate his ideas.

One visual stuck in my head. And I think the message is so important that it cannot be said often enough.

So, I practiced recreating the visual with #ggplot2.

🧵w/ original and resources⬇️ #rstats #dataviz
Here is the original tweet (check out his other tweets too if you get a chance).

I like to think that I came close to the original but the stick men were too tricky. Also, it turns out that the original uses its own font that I could not replicate.

For this visual, I used Bézier curves. I could try to explain them but I am not qualified to do that, I think.

Rather, check out this visually pleasing video from @FreyaHolmer. It gives you a quick and beautiful rundown.

Read 11 tweets
@dr_allancameron and I are thrilled that the #rstats package {geomtextpath}, a #ggplot2 extension, that we've been rearing over the past few months has landed on CRAN 📢! We loved that there was engagement during development, so a few of our favourites in the 🧵⬇️
Allan and I both answer ggplot2 questions on SO a bit, so it was no coincidence that we were intrigued by the same question that sparked the package
stackoverflow.com/q/69867669/113…
Early on when we just got it working in principle, there was already a blogpost out that put us *on the right path*:
Read 13 tweets
🌟Publication alert!🌟
Botanists are discovering several endemic plants in the Indian savannas—ecosystems long misinterpreted as wastelands of low conservation value.
Article link (DM for access if needed): onlinelibrary.wiley.com/doi/10.1111/bt…
Read on for a summary. 1/n
A huge proportion of biodiversity remains undescribed globally, and this biodiversity knowledge shortfall is disproportionately greater for tropical savannas compared to tropical forests 2/n
Some of this is because tropical savannas have been misinterpreted as secondary ecosystems having negligible endemics. E.g. Malagasy savannas were once thought to be the most degraded landscapes on earth with no endemics. Recent studies now show that 40% of Malagasy grasses are endemic 3/n
Read 17 tweets
🔥 THREAD: global fire patterns 🔥

Recent drought-related #wildfires have underscored how #climate change can increase fire risks regionally.
But various factors (e.g. human ignitions/suppression, vegetation growth) control fire, making regional fire patterns horribly complex!
It's tough to distill all these complexities into general statements, so here's a thread to showcase what 🛰 observations can reveal about 🔥 trends at various regional scales.

I invite you to get lost in the maps that follow and appreciate the complexity in all its glory! 🤘
First up, a basic question: Where do most fires happen?

👇 Plot shows mean annual burned area during 2001-2020, summed within regions and mapped thematically.

(Note: log colour scale)
Read 25 tweets
📊🧵 Collection of tweets featuring open-access materials that I have shared over the last years:
Talks, seminars, blog posts, hands-on notebooks, codes, and more!
#rstats #ggplot2 #tidyverse #dataviz 🧙‍♂️


Title slide of my "ggplot Wizardry" talk at Outlier conference 2021.
Final visualization of my "Evolution of a ggplot2" blog post.
Overview of plots contained in my "A ggplot2 tutorial for beautiful plotting in R"
Title slide of my "Beyond Bar and Box Plots: Chart alternatives and how to design them with ggplot2" talk for USGS
Read 10 tweets
☀️New load of #rstats and #python charts 🚢

Would you know how to build them?

🧵🔗⬇️🙏
1/5 One more example by @CedScherer showing how to add proper labels on a line chart:

R: r-graph-gallery.com/web-line-chart…
Python: python-graph-gallery.com/web-line-chart…
2/5 One more example by @alextuowang: a good looking radar chart:

R: r-graph-gallery.com/web-radar-char…
Python: python-graph-gallery.com/web-radar-char…

btw, I know radar charts are criticised:
data-to-viz.com/caveat/spider.…
Read 7 tweets
I bet you haven't read very many studies on restoring Indian #grasslands. Read on for a summary of our recent article in @SERestoration's flagship journal: onlinelibrary.wiley.com/doi/10.1111/re…
#restoration #dryland #invasive #India #biodiversity #Gujarat #GenerationRestoration #Prosopis 1/n
When we think of restoration, we often think of planting trees. While adding trees might be useful for restoring (some) forests, restoring grasslands often needs tree removal. Encroaching trees on grasslands, like the #invasive Prosopis juliflora in India, are a restoration challenge 2/n
Understanding how to best manage encroaching woody plants in biodiverse #tropical grasslands is critical for sustaining biodiversity and people. 3/n
Read 11 tweets
This semester, @BeaMilz and @jtrecenti taught the first cohort of the Reports and Data Visualization course. The final projects turned out really well, and 3 were awarded a scholarship for any course! Here's the 🧶:
#RStats #ggplot2 #TidyTuesday
In alphabetical order, the first project is by Ariane Hayana @arianehayana!

The project was a #TidyTuesday with data on food consumption and CO2 emissions.

🖥️ Read the project at: curso-r.github.io/202103-visuali…

#rstats #ggplot2 #dataviz #DataScience
The project by Brunna Escouto @brunnace was a #TidyTuesday with data on the cost of transit infrastructure projects, especially subways, with information from 54 countries.

🖥️ Read the project at: curso-r.github.io/202103-visuali…

#rstats #ggplot2 #dataviz #DataScience #xaringan
Read 6 tweets
(1/12) Announcing R CHARTS, the new #rstats and #rstatsES #dataviz site which includes, per language:

📈 Over 1100 #ggplot2 and base R charts
🧑‍🏫 Over 120 tutorials
📊 Over 50 different chart types
🎨 Several color & palette tools

r-charts.com

Made with #blogdown 🤩
(2/12) The site contains 8 different categories (the Spatial section is under construction) based on the Financial Times Visual Vocabulary:

raw.githubusercontent.com/ft-interactive…
(3/12) Once you pick a category you will see all the available #rstats charts for that category.

You can filter them by their chart type and base R or #ggplot2.

r-charts.com/correlation/
Read 12 tweets
