When I first learned R, I struggled to make data visualizations with ggplot2.

Here are the 3 things that helped me.

#rstats #R #datascience #datavisualization
Data visualizations are absolutely the most important thing to learn because of storytelling...

... the ability to help your business take action.

AND the most powerful R library for static data visualization is ggplot2.
But ggplot2 has a STEEP learning curve.

3 things that helped me...
1. The Canvas

Just like a painter starts with a blank canvas, so does the data scientist when making a plot.
KEY CONCEPT.
The canvas connects the columns in our data frame to parts of the plot.

The X and Y axes map columns to positions on the plot.

Color maps a column to a color gradient in this case.
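
A minimal sketch of the canvas step. The thread's own data and code are not shown in the text, so this assumes the `mpg` data set that ships with ggplot2, and the column choices (`displ`, `hwy`, `cty`) are only illustrative.

```r
library(ggplot2)

# Assumption: mpg (bundled with ggplot2) stands in for the thread's data.
# aes() maps data frame columns to parts of the canvas:
#   x     <- engine displacement
#   y     <- highway miles per gallon
#   color <- city miles per gallon (numeric, so it becomes a gradient)
canvas <- ggplot(
  data    = mpg,
  mapping = aes(x = displ, y = hwy, color = cty)
)

# No geometries have been added yet, so printing `canvas` would draw
# only the axes and grid.
```

Printing `canvas` at this point should give an empty plot with the axes in place, which is the blank canvas the geometries get painted onto.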
2. Geometries

Geometries are the structure that forms lines, points, smoothers, and so on.
KEY CONCEPT.
Each geom_ function adds a geometry, building your plot in layers.

As you progressively add more geometries (scatter points, lines, smoothers)...

...You build your plot up in layers.

This makes it easy to go from a simple plot to a very sophisticated one quickly.
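
Here is a rough sketch of the layering idea, again assuming the `mpg` data set bundled with ggplot2 rather than the thread's own data. Each `+ geom_*()` call adds one layer on top of the canvas.

```r
library(ggplot2)

# Assumption: mpg stands in for the thread's data (not shown in the text).
p <- ggplot(mpg, aes(x = displ, y = hwy, color = cty)) +
  # Layer 1: scatter points, colored by the cty gradient
  geom_point() +
  # Layer 2: a straight-line smoother drawn over the points
  geom_smooth(method = "lm", formula = y ~ x, se = FALSE, color = "black")

# Each geom_ call contributed exactly one layer.
n_layers <- length(p$layers)
```

Swapping or adding geoms (for example `geom_line()`) changes the plot without touching the canvas definition.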
3. Formatting

Formatting is one of the toughest parts because there are so many options.
PRO-TIP.
Don't try to learn everything.

Just learn the most common options.

Here are the most common options I use...
My most common formatting options:

- Expanding limits

- Applying scales to adjust colors and how the axis labels appear

- Adding labs for title, subtitle, and so on

- Adjusting the theme for professional quality plots
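
One way the four options above might look in code. This is a sketch under assumptions: the `mpg` data set, the colors, the label text, and the choice of `theme_minimal()` are all illustrative and not taken from the thread.

```r
library(ggplot2)

# Assumption: mpg stands in for the thread's data (not shown in the text).
p <- ggplot(mpg, aes(x = displ, y = hwy, color = cty)) +
  geom_point() +
  # Expanding limits: make sure both axes include zero
  expand_limits(x = 0, y = 0) +
  # Scales: adjust the color gradient and how the axis labels appear
  scale_color_gradient(low = "steelblue", high = "firebrick") +
  scale_y_continuous(labels = function(b) paste(b, "mpg")) +
  # Labs: title, subtitle, axis titles, and legend title
  labs(
    title    = "Highway Fuel Economy vs Engine Size",
    subtitle = "Illustrative example using ggplot2::mpg",
    x        = "Engine displacement (liters)",
    y        = "Highway fuel economy",
    color    = "City mpg"
  ) +
  # Theme: a clean built-in theme as a starting point
  theme_minimal()
```

Printing `p` renders the finished plot.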
SUPER IMPORTANT POINT
If you are just starting out learning R for business analytics, it takes time...

... You aren't going to become a PRO overnight.

BUT, you need a roadmap, and that's what I can help you with.
I've put together a FREE 40-minute webinar that shows you the 10 skills in R that helped me become a data scientist.

And, if I were starting again, this training would have easily shaved 6 months to 1 year off of my 5-year journey.

Free R Webinar: learn.business-science.io/free-rtrack-ma…
Here's the ggplot visualization from the code...
And if you want more R-Tips, you can join my R-Tips newsletter here. learn.business-science.io/r-tips-newslet…

Just sent this one out today. business-science.io/code-tools/202…
