[1/10] 🚀 Advanced R Debugging: Debugging & error handling are essential skills for every R programmer. In this thread, we'll explore powerful tools & techniques like traceback(), browser(), & conditional breakpoints to make debugging in R a breeze. #rstats #datascience
[2/10] 📝 traceback(): When your code throws an error, use traceback() to get a detailed call stack. This function helps you identify the exact location of the error in your code, making it easier to pinpoint the issue. #rstats #debugging #datascience
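A minimal sketch, using two hypothetical functions f() and g():

```r
f <- function(x) g(x)
g <- function(x) log(x)  # log() errors on non-numeric input

f("oops")     # Error in log(x): non-numeric argument to mathematical function
traceback()   # prints the call stack, innermost call first: g(x), then f("oops")
```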
[3/10] 🔍 browser(): With browser(), you can pause the execution of your code & step through it one line at a time. This interactive debugging tool allows you to inspect the values of variables and expressions, which can be a game-changer when diagnosing complex issues. #rstats
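A minimal sketch with a hypothetical summarise_vec() helper; at the Browse> prompt, `n` steps to the next line, `c` continues, `Q` quits:

```r
summarise_vec <- function(x) {
  browser()            # execution pauses here in an interactive session
  m <- mean(x)         # inspect x, m, s, ... while stepping through
  s <- sd(x)
  c(mean = m, sd = s)
}

summarise_vec(c(1, 5, 9))
```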
[4/10] 🎯 Conditional breakpoints: Use debug() or debugonce() to flag a function so the debugger opens on its next call(s), or drop browser(expr = condition) into your code for a true conditional breakpoint: execution pauses only when the condition is TRUE, letting you inspect the environment right where things go wrong. #rstats #datascience
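A minimal sketch of a conditional breakpoint via browser(expr = ...), with a small hypothetical data frame:

```r
df <- data.frame(a = c(1, NA, 3), b = c(4, 5, 6))

for (i in seq_len(nrow(df))) {
  browser(expr = anyNA(df[i, ]))  # pauses only for rows containing NA
  print(sum(df[i, ]))
}

debugonce(var)  # flags var() so only its *next* call opens the debugger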
[5/10] 🧪 Testing your code: Writing tests for your code using the testthat package can help you catch errors early on. This practice will save you time and effort in the long run, ensuring your code is reliable and robust. #rstats #testthat #datascience
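A minimal sketch, assuming a hypothetical safe_divide() helper:

```r
library(testthat)

safe_divide <- function(x, y) {
  if (y == 0) stop("division by zero")
  x / y
}

test_that("safe_divide() divides and fails loudly", {
  expect_equal(safe_divide(10, 2), 5)
  expect_error(safe_divide(1, 0), "division by zero")
})
```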
[6/10] 🕵️‍♀️ Debugging Shiny apps: Use the reactlog package and shiny::reactiveConsole() to debug Shiny apps. These tools help you visualize the dependencies between reactive expressions, track their execution, and experiment with reactivity interactively. #rstats #shinyapps #datascience
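A minimal sketch of the workflow; "path/to/app" is a placeholder for your own app:

```r
options(shiny.reactlog = TRUE)  # record the reactive graph while the app runs

shiny::runApp("path/to/app")    # interact with the app, then stop it
shiny::reactlogShow()           # open the recorded reactive graph in a viewer

shiny::reactiveConsole(TRUE)    # experiment with reactives at the console
x <- shiny::reactiveVal(10)
shiny::observe(message("x is now ", x()))
```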
[7/10] ⏲️ Debugging performance issues: Use the profvis package to profile your R code and identify performance bottlenecks. Optimizing your code can lead to significant speed improvements and a better user experience. #rstats #profvis #datascience
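A minimal sketch: profile a deliberately slow loop (growing a vector) against a preallocated version:

```r
library(profvis)

profvis({
  out <- c()
  for (i in 1:1e5) out <- c(out, i^2)  # grows a vector: repeated copying

  out2 <- numeric(1e5)
  for (i in 1:1e5) out2[i] <- i^2      # preallocated: far less time and memory
})
```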
[8/10] 📚 Learning from others: When you encounter an error, don't hesitate to search for similar issues on platforms like Stack Overflow or the R mailing list. Chances are, someone has already encountered a similar problem & shared their solution. #rstats #datascience
[9/10] 🛠️ Customize error messages: Use the stop() and warning() functions to create custom error messages in your R code. This practice makes your code more user-friendly, and it can also help you identify errors more easily. #rstats #errorhandling #datascience
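A minimal sketch with a hypothetical log_safely() wrapper:

```r
log_safely <- function(x) {
  if (!is.numeric(x)) stop("`x` must be numeric, not ", class(x)[1])
  if (any(x <= 0, na.rm = TRUE)) warning("non-positive values yield NaN or -Inf")
  log(x)
}

log_safely(c(1, 10))  # works
log_safely("a")       # custom, informative error message
```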
[10/10] 🎉 Happy debugging! Mastering these advanced debugging techniques will make you a more efficient R programmer and help you tackle even the most challenging coding issues. Keep practicing and learning, and you'll be unstoppable! 💪 #rstats #debuggingtips #datascience
1/ 🎯 Introduction 📌
The #caret package in #R is a powerful tool for data pre-processing, feature selection, and machine learning model training. In this thread, we'll explore some useful tips & tricks to help you get the most out of caret. #DataScience #MachineLearning #RStats
2/ 🧹 Data Pre-processing 📌
caret offers various data pre-processing techniques, like centering, scaling, and removing near-zero-variance predictors. Use the preProcess() function to apply these methods before model training. 🧪 #RStats #DataScience
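A minimal sketch on the built-in iris data:

```r
library(caret)

pp <- preProcess(iris[, 1:4], method = c("center", "scale", "nzv"))
iris_scaled <- predict(pp, iris[, 1:4])  # apply the learned transformations
summary(iris_scaled)
```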
3/ ⚙️ Feature Selection 📌
Use the rfe() function for recursive feature elimination. This method helps you find the most important features in your dataset, improving model performance & interpretation. 🌟 #RStats #DataScience
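A minimal sketch on mtcars; rfFuncs fits random forests under the hood, so the randomForest package must be installed:

```r
library(caret)
set.seed(42)

ctrl <- rfeControl(functions = rfFuncs, method = "cv", number = 5)
fit <- rfe(x = mtcars[, -1], y = mtcars$mpg,
           sizes = c(2, 4, 6), rfeControl = ctrl)
predictors(fit)  # the selected subset of features
```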
1/🧶📝 Welcome to a Twitter thread discussing the pros & cons of two #R packages, #knitr and #sweave. These packages allow us to create dynamic, reproducible documents that integrate text, code, and results. Let's dive into the strengths and weaknesses of each. #Rstats
2/🔍 #knitr is a more recent and widely-used package that simplifies the creation of dynamic reports. It's an evolution of #sweave and supports various output formats, including PDF, HTML, and Word. Plus, it's compatible with Markdown and LaTeX! #Rstats
3/🌟 Pros of #knitr:
✅ Better syntax highlighting
✅ Cache system to speed up compilation
✅ Inline code chunks
✅ Flexible output hooks
✅ More output formats
✅ Integrates with other languages
Overall, it provides more control and customization in document creation; see the minimal chunk sketch below. #RStats
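A minimal .Rmd sketch illustrating a cached chunk and an inline code chunk (the chunk name slow-step is arbitrary):

````markdown
```{r slow-step, cache=TRUE}
res <- mean(rnorm(1e6))  # cached: re-knitting skips this unless the code changes
```

The estimated mean was `r round(res, 3)`.
````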
🧵1/9 Let's talk about methods for identifying the optimal number of clusters in cluster analysis!
Cluster analysis is a technique used to group data points based on their similarity. Here are some popular methods & R packages. #RStats #DataScience
🔍2/9 Elbow Method: The Elbow Method involves plotting the explained variation (inertia) as a function of the number of clusters. The "elbow point" on the curve represents the optimal number of clusters. R package: 'factoextra' #RStats #DataScience cran.r-project.org/web/packages/f…
📈3/9 Silhouette Score: This method evaluates the quality of clustering by calculating the average silhouette score of each data point. Higher silhouette scores indicate better cluster assignments. Optimal clusters have the highest average silhouette score. cran.r-project.org/web/packages/c…
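A minimal sketch of both methods via factoextra's fviz_nbclust(), on the built-in USArrests data:

```r
library(factoextra)

df <- scale(USArrests)  # standardize before distance-based clustering

fviz_nbclust(df, kmeans, method = "wss")         # elbow plot: look for the bend
fviz_nbclust(df, kmeans, method = "silhouette")  # highest average silhouette width wins
```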
🧵1/7 Understanding the difference between test set and validation set is crucial for building accurate and robust machine learning models. In this thread, we'll discuss the key differences between these two sets and their importance in model development. #MachineLearning #RStats
🧵2/7 Validation set: It is used during model development to tune hyperparameters and make decisions about the model architecture. It estimates how well the model generalizes while you iterate, but because it guides those choices, it no longer gives an unbiased estimate of performance on new data.
🧵3/7 Test set: This is a separate dataset not used during model training or validation. It's only used after the model has been finalized to assess its performance on completely unseen data. This provides an unbiased evaluation of the final model. #RStats #DataScience
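A minimal base-R sketch of a 60/20/20 split (the proportions are arbitrary):

```r
set.seed(1)
idx <- sample(c("train", "valid", "test"), nrow(iris),
              replace = TRUE, prob = c(0.6, 0.2, 0.2))

train <- iris[idx == "train", ]  # fit candidate models here
valid <- iris[idx == "valid", ]  # tune hyperparameters / choose a model
test  <- iris[idx == "test", ]   # touch once, after the model is final
```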
[1/9] 🎲 Let's talk about the difference between probability and likelihood in #statistics. These two terms are often confused, but understanding their distinction is key for making sense of data analysis! #Rstats #DataScience
[2/9]💡Probability is a measure of how likely a specific outcome is in a random process. It quantifies the degree of certainty we have about the occurrence of an event. It ranges from 0 (impossible) to 1 (certain). The sum of probabilities for all possible outcomes is always 1.
[3/9] 📊 Likelihood, on the other hand, is a measure of how probable a particular set of observed data is, given a specific set of parameters for a statistical model. Likelihood is not a probability: it's always non-negative, but it need not sum or integrate to 1 across parameter values.
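A minimal sketch with hypothetical data x, a normal model, and the sd fixed at 1:

```r
x <- c(4.8, 5.1, 5.3)  # hypothetical observations

# Probability (density): parameters fixed, outcome varies
dnorm(5.0, mean = 5, sd = 1)

# Likelihood: data fixed, parameter varies
loglik <- function(mu) sum(dnorm(x, mean = mu, sd = 1, log = TRUE))
mus <- seq(3, 7, by = 0.1)
plot(mus, sapply(mus, loglik), type = "l",
     xlab = "mu", ylab = "log-likelihood")  # peaks near mean(x)
```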