Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Apr 20, 2023 • 25 tweets • 7 min read • Read on X

Scrolly

BIG NEWS: #ChatGPT breaks #Python vs #R Barriers in Data Science!

Data science teams everywhere rejoice.

A mind-blowing thread (with a FULL chatgpt prompt walkthrough). 🧵

#datascience #rstats

It's NOT R VS Python ANYMORE!

This is 1 example of how ChatGPT can speed up data science & GET R & PYTHON people working together.

(it blew my mind)

This example combines #R, #Python, and #Docker.

I created this example in under 10 minutes from start to finish.

I’m an R guy.

And I prefer doing my business research & analysis in R.

It's awesome. It has:

1. Tidyverse - data wrangling + visualization
2. Tidymodels - Machine Learning
3. Shiny - Apps

But the rest of my team prefers Python.

And they don't like R... it's just weird to them.

So I wanted to see if I could show them how we could work together...

Let’s start with a prompt.

I asked chatgpt to find a data set that I used for this example.

...ChatGPT found it...

... And gave me this code to read the data...

I prefer the tidyverse, so I asked Chatgpt to update the code.

That looks better.

With the data in hand, it’s time for some Data Science.

I asked this simple question.

ChatGPT's response was impressive.

But, even though I’m an R guy, my team uses Python for Deployment…

In the past, that’s a huge problem.

(resulting in days of translations from R to Python with Google and StackOverflow)

But now, that’s 1 minute of effort with chatGPT.

Can I show you?

I asked chatgpt to convert the R script to python...

And in 10 seconds chatgpt made this python code with pandas and scikit learn.

ChatGPT did in 10 seconds something that would have taken me 2 hours.

But let’s continue.

The reason we had to convert to Python is for “deployment”

Deployment is just a fancy word for allowing others to access my model so they can use it on-demand.

So I asked chatGPT this:

And ChatGPT made me a Python API using FastAPI.

But this code is useless…

… Without a docker environment.

So I asked chatGPT to make one:

And chatGPT delivered my Docker Environment's Dockerfile:

So in under 10 minutes, I had ChatGPT:

1. Make my research script in R.

2. Create my production script in Python for my Team

3. And create the API + Docker File to deploy it.

But when I showed my Python team, instead of excited...

...They were worried.

And I said, "Listen. There's nothing to be afraid of."

"ChatGPT is a productivity enhancer."

They didn't believe me.

My Conclusion:

You have a choice. You can rule AI.

Or, you can let AI rule you.

What do you think the better choice is?

If you want help, I'd like you to join me on a free #ChatGPT for #DataScientists Workshop on April 26th. And I will help you Rule AI.

What's the next step?

👉Register Here: us02web.zoom.us/webinar/regist…

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @mdancho84

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Nov 15

The AI Agent Development Process.

How to go from idea to production.

(A thread) 🧵

1. What is the AI Agent Development Process?

A repeatable path to ship an agent from idea to production: define → design → build → train → validate → deploy → monitor → improve.

2. Phases of the Process

Step 1: Defining Purpose

- Identify goals & users
- Determine objectives and success criteria (SLA, budget, accuracy)

Read 11 tweets

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Nov 11

K-means is one of the most powerful algorithms for data scientists.

But it's confusing for beginners. Let's fix that:

1. What is K-means?

Is a popular unsupervised machine learning algorithm used for clustering. It's a core algorithm used for customer segmentation, inventory categorization, market segmentation, and even anomaly detection.

2. Unsupervised:

K-means is an unsupervised algorithm that is used on data with no labels or predefined outcomes. The goal is not to predict a target output, but to explore the structure of the data by identifying patterns, clusters, or relationships within the dataset.

Read 11 tweets

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Nov 1

🚨NEW Whitepaper on AI Agents by OpenAI

The maker of ChatGPT shares how it builds AI Agents.

Get the 34-page white paper here:

This Whitepaper covers:

1. Building, evaluating, and deploying AI agents
2. Architectures, tool integration, and scaling
3. Agent ops and evaluation frameworks

Get it here:

I have one more thing before you go.

If you want to become a generative AI data scientist in 2025 ($200,000 career), then I'd like to help:cdn.openai.com/business-guide…

🚨WANT TO BECOME A GENERATIVE AI DATA SCIENTIST IN 2025 ($200,000 career)?

Discover how I built an AI Customer Segmentation Agent with Python:

- Scikit Learn
- K-Means
- LangChain
- LangGraph
- OpenAI

👉Register here (500 seats): learn.business-science.io/ai-register

Read 5 tweets

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Oct 26

This is wild.

A new paper shows how you can predict real purchase intent without asking people.

~90% of human test–retest reliability.

Here's what's inside the 28 page paper:

1. Problem with direct Likert from LLMs:

When you ask LLMs to output 1–5 ratings directly, the distributions are too narrow/skewed and don’t look like human survey data, limiting usefulness for concept testing.

2. Proposed fix — Semantic Similarity Rating (SSR):

Have the LLM write a short free-text purchase-intent statement, then map that text onto a 5-point Likert score using embedding cosine similarity to predefined anchor sentences (i.e., semantic matching instead of raw numbers).

Read 9 tweets

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Oct 22

How to build AI agents:

A great cheat sheet (bookmark for later).

Here's how to use it:

1️⃣ System Prompt: Define your agent’s role, capabilities, and boundaries. This gives your agent the necessary context.

2️⃣ LLM (Large Language Model): Choose the engine. GPT-5, Claude, Mistral, or an open-source model — pick based on reasoning needs, latency, and cost.

3️⃣ Tools - Equip your agent with tools: API access, code interpreters, database queries, web search, etc. More tools = more utility. Max 20.

4️⃣ Orchestration: Use frameworks (like LangChain, AutoGen, CrewAI) to manage reasoning, task decomposition, and multi-agent collaboration.

Read 7 tweets

🔥 Matt Dancho (Business Science) 🔥

@mdancho84

Oct 20

Understanding P-Values is essential for improving regression models.

In 2 minutes, I'll crush your confusion.

1. The p-value:

A p-value in statistics is a measure used to assess the strength of the evidence against a null hypothesis.

2. Null Hypothesis (H₀):

The null hypothesis is the default position that there is no relationship between two measured phenomena or no association among groups. For example, under H₀, the regressor does not affect the outcome.

Read 15 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

🔥 Matt Dancho (Business Science) 🔥

Try unrolling a thread yourself!

More from @mdancho84

🔥 Matt Dancho (Business Science) 🔥

🔥 Matt Dancho (Business Science) 🔥

🔥 Matt Dancho (Business Science) 🔥

🔥 Matt Dancho (Business Science) 🔥

🔥 Matt Dancho (Business Science) 🔥

🔥 Matt Dancho (Business Science) 🔥

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!