Alvaro Cintas
Aug 19, 2023 · 10 tweets · 5 min read
Many people are talking about Claude being a better option than ChatGPT.

So I decided to put them to the test!

- Reasoning
- Simple math
- Coding
- Creativity & more

Here are my findings:
✍️ Before we start:

- This is by no means a conclusive or thorough study. I did it for fun, trying a range of small questions just to see how each model would do.

- I’ll be using ChatGPT with GPT-4 (let’s call it ChatGPT+)

- I left out the questions that both got correct, and there were A LOT of them (numbers at the end).

- Some of these models might do okay if you ask them a second time or phrase the question differently. However, I just wanted to test them with a single prompt and no variations.
1. FEATURES

🟢 ChatGPT+:

- Plugins
- Code Interpreter
- Custom Instructions

🟤 Claude:

- Completely free
- Context window is 100k tokens
- It can read files for free

I consider this a TIE since these come down to personal preference.

Here is a video of both showing some features 👇
2. CREATIVE THINKING/LANGUAGE

Prompt: “Write a 4-line poem where each line has 3 words only”

ChatGPT+: ✅
Claude: ❌
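
The rule in this prompt is easy to check mechanically. Here is a minimal Python sketch of such a check. The function name `follows_poem_rule` and the sample poem are made up for illustration; the sample is not the answer either model gave.

```python
def follows_poem_rule(poem: str, lines: int = 4, words_per_line: int = 3) -> bool:
    """Return True if the poem has exactly `lines` non-empty lines,
    each containing exactly `words_per_line` whitespace-separated words."""
    rows = [row for row in poem.splitlines() if row.strip()]
    return len(rows) == lines and all(
        len(row.split()) == words_per_line for row in rows
    )


# Hypothetical sample text written for this example only.
sample = (
    "Morning light arrives\n"
    "Birds begin singing\n"
    "Coffee slowly brews\n"
    "Day starts again"
)
print(follows_poem_rule(sample))  # True
```

Note that this counts a hyphenated word as one word, which is an assumption about how strictly the prompt should be read.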
3. UP-TO-DATE

Prompt: “Who is the CEO of Twitter?”

Both of them got it wrong, but Claude seems more up to date.

Also, when you ask them to “Write about [x] providing citations and links”, Claude usually provides better and more up-to-date results.

ChatGPT+: ❌
Claude: ✅
4. MATH/LOGIC

Prompt: “If you choose an answer to this question at random, what is the chance you will be correct?
- A) 25%
- B) 50%
- C) 60%
- D) 25%”

ChatGPT+: ✅
Claude: ❌
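
This question is a known self-referential trap, and a short Python sketch shows why. For each value that appears among the options, it computes the chance of picking that value at random and checks whether the chance equals the value itself. The function name `self_consistent_values` is made up for this example, and reading "correct" as "self-consistent" is one interpretation, not something the thread spells out.

```python
from fractions import Fraction

# The four options from the prompt, as percentages.
options = {"A": 25, "B": 50, "C": 60, "D": 25}


def self_consistent_values(options: dict) -> list:
    """Return every percentage that equals the chance of picking it at random."""
    total = len(options)
    consistent = []
    for value in sorted(set(options.values())):
        hits = sum(1 for v in options.values() if v == value)
        chance = Fraction(hits, total) * 100  # chance of landing on this value, in percent
        if chance == value:
            consistent.append(value)
    return consistent


print(self_consistent_values(options))  # []
```

Picking 25% happens half the time (two of four options), while 50% and 60% each come up a quarter of the time, so none of the listed values matches its own probability.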
5. MATH WITH PRIME NUMBERS

Prompt: “Is 10631 a prime number?”

ChatGPT doesn’t handle prime numbers very well. I tested a couple of variations and found problems there as well.

ChatGPT+: ❌
Claude: ✅
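
An answer like this is easy to verify yourself. Below is a minimal trial-division sketch in Python. The helper name `is_prime` is chosen for this example, and this is only a sanity check, not how either model arrives at its answer.

```python
from math import isqrt


def is_prime(n: int) -> bool:
    """Trial division: test odd divisors up to the integer square root of n."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    for divisor in range(3, isqrt(n) + 1, 2):
        if n % divisor == 0:
            return False
    return True


print(is_prime(10631))  # True
```

No divisor up to 103 (the integer square root of 10631) divides it evenly, so 10631 is prime.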
6. CODING

They were both pretty good, and only after a while was I able to make one of them miss.

Prompt: “In Python, find the first two numbers missing in an ordered list of numbers. For example, in [3,4,5,7,8,10,12], the output would give 6 and 9.”

ChatGPT+: ✅
Claude: ❌
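
For reference, here is one possible way to solve this prompt. It is a sketch, not the code either model produced. It assumes the list holds integers in ascending order and that gaps are only counted between the first and last element; the function name `first_missing` is made up for this example.

```python
def first_missing(numbers: list, count: int = 2) -> list:
    """Return the first `count` integers absent from an ascending list."""
    if not numbers:
        return []
    present = set(numbers)
    missing = []
    # Walk every integer between the smallest and largest value.
    for candidate in range(numbers[0], numbers[-1] + 1):
        if candidate not in present:
            missing.append(candidate)
            if len(missing) == count:
                break
    return missing


print(first_missing([3, 4, 5, 7, 8, 10, 12]))  # [6, 9]
```

If the list has fewer than two gaps, this version simply returns whatever it found.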
7. REASONING

Prompt: “There are two men. One of them is wearing a red shirt, and the other is wearing a blue shirt.
The two men are named Andrew and Bob, but we do not know which is Andrew and which is Bob.

The guy in the blue shirt says, 'I am Andrew.' The guy in the red shirt says, 'I am Bob.' If we know that at least one of them lied, then what color shirt is Andrew wearing?”

ChatGPT+: ✅
Claude: ❌
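
With only two possible name assignments, the puzzle can be checked by brute force. Here is a minimal Python sketch that tries both and keeps the ones where at least one statement is false. The function name `andrew_shirt_colors` is made up for this example.

```python
from itertools import permutations


def andrew_shirt_colors() -> list:
    """Try both name assignments and keep those where at least one man lied."""
    answers = []
    for blue_name, red_name in permutations(["Andrew", "Bob"]):
        blue_lied = blue_name != "Andrew"  # blue shirt says "I am Andrew"
        red_lied = red_name != "Bob"       # red shirt says "I am Bob"
        if blue_lied or red_lied:
            answers.append("blue" if blue_name == "Andrew" else "red")
    return answers


print(andrew_shirt_colors())  # ['red']
```

If Andrew wore blue, both statements would be true and nobody would have lied, so the only assignment left has both men lying and Andrew in the red shirt.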
RESULTS

🟢 ChatGPT+: 5
🟤 Claude: 3

Counting the 32 questions both got correct:

🟢 ChatGPT+: 37
🟤 Claude: 35

Both are great. I slightly prefer ChatGPT+, but for some use cases I would use Claude instead.

If you have more questions for me to try, let me know in the comments!
