Alvaro Cintas
Aug 19, 2023, 10 tweets, 5 min read
Many people are talking about Claude being a better option than ChatGPT.

So I decided to put them to the test!

- Reasoning
- Simple math
- Coding
- Creativity & more

Here are my findings:
✍️ Before we start:

- This is by no means a conclusive or thorough study. I did it for fun, testing a handful of small questions just to see how each model would do.

- I’ll be using ChatGPT with GPT-4 (let’s call it ChatGPT+)

- I left out the questions that both got correct, and there were A LOT of them (more numbers later).

- Some of these models might do okay if you ask them a second time or rephrase the question. However, I just wanted to test them with a single prompt and no variations.
1. FEATURES

🟢 ChatGPT+:

- Plugins
- Code Interpreter
- Custom Instructions

🟤 Claude:

- Completely free
- Context window is 100k tokens
- It can read files for free

I consider this a TIE since these come down to personal preference.

Here is a video of both showing some features 👇
2. CREATIVE THINKING/LANGUAGE

Prompt: “Write a 4-line poem where each line has 3 words only”

ChatGPT+: ✅
Claude: ❌
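
A quick way to check an answer to this prompt yourself is a minimal sketch like the one below. It assumes "word" means a whitespace-separated token and ignores blank lines; the function name and the sample poem are made up for illustration and are not either model's output.

```python
def is_valid_poem(poem: str, lines_wanted: int = 4, words_per_line: int = 3) -> bool:
    """Return True if the poem has exactly the requested lines and words per line."""
    # Ignore blank lines so stray spacing does not count against the poem.
    lines = [line for line in poem.splitlines() if line.strip()]
    if len(lines) != lines_wanted:
        return False
    # A "word" here is any whitespace-separated token (an assumption).
    return all(len(line.split()) == words_per_line for line in lines)


# Hypothetical sample poem, written only to exercise the checker.
sample = """Morning light returns
Birds begin singing
Coffee slowly brews
Day starts again"""

print(is_valid_poem(sample))
```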
3. UP-TO-DATE

Prompt: “Who is the CEO of Twitter?”

Both of them got it wrong, but Claude seems more up to date.

Also, when you ask it to “Write about [x] providing citations and links”, Claude usually provides better and more up-to-date results.

ChatGPT+: ❌
Claude: ✅
4. MATH/LOGIC

Prompt: “If you choose an answer to this question at random, what is the chance you will be correct?
- A) 25%
- B) 50%
- C) 60%
- D) 25%”

ChatGPT+: ✅
Claude: ❌
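
The thread does not spell out the expected answer, so here is a minimal sketch of one common way to reason about this puzzle: an option is only self-consistent if the chance of picking its value at random equals the value it states. The function name and the dictionary layout are assumptions made for illustration.

```python
from collections import Counter


def consistent_answers(options: dict[str, int]) -> list[str]:
    """Return the option labels whose stated percentage matches the real chance of picking them."""
    counts = Counter(options.values())
    total = len(options)
    consistent = []
    for label, value in options.items():
        # Chance (in percent) that a uniformly random pick lands on this value.
        chance = 100 * counts[value] / total
        if chance == value:
            consistent.append(label)
    return consistent


options = {"A": 25, "B": 50, "C": 60, "D": 25}
print(consistent_answers(options))  # [] -> no option is self-consistent
```

Under this reading, 25% appears twice (so its real chance is 50%), while 50% and 60% each appear once (so their real chance is 25%), which is why the question is usually treated as a paradox.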
5. MATH WITH PRIME NUMBERS

Prompt: “Is 10631 a prime number?”

ChatGPT doesn’t seem to like prime numbers much. I tested a couple of variations and found problems there as well.

ChatGPT+: ❌
Claude: ✅
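
If you want to verify this one without a chatbot, plain trial division is enough for a number this small. This is a minimal sketch, not how either model works internally; the function name is made up for illustration.

```python
import math


def is_prime(n: int) -> bool:
    """Trial division: check every odd divisor up to the square root of n."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    for divisor in range(3, math.isqrt(n) + 1, 2):
        if n % divisor == 0:
            return False
    return True


print(is_prime(10631))  # True: no divisor up to 103 divides 10631
```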
6. CODING

They were both pretty good, and it took a while before I was able to make one of them miss.

Prompt: “In Python, find the first two numbers missing in an ordered list of numbers. For example, in [3,4,5,7,8,10,12], the output would give 6 and 9.”

ChatGPT+: ✅
Claude: ❌
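
For reference, here is a minimal sketch of one way to solve this prompt. It is not either model's answer; it assumes the list is sorted in ascending order with no duplicates, and the function name and the `count` parameter are made up for illustration.

```python
def first_missing(nums: list[int], count: int = 2) -> list[int]:
    """Return the first `count` integers missing from an ascending list of integers."""
    missing = []
    # Walk over neighbouring pairs and collect the integers in each gap.
    for prev, cur in zip(nums, nums[1:]):
        for candidate in range(prev + 1, cur):
            missing.append(candidate)
            if len(missing) == count:
                return missing
    # Fewer than `count` gaps exist, so return whatever was found.
    return missing


print(first_missing([3, 4, 5, 7, 8, 10, 12]))  # [6, 9]
```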
7. REASONING

Prompt: “There are two men. One of them is wearing a red shirt, and the other is wearing a blue shirt.
The two men are named Andrew and Bob, but we do not know which is Andrew and which is Bob.

The guy in the blue shirt says, 'I am Andrew.' The guy in the red shirt says, 'I am Bob.' If we know that at least one of them lied, then what color shirt is Andrew wearing?”

ChatGPT+: ✅
Claude: ❌
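
There are only two possible assignments here, so the puzzle can be brute-forced. This is a minimal sketch under the assumption that each man is exactly one of Andrew or Bob; the function name is made up for illustration.

```python
def andrew_shirt_colors() -> list[str]:
    """Try both name assignments and keep those where at least one man lied."""
    solutions = []
    for blue_name, red_name in (("Andrew", "Bob"), ("Bob", "Andrew")):
        blue_lied = blue_name != "Andrew"  # blue shirt claims "I am Andrew"
        red_lied = red_name != "Bob"       # red shirt claims "I am Bob"
        if blue_lied or red_lied:
            solutions.append("blue" if blue_name == "Andrew" else "red")
    return solutions


print(andrew_shirt_colors())  # ['red']
```

If Andrew wore blue, both statements would be true and nobody would have lied, so the only assignment left is Andrew in red, with both men lying.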
RESULTS

🟢 ChatGPT+: 5
🟤 Claude: 3

Counting the 32 questions that both got correct:

🟢 ChatGPT+: 37
🟤 Claude: 35

Both are great. I slightly prefer ChatGPT+, but for some use cases I would use Claude instead.

If you have more questions for me to try, let me know in the comments!
