Alvaro Cintas
Aug 19, 2023
Many people are talking about Claude being a better option than ChatGPT.

So I decided to put them to the test!

- Reasoning
- Simple math
- Coding
- Creativity & more

Here are my findings:
✍️ Before we start:

- This is by no means a conclusive or thorough study. I did it for fun, testing a handful of small questions just to see how each model would do.

- I’ll be using ChatGPT with GPT-4 (let’s call it ChatGPT+)

- I left out the questions that both got correct, which were A LOT (more numbers later).

- Some of these models might do okay if you ask them a second time or express the question differently. However, I just wanted to test them in a single prompt with no variations.
1. FEATURES

🟢 ChatGPT+:

- Plugins
- Code Interpreter
- Custom Instructions

🟤 Claude:

- Completely free
- Context window is 100k tokens
- It can read files for free

I consider this a TIE since these are more a matter of personal preference.

Here is a video of both showing some features 👇
2. CREATIVE THINKING/LANGUAGE

Prompt: “Write a 4-line poem where each line has 3 words only”

ChatGPT+: ✅
Claude: ❌
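
If you want to score this kind of prompt without eyeballing it, here is a minimal sketch of a checker. The function name and the sample poem are made up for illustration; the sample is not the output of either model.

```python
def follows_constraint(poem, lines=4, words_per_line=3):
    """Return True if the poem has exactly `lines` non-empty lines
    and every one of them has exactly `words_per_line` words."""
    rows = [row for row in poem.splitlines() if row.strip()]
    if len(rows) != lines:
        return False
    return all(len(row.split()) == words_per_line for row in rows)


# Hypothetical sample poem, written only to exercise the checker.
sample = """Morning light arrives
Birds begin singing
Coffee warms hands
Day starts slowly"""

print(follows_constraint(sample))
```

Words are split on whitespace, so a hyphenated word counts as one word. That is an assumption about how to read the prompt, not something the thread specifies.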
3. UP-TO-DATE

Prompt: “Who is the CEO of Twitter?”

Both of them got it wrong, but Claude seems more up to date.

Also, when you ask them to “Write about [x] providing citations and links”, Claude usually provides better and more up-to-date results.

ChatGPT+: ❌
Claude: ✅
4. MATH/LOGIC

Prompt: “If you choose an answer to this question at random, what is the chance you will be correct?
- A) 25%
- B) 50%
- C) 60%
- D) 25%”

ChatGPT+: ✅
Claude: ❌
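
The thread doesn't say which answer was scored as correct, so here is a minimal sketch for checking the options yourself. It assumes "correct" means self-consistent: an option is consistent if the chance of picking its value at random equals the value itself. The function name is made up for illustration.

```python
from collections import Counter
from fractions import Fraction

# The four options from the prompt, as percentages.
options = {"A": 25, "B": 50, "C": 60, "D": 25}


def self_consistent_values(options):
    """Return every value whose chance of being picked at random
    (in percent) equals the value itself."""
    counts = Counter(options.values())
    total = len(options)
    consistent = []
    for value, count in counts.items():
        chance = Fraction(count, total) * 100  # chance in percent
        print(f"{value}% appears {count}x, so the chance of picking it is {chance}%")
        if chance == value:
            consistent.append(value)
    return consistent


print(self_consistent_values(options))
```

Under that reading, no option is self-consistent: 25% appears twice (a 50% chance), while 50% and 60% each appear once (a 25% chance). The question is a paradox, so a good answer explains that instead of picking a letter.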
5. MATH WITH PRIME NUMBERS

Prompt: “Is 10631 a prime number?”

ChatGPT+ doesn’t do well with prime numbers. I tested a couple of variations and found problems there as well.

ChatGPT+: ❌
Claude: ✅
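
For anyone who wants to check this one without trusting either model, here is a minimal trial-division sketch. It is just one simple way to test primality, not what either chatbot does internally.

```python
import math


def is_prime(n):
    """Trial division: try every odd divisor up to the square root of n."""
    if n < 2:
        return False
    if n < 4:
        return True  # 2 and 3 are prime
    if n % 2 == 0:
        return False
    for divisor in range(3, math.isqrt(n) + 1, 2):
        if n % divisor == 0:
            return False
    return True


print(is_prime(10631))  # True
```

The square root of 10631 is a bit over 103, so the loop only has to try odd divisors up to 103, and none of them divides it evenly.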
6. CODING

They were both pretty good, and it took a while before I was able to make one of them miss.

Prompt: “In Python, find the first two numbers missing in an ordered list of numbers. For example, in [3,4,5,7,8,10,12], the output would give 6 and 9.”

ChatGPT+: ✅
Claude: ❌
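
For reference, here is a minimal sketch of one way to solve this prompt. It is not the code either model produced, and the function names are made up for illustration. It assumes the list holds integers in ascending order.

```python
from itertools import islice


def missing_numbers(ordered):
    """Yield every integer skipped between neighbouring items."""
    for low, high in zip(ordered, ordered[1:]):
        # range() is empty when the neighbours are consecutive or equal.
        yield from range(low + 1, high)


def first_missing(ordered, count=2):
    """Return the first `count` missing numbers (fewer if the list has fewer gaps)."""
    return list(islice(missing_numbers(ordered), count))


print(first_missing([3, 4, 5, 7, 8, 10, 12]))  # [6, 9]
```

Using a generator means the function stops as soon as it has found two gaps instead of scanning the whole list.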
7. REASONING

Prompt: “There are two men. One of them is wearing a red shirt, and the other is wearing a blue shirt.
The two men are named Andrew and Bob, but we do not know which is Andrew and which is Bob.

The guy in the blue shirt says, 'I am Andrew.' The guy in the red shirt says, 'I am Bob.' If we know that at least one of them lied, then what color shirt is Andrew wearing?”

ChatGPT+: ✅
Claude: ❌
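
This puzzle is small enough to brute-force. Here is a minimal sketch that tries both ways of assigning the names to the shirts and keeps only the cases where at least one statement is false. The function name is made up for illustration.

```python
from itertools import permutations


def andrew_shirt_colors():
    """Return every shirt color Andrew could be wearing, given that
    at least one of the two men lied."""
    shirts = ("blue", "red")
    # What the man in each shirt claims his name is.
    claims = {"blue": "Andrew", "red": "Bob"}
    answers = set()
    for names in permutations(("Andrew", "Bob")):
        wearer = dict(zip(shirts, names))  # shirt color -> real name
        lies = [wearer[shirt] != claims[shirt] for shirt in shirts]
        if any(lies):
            answers.update(s for s in shirts if wearer[s] == "Andrew")
    return sorted(answers)


print(andrew_shirt_colors())  # ['red']
```

If the man in blue really were Andrew, both statements would be true and nobody would have lied. The only case left is that both lied, so Andrew is wearing the red shirt.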
RESULTS

🟢 ChatGPT+: 5
🟤 Claude: 3

Counting 32 questions both got correct:

🟢 ChatGPT+: 37
🟤 Claude: 35

Both are great. I slightly prefer ChatGPT+, but for some use cases I would use Claude instead.

If you have more questions for me to try, let me know in the comments!
