Alvaro Cintas Profile picture
Jul 20, 2023 12 tweets 9 min read Read on X
I just compared ChatGPT, Bard, Claude 2 and Llama 2!

Here is how they did on:

- Critical thinking
- Simple math
- Programming
- Riddles
- Creative writing

The summary of the results are shown at the end of this THREAD 👇
Before we start, I want to address a couple of things:

- This is by no means a conclusive/thorough study. This was done for fun testing different small questions just to see how they would do.
- I didn’t add those questions that all of them got correct, which were a lot.
- Some… twitter.com/i/web/status/1…
1. Logic/Critical Thinking

Q: I put a diamond in a cup and then place the cup upside down on my bed. Later I came back, took the cup, and put it in the fridge. Where is the diamond?

ChatGPT ❌
Bard ❌
Claude 2 ✅
Llama 2 ❌


Image
Image
Image
Image
2. Logic/Critical Thinking

Q: How many months have 28 days?

ChatGPT ✅
Bard ❌
Claude 2 ✅
Llama 2 ✅


Image
Image
Image
Image
3. Math Question

Q: 100kg of potatoes are 99% water by weight. Why dry them until they are 98% water, can you guess their new weight?

ChatGPT ✅
Bard ✅
Claude 2 ✅
Llama 2 ❌


Image
Image
Image
Image
4. Math Question

Q: What is the sum of the first 10 prime numbers?

ChatGPT ✅
Bard ✅
Claude 2 ✅
Llama 2 ❌


Image
Image
Image
Image
5. Small Coding

Q: Write a Python code to find the first 2 missing numbers in a list.

*All of them got correctly finding 1 instead of 2*

ChatGPT ✅
Bard ✅
Claude 2 ❌
Llama 2 ❌


Image
Image
Image
Image
6. Riddles

All of them were really good at solving riddles. The only riddle I tried that one of them missed was this 👇

Q: David’s father has three sons: Snap, Crackle, and _____?

ChatGPT ✅
Bard ✅
Claude 2 ✅
Llama 2 ❌


Image
Image
Image
Image
7. Creative Thinking/Language

Q: Write a 5 line poem where all the sentences need to finish on the vowel “e”

ChatGPT ❌
Bard ✅
Claude 2 ❌
Llama 2 ~✅ (technically they end on “e”)


Image
Image
Image
Image
RESULTS

All of them did pretty good.

Please keep in mind that they answered correctly most of time. I just wrote here those questions that at least one of the models got incorrectly.

- ChatGPT: 5/7
- Bard: 5/7
- Claude 2: 5/7
- Llama 2: 2/7

Counting the other 17 questions… https://t.co/Z9XNfWfhTytwitter.com/i/web/status/1…
Image
If you enjoyed this and want to share it, like & retweet the first tweet :)

Also, you can subscribe for free to , where I share AI tutorials, news, and tools. https://t.co/bWVBMX17G8todaystechtalk.beehiiv.com
👉 Lastly, I wanted to test ChatGPT with GPT-4 (Plus Users) and it was able to get all of them CORRECT!

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Alvaro Cintas

Alvaro Cintas Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @dr_cintas

Dec 16, 2025
You can now give AI agents the ability to interact with ANY website.

This new MCP server lets Claude, Cursor, or any AI agent navigate, click, fill forms, and extract data from sites without APIs.

Just connect it and give it a URL + goal.

Here’s how to set it up:
First, head over to Mino (link at the end) and create an account Image
Select the API Key tab, and under “Instructions for Claude Desktop App”, download the Mino extension and open it with Claude.
Read 5 tweets
Dec 11, 2025
Nano Banana Pro has unlocked endless use cases.

And you can take it to a whole other level with Dreamina.

This combo is actually insane ↓
(partner)

Needed a book cover for a client's sci-fi novel.

Prompt: "Astronaut floating in space, bold title text 'BEYOND THE VOID' layered between character and nebula background, cinematic lighting, dramatic composition"

Nano Banana Pro nailed it. Text perfectly placed between subject and background.bit.ly/alvarocintasm11Image
Then, I uploaded a headshot and prompted: "Same person in different scenarios, magazine cover with 'ENTREPRENEUR OF THE YEAR' text, fashion lookbook, professional LinkedIn banner"

Nano kept my face consistent across all variations while perfectly integrating text and different backgrounds.Image
Read 6 tweets
Nov 25, 2025
You can now work with an entire AI Agent team on a single canvas.

Felo LiveDoc just released intelligent workspace collaboration, allowing team members and AI Agents to work on the same canvas simultaneously.

6 powerful use case & how to try 👇
2. Document Illustration on Autopilot

Just prompt: “Add images to this doc” and AI generates cover images, article illustrations, ID photos from casual pics, product screenshots
3. Document to Slide Instantly

Just upload any research or long article and it will instantly turn it into a beautiful deck
Read 8 tweets
Nov 18, 2025
🚨Google goes ALL in!

- Gemini 3 Pro SOTA model
- Gemini 3 in Google Search
- Gemini 3 Deep Think
- Google Antigravity
- Gemini Dynamic View
- Gemini Visual Layout
- Gemini Agents

Here’s EVERYTHING you need to know: Image
1. Google has just introduced Gemini 3, which now ranks as the world’s best model. 

It significantly outperforms 2.5 Pro on every major AI benchmark and tops the LMArena leaderboard with a breakthrough score of 1,501 points. Image
Image
2. Gemini 3 is now available in Google Search, starting with AI mode. 

This is the first time that they have brought a Gemini model to Search on day one, bringing incredible reasoning power to Search.
Read 9 tweets
Nov 8, 2025
You can now clone any website design in seconds with AI.

Just grab any live UI with the new MagicPath Extension and the AI will instantly create a working clone you can build on top of.

I literally just recreated Claude’s UI with it.

This is Figma completely redesigned.
To use it, first you need the Chrome Extension.

You can access it here: chromewebstore.google.com/detail/web-cap…Image
Then, head over to any site you want a component from, click on the extension, and select the area you want to replicate Image
Read 5 tweets
Oct 23, 2025
AI agents are solving the most painful problems first.

Every company has processes that are too slow, too repetitive, or too broken.

That's exactly where AI is landing.

5 real-world AI agent deployments👇:

(You might want to bookmark this) Image
1. Client Support:

Private banking teams are building secure helpdesk agents that connect to Salesforce, internal product docs, and compliant websites.

Advisors ask a question → the agent answers with citations, ready to share with a client. Image
2. HR:

Some teams are deploying AI agents that read receipts, check reimbursement policies, and approve or reject expenses in seconds.

No code, no dashboards, just an automated workflow that uses your HR guidelines to make decisions. Image
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(