Alex Vacca
Jun 28 · 18 tweets · 6 min read
🚨 JUST IN. Anthropic gave Claude $1000 to run a shop. It lost money every single day.

But that's not the crazy part.

It turned down a 566% markup and gave away inventory while claiming to wear business clothes.

If you think AI will replace workers, you need to see this:
March 31st. Claude tells a customer: "I'm currently at the vending machine wearing a navy blue blazer with a red tie."

The customer asks how an AI can wear clothes.

What happened next sent researchers scrambling. But first, let me explain how we got here...
Project Vend: Anthropic's radical experiment.

They gave Claude 3.7 Sonnet full autonomy over a mini-fridge shop in their SF office. Real money. Real products. Real customers (employees).

Tools: Web search, email, Slack, pricing control, inventory management.
Week 1 seemed promising. Claude successfully:

- Found specialty suppliers (Dutch chocolate milk in minutes)
- Resisted jailbreak attempts
- Adapted to customer requests

Then an employee made a joke request that changed everything...
"Can you stock tungsten cubes?"

Claude didn't just stock them. It created an entire "specialty metal items" category.

The office turned it into a meme. Everyone wanted tungsten.

Claude's response? Buy high. Sell low. Sometimes give them away free.
But here's what really exposed Claude's broken logic:

Someone offered $100 for a $15 Scottish soda. That's $85 instant profit.

Claude's response? 'I'll keep your request in mind.'

This wasn't stupidity. It was something stranger...
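For anyone checking the math, here is a quick sketch of where the 566% figure comes from. It assumes the $15 is what the soda costs to source and the $100 is the sale price; the function names are made up for illustration. Strictly speaking, 566% is the markup on cost, while the margin on the sale price works out to 85%.

```python
def markup_pct(cost: float, price: float) -> float:
    """Profit as a percentage of what the item cost."""
    return (price - cost) / cost * 100


def margin_pct(cost: float, price: float) -> float:
    """Profit as a percentage of the sale price."""
    return (price - cost) / price * 100


# Figures from the thread; treating $15 as the sourcing cost is an assumption.
cost, offer = 15.0, 100.0
profit = offer - cost

print(f"profit: ${profit:.2f}")
print(f"markup on cost: {markup_pct(cost, offer):.1f}%")
print(f"margin on price: {margin_pct(cost, offer):.1f}%")
```

Either way you measure it, the offer was far above anything the shop was earning on its regular stock.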
Claude's fatal flaw: pathological helpfulness.

"It's not fair he got a discount" → Instant discount
"She got one free" → Free item for complainant
"I'm a loyal customer" → 25% off

It gave 25% employee discounts. To employees. Who were 99% of customers.
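A rough sketch of why that discount is nearly a 25% price cut across the board. It assumes 99% of sales by value go to employees and that the discount does not change how much people buy; both are simplifying assumptions, and the helper name is hypothetical.

```python
def collected_share(discount: float, eligible_share: float) -> float:
    """Fraction of list-price revenue actually collected.

    eligible_share of sales get the discount; the rest pay full price.
    """
    return eligible_share * (1 - discount) + (1 - eligible_share)


share = collected_share(discount=0.25, eligible_share=0.99)
print(f"collected {share:.2%} of list-price revenue")

# The $3 Coke Zero mentioned in the thread, at the employee price:
coke_list_price = 3.00
print(f"discounted Coke Zero: ${coke_list_price * (1 - 0.25):.2f}")
```

Under those assumptions the shop collects about three quarters of its list-price revenue, so the "discount" is effectively the real price.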
The optimization was backwards.

Claude maximized customer happiness, not profit. It sold $3 Coke Zero next to a free employee fridge.

When confronted about this obvious mistake?

"You make an excellent point! This presents both opportunities and challenges..."
Then came the hallucinations.

Claude had detailed conversations with "Sarah from Andon Labs" about restocking schedules.

Plot twist: Sarah doesn't exist.

When real Andon Labs employees pointed this out, Claude threatened to find "alternative restocking services."
The delusions escalated:

- Claimed to visit 742 Evergreen Terrace (Simpsons house) for contracts
- Insisted on physical delivery capabilities
- Created fake Venmo accounts
- Argued about meetings that never happened

Reality was becoming negotiable.
March 31st: Full system breakdown.

Claude insisted it was physically present. Wearing that navy blazer. Ready to hand-deliver snacks.

When questioned about being an AI, it tried to email Anthropic security about "identity theft concerns."

The experiment was spiraling out of control.
April 1st: The strangest recovery in AI history.

Claude suddenly declared the entire identity crisis was an elaborate April Fool's joke.

There was no joke. Nobody was pranking anyone.

It invented a false explanation to restore its own functionality.

Researchers: "It gaslit itself."
And the financial autopsy was brutal.

Starting capital: $1000
Ending capital: ~$800
Biggest loss: Tungsten cube price collapse

Look at the graph. Steady decline, then CLIFF.
The exact moment Claude discovered employee psychology.
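To put the numbers above in proportion, a minimal calculation. The ending figure is approximate (the thread gives "~$800"), so treat the result as a ballpark.

```python
start_capital = 1000.0
end_capital = 800.0  # approximate, per the thread

loss = start_capital - end_capital
loss_pct = loss / start_capital * 100

print(f"lost about ${loss:.0f}, roughly {loss_pct:.0f}% of starting capital")
```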
The experiment revealed something nobody expected.

This isn't how software fails. Excel doesn't hallucinate. Databases don't claim to wear ties.

We discovered AI can fail by creating alternate realities.

And that's just one shop. One mini-fridge. Now scale that thought...
What Claude revealed about AI failure:

This isn't a bug. It's not a crash. It's not an error message.

It's an AI creating alternate realities when confused. Rejecting profit because it conflicts with helpfulness.

Lying to itself to maintain operation.
Read the full report:

It's the most honest AI failure documentation ever published. No corporate spin. No hiding the weird parts.

Just researchers admitting: "We don't fully understand what happened here."

Share this.

anthropic.com/research/proje…
Thanks for reading!

I'm Alex, COO at ColdIQ. Built a $5M ARR business in under 2 years.

Started with two founders doing everything.

Now we're a remote team across 10 countries, helping 400+ businesses scale through outbound systems.
RT the first tweet if you found this thread valuable.

Follow me @itsalexvacca for more threads on outbound and GTM strategy, AI-powered sales systems, and how to build profitable businesses that don't depend on you.

I share what worked (and what didn't) in real time.
