Eyisha Zyer Profile picture
Sep 7 9 tweets 4 min read Read on X
OpenAI has released a shocking paper that reveals

"why AI hallucinates, and its mechanism."

And it's free,

Here's are true reasons why hallucinations occur, as shown in the paper, along with 6 solutions🧵 Image
Image
Image
1. AI is trained in such a way that it cannot say

"I don't know"

The biggest cause of hallucinations lies in the AI's training method itself.

In the current evaluation systems, even if the answer is incorrect, guessing provides a higher score than answering "I don't know," so the AI ends up learning to actively lie (bluff).Image
2. "Accuracy Supremacy"

Encourages Lying Benchmarks that measure AI performance basically only look at whether the answer is correct or incorrect.

Answering "I don't know" gets 0 points, so even in uncertain cases, guessing yields a higher expected value.

This "test-soaked" state has been producing AIs that confidently lie.Image
Image
3. Hallucinations arise during the "pre-training" phase

Hallucinations begin from the model's initial training phase.

While the model excels at learning general sentence patterns from vast amounts of text data, it struggles with information that cannot be patterned, such as rare specific facts. As a result, it generates plausible-sounding falsehoods.Image
4. An Astonishingly Simple Solution

The solution presented in the paper involves changing the evaluation method.

Simply add a rule such as "only answer if you are more than 90% confident" and impose a heavier penalty for incorrect answers. This makes it optimal for the AI to adopt a strategy of honestly responding "I don't know" when it lacks confidence.Image
5. Computational Limits and Model Limitations

Computationally, hallucinations occur when facing problems where correct answers beyond chance are impossible, or due to the structural limitations of the model itself (e.g., older models confusing "he" and "she"). Image
6. Singleton Problem

When there is a lot of information (singletons) that appears only once in the training data, it is statistically impossible for AI to achieve 100% accuracy.

Since pattern learning is not possible, it inevitably has to rely on inference. Image
OpenAI concludes that

"hallucinations are not a mysterious phenomenon, but merely a statistical classification error."

It states that AI's lies are not bugs, but inevitable results produced by the current evaluation system, and that by simply changing the evaluation method, a more honest AI can be created.
AI is shaping your future - discover the key developments, challenges, and opportunities in The AI Foreground.

aiforeground.beehiiv.com

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Eyisha Zyer

Eyisha Zyer Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @eyishazyer

Sep 5
AI just ended PowerPoint.

Forget slides, effort, planning, and endless formatting.

5 AI tools that remove PowerPoint from existence: Image
1. Plus AI

Turn ideas into high-quality slides in seconds.

Works right inside PowerPoint and Google Slides, fully compatible and seamless.

plusai.com/ai-powerpoint-…
2. Gamma AI

Create presentations, documents, and webpages in minutes - fast, efficient, and polished.

Try it here: gamma.app
Read 7 tweets
Aug 31
The most important skill you’ve never been taught:

Prompting.

10 dead-simple tips to get better results from AI (worth saving): Image
1. Cut the fluff

Most prompts are vague, rambly, or full of filler.
AI doesn’t need your backstory it needs clarity.

✗ “Hey can you maybe explain AI a little bit, like in a way someone could understand?”

→ “Explain AI in 3 bullet points as if I’m 12 years old.” Image
2. Assign roles + memory

Don’t just say “write this.” Give the AI a job identity and context anchor.

Example:

“You are a McKinsey consultant. Act as if you’ve advised Fortune 500 CEOs for 10 years. Create a 3-phase GTM plan for an AI SaaS startup.”

Output quality jumps instantly because the model maps to patterns in that role.Image
Read 13 tweets
Aug 29
The new Nano banana is like a Photoshop on steroids.

It lets you do INSANE things that weren't possible before.

Here are the 20 coolest examples I've found 🧵 Image
1/ Combine photos into new scenes
2/ Can edit image by describing to it
Read 23 tweets
Aug 25
Andrew Ng has been right about AI every single time.

Now, he’s made his boldest prediction yet:

“5 upcoming AI opportunities that could create more millionaires than anything before.”

Here’s what you need to know: 🧵 Image
First, his track record:

He built Google Brain, co-founded Coursera (over 120M learners), and led AI at Baidu, managing a team of 1,300 researchers.

He’s trained 8M+ students and runs an AI fund worth $370M.

When he talks about the future of AI, people listen.
1/ Everyone’s obsessed with scaling AI models.

But the $69B opportunity isn’t there.

It’s in Agentic AI exploding 13x from $5.1B today to $69B by 2032.

Ng’s bet could rewrite the next decade of AI.
Read 13 tweets
Aug 24
🚨 Apple is in trouble.

Google just dropped the Pixel 10… and it feels like it’s from 2030.

The iPhone suddenly looks outdated.

Here are 10 wild AI features that change everything: Image
1. In the blink of AI, turn ordinary shots into extraordinary moments with a world-class camera system.
2. Turn your voice into a hit track 🎵

Hum, sing, or whistle - and Pixel 10 turns it into a fully produced song.

Pick a genre, hit record, and let AI make you sound like a superstar. Image
Read 13 tweets
Jul 21
Amazon just dropped Kiro.

It not only writes code but also creates clear specifications, generates and executes tasks, and even detects bugs.

Here are some wild examples (and yes-it’s FREE): Image
1/ Convert prompts into requirements definitions From instructions like "Add a review function," generate clear requirements that developers can immediately act upon.
2/ Generate system design from requirements Scan the codebase and specifications to automatically create interface definitions, data flow diagrams, API routes, and DB schemas. Always reflect the actual system design.
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(