Latest Twitter Threads by @dschwarz26 on Thread Reader App

Feb 27, 2025 • 11 tweets • 3 min read

Ok, this thread is long overdue. Now that everyone sees that GPT-4.5 is a disappointment -

How has OpenAI underperformed so much on the intelligence of their core models?

They've been faring worse and worse since the day GPT-4 came out. Buckle in for the tale: First, ChatGPT in Nov 2022 and GPT-4 in Mar 2023 were amazing, world-changing innovations. Full credit there.

Shortly after GPT-4, forecasts of GPT-5's release began. The initial view was mid 2024. It has crept up and up and up:

Sep 12, 2024 • 7 tweets • 3 min read

Another paper, the fourth in 2024, claiming near or super-human level forecasting accuracy from AI, seen by hundreds of thousands of people.

I call BS. Here's why.

(1/7)

https://x.com/DanHendrycks/status/1833152719756116154

1st, definitions matter. "Superhuman" forecasting should mean beating top Metaculus forecasters, or the overall crowd, not a random sample of them like this paper.

You wouldn’t say a coding AI was “superhuman” if it was top 50% of a coding competition. (2/7)

Apr 29, 2023 • 13 tweets • 3 min read

I bet I'm not the only one that has convos like this:

Me: LLMs are generational tech. I'm excited and terrified.

Them: You're worried about a Terminator / Kurzweil scenario?

Me: A bit. I'm more worried about chaos in the next 2-5 years.

Them: What exactly do you mean? It's a good question, what am I worried about? Well, let me share a scenario.

It's April 2026. I wake up in the morning and check Hacker News. "Hundreds of Starlink satellites burn up in the atmosphere."

Share this page!

Enter URL or ID to Unroll