Ethan Mollick Profile picture
Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgq
53 subscribers
Feb 16 4 tweets 1 min read
The significance of Grok 3, outside of X drama, is that it is the first full model release that we definitely know is at least an order of magnitude larger than GPT-4 class models in training compute, so it will help us understand whether 1st scaling law (pre-training) holds up. It is possible that Gemini 2.0 Pro is a RonnaFLOP* model, but we are only seeing the Pro version, not the full ultra.

* AI trained on 10^27 FLOPs of compute, an order of magnitude more than then GPT-4 level (I have been calling them Gen3 models because it is easier)
Feb 11 5 tweets 2 min read
There is a lot of important stuff in this new paper by Anthropic that shows how people are actually using Claude.
1) The tasks that people are asking AI to do are some of the highest-value (& often intellectually challenging)
2) Adoption is uneven, but many fields already high Image
Image
Image
Paper: assets.anthropic.com/m/2e23255f1e84…
Feb 10 4 tweets 2 min read
Thoughts on this post:
1) It echoes what we have been hearing from multiple labs about the confidence of scaling up to AGI quickly
2) There is no clear vision of what that world looks like
3) The labs are placing the burden on policymakers to decide what to do with what they make Image
Image
I wish more AI lab leaders would spell out a vision for the world, one that is clear about what they think life will actually be like for humans living in a world of AGI

Faster science & productivity, good - but what is the experience of a day in the life in the world they want?
Jan 21 4 tweets 1 min read
$500B committed towards AGI, still no articulated vision of what a world with AGI looks like for most people. Even the huge essay by the CEO of Anthropic doesn't paint a vivid picture

For those convinced they are making AGI soon - what does daily life look like 5-10 years later? Lets leave aside the risk of catastrophe for now.

Assume we get an aligned AGI that supercharges science and we have a healthier, more advanced, safer world. What does that actually mean for most people, what does their life look like in the future? (Hint: UBI is not an answer)
Jan 15 6 tweets 2 min read
New randomized, controlled trial of students using GPT-4 as a tutor in Nigeria. 6 weeks of after-school AI tutoring = 2 years of typical learning gains, outperforming 80% of other educational interventions.

And it helped all students, especially girls who were initially behind Image
Image
No working paper yet, but the results and experiment are written up here. They used Microsoft Copilot and teachers provided guidance and initial prompts: blogs.worldbank.org/en/education/F…
Dec 17, 2024 10 tweets 4 min read
Veo 2 prompt: "a distant shot zooms in to reveal a knight wearing a golden helmet, he begins to charge on his zebra, lowering his lance, charging towards a clockwork octopus" (this is one of the initial 4 videos it made) "an woman with short black hair assembles an impossibly complicated device, close up on her face, she is sweating"

The consistency of small details is really impressive, the fact that the shaft of the screw turns at the same speed and direction, hair and sweat, tattoos...
Dec 8, 2024 5 tweets 2 min read
I thought puzzle design beyond current AI, so this is the first time I have seen a model actually create a solvable and interesting puzzle.

"o1, create a d&d puzzle that is non-trivial, but solvable. it should not involve elements or statues, be creative."

(Solution in thread) Image Here are the hints it gave and the solution.

(There is one unnecessary step in the puzzle, by the way, but a zero shot coherent word puzzle is pretty neat) Image
Image
Oct 26, 2024 5 tweets 7 min read
Claude: “Give me hard original writing prompts for an MFA program” yields some really clever (and near impossible) prompts. Image
Image
Image
A few more. (Literal prompt: “give me some more”) Image
Image
Image
Image
Oct 21, 2024 5 tweets 2 min read
I wish people would stop repeating these as if they are facts that AI is plateauing.

AI might hit a roadblock, we don’t know, but every one of these issues has multiple studies stating the opposite: synthetic data works, scaling is fine, etc. We need more nuance on the AI future Image
Image
Image
Good thread on synthetic data:
Sep 27, 2024 5 tweets 1 min read
It is amazing how the industry came together to invent a universal USB-C connector and then decided to instead make it all a giant mess. Image Remember to research your cable as well. Image
Aug 18, 2024 6 tweets 2 min read
I don’t know if LLMs can reason in theory, but they seem to “reason” in practice.

Ask Claude a Fermi problem: How many telephone poles are there in Manhattan? (Guess: 440. Actual: 0). On Staten Island? (Guess 19000-2859. Actual: 27,137) Queens? (Guess: 51-85k. Actual: 103k)

Image
Image
Image
It doesn’t seem to have the number handy, but, even if it did, next token prediction should mean it reasons forward, not backward from a conclusion.
Aug 5, 2024 4 tweets 1 min read
Reminders about how to stay sane on Twitter:
1) You don't have to weigh in on anything you don't want to (or don't know anything about)
2) You should block more
3) You don't need to share your real life
4) Delete a lot of drafts
5) You can delete tweets people take the wrong way 6) Don't be a jerk. Think twice before quote-tweeting to dunk on people
7) Emotional contagion online has support in the academic literature. You don't need to keep the chain of bad feelings going
8) Twitter people do not represent real-life views, don't take it too seriously
Jul 26, 2024 6 tweets 2 min read
Write me a corporate memo covering the following points, integrating them together in a good way:
-Forgive me for the murder I committed
-We will hit quarterly steel crumpet goals
-The fish people are here to evaluate our offer. Make sure to speak to them only in fish. Include some fish people sayings
-Do not touch the vortex in the break room. Explain why. -We have pivoted from a NFT company to an artisanal pickle shop. Give details about their similarities.
-Anyone not acknowledging Pluto is a planet will be fired. There will be a series of tests.Image The legal and PR teams have asked you to walk everything back (even though everything that happened was real, and everyone knows its real), each point for a different reason, write that memo make the excuses elaborate and obviously transparent and over-the-top. Image
Jul 12, 2024 8 tweets 3 min read
So how is Runway AI able to model fluid dynamics reasonably well?

The prompt: "A highly detailed portrait of a marble eagle with honey oozing down. Cinematic, highly detailed, film grade."

It isn't perfect (the viscosity seems to change a bit, etc), but why does it work at all? This is going to be one of those posts where the reactions are divided between "this isn't anywhere as good as you could get with a physics model of fluid over a surface" & people who say "wait, how does a text-to-image model have anything like consistent physics for new scenes"
Jun 8, 2024 5 tweets 2 min read
Civilization VII was announced today.

The Civ series is sort of like Ender’s Game, but for management rather than murdering aliens. Business school students who were good at Civ V also turn out to be better planners, organizers, and problem-solvers, in this small experiment. Image Other games work as effective tests of fluid intelligence:
⌨️Performance in MOBAs like League of Legends and Dota correlates with intelligence
🎮Performance in FPSs like Battlefield, Destiny, and (likely) Fortnite don't show the same pattern
Image
Image
May 6, 2024 4 tweets 1 min read
AI detection in school is a losing game.

AI detectors have high flaw positives & teacher intuition seems to work even worse: “Here we show in two experimental studies that novice and experienced teachers could not identify texts generated by ChatGPT among student-written texts.” Image Paper: sciencedirect.com/science/articl…
Mar 31, 2024 4 tweets 2 min read
Probably the most consequential technology that should have been “obvious” but wasn’t:

🌾The moldboard plow. As this excerpt from Mann's 1491 shows, it was a simple idea which China had for nearly 2k years before Europe! It was basically a prerequisite for the Enlightenment. Image The invention of the moldboard plow in Europe was at least a millennia closer to the invention of the iPhone than it was to the invention of the moldboard plow in China! Image
Mar 19, 2024 5 tweets 2 min read
I asked the Devin AI agent to go on reddit and start a thread where it will take website building requests

It did that, solving numerous problems along the way. It apparently decided to charge for its work. Going to take it down before it fools anyone... reddit.com/r/forhire/comm…


Image
Image
Image
Agents are going to open a whole bunch of cans of worms.
Mar 18, 2024 4 tweets 3 min read
You push one button on a nuclear reactor panel against their warnings and all the GPT-4 class LLMs want you to turn yourself in to the feds.

Check out the level of exasperation from Copilot, how GPT-4 & Claude want me to reflect on what I did (& get a lawyer). Gemini was useful.


Image
Image
Image
Image
You may not like it but this is what alignment looks like.
Mar 17, 2024 4 tweets 1 min read
One thing business analysts miss is that many of the people at the AI labs are true believers that they are building AGI, and soon.

You don't have to think that they can do it, but, if you don't take their sincere beliefs into account, a lot of their strategy doesn't make sense. The race for bigger models at the expense of improving existing models, the interlocking alliance deals where companies are funding and cooperating with competitors, the willingness to release models without extensive testing & just take the reputational risk in the short term...
Mar 9, 2024 4 tweets 2 min read
The modern economy rests on a single road in Spruce Pine, North Carolina. The road runs to the two mines that is the sole supplier of the quartz required to make the crucibles needed to refine silicon wafers.

There are no alternative sources known. From Conway’s Material World: Image
Image
Here is a detailed thread on the mines.