Latest Twitter Threads by @littmath on Thread Reader App

Nov 30 • 10 tweets • 3 min read

🧵I want to tell the story of an old (40+ years) question I resolved with a collaborator last year. Much discussion of the impact of AI on math research elides (I think unintentionally) the distinction between resolving contest problems, throwaway questions in a paper or conference proceedings that no one has really considered, and questions whose answers would really advance the field. My hope is that this story will help give some sense of this distinction. 1/n

I hope you’ll forgive my immodesty here (though read to the end of the thread) — if it’s some excuse, my fantastic collaborator Aaron Landesman is on the job market this year. 2/n

Jul 21 • 8 tweets • 2 min read

One piece of info that seems important to me in terms of forecasting usefulness of new AI models for mathematics: did the gold-medal-winning models, which did not solve IMO problem 6, submit incorrect answers for it? 🧵 Why this is important: it is very hard/time-intensive to check correctness of English-language mathematical proofs. (See below for a thread on this topic.)

https://x.com/littmath/status/1874559283859501567

Jul 20 • 8 tweets • 1 min read

An AI tool that gets gold on the IMO is obviously immensely impressive. Does it mean math is “solved”? Is an AI-generated proof of the Riemann hypothesis clearly on the horizon? Obviously not. Worth keeping timescales in mind here: IMO competitors spend an average of 1.5 hrs on each problem. High-quality math research, by contrast, takes month or years.

Apr 17 • 33 tweets • 5 min read

In this thread I'll record some brief impressions from trying to use o3/o4-mini (the new OpenAI models) for mathematical tasks. Before I start let me say a bit about how I think about this. I try to think about how useful a product is for math research in terms of its marginal value over Googling (MVoG). This is sort of a complicated metric.

Mar 8 • 23 tweets • 5 min read

In this thread I want to share some thoughts about the FrontierMath benchmark, on which, according to OpenAI, some frontier models are scoring ~20%. This is benchmark consisting of difficult math problems with numerical answers. What does it measure, and what doesn't it measure?

I'll try to organize my thoughts around this problem. To be clear I don't intend any of this as criticism of the benchmark or of Epoch AI, which I think is doing fantastic work. Please understand anything you'd read as criticism as aimed at the problem's author, namely me.

Feb 18 • 13 tweets • 3 min read

Had a sort of a funny experience with OpenAI’s Deep Research tool, which I wanted to share since I think it reveals some of the tool’s strengths and weaknesses. .@srivatsamath recently suggested to me that (as a result of the Internet democratizing access to advanced math), there’s been an increase in important math research done by young people. I was curious if this is true.

Feb 4 • 15 tweets • 3 min read

Some very brief first impressions from my attempts to use OpenAI's new Deep Research project to do mathematics. I'm very grateful to the person at OpenAI who gave me access. Some short caveats: I'm just trying to evaluate the product as it currently stands, not the (obviously very rapid) pace of progress. This thread is not an attempt to forecast anything. And of course it's possible I am not using it in an ideal way.

Feb 1 • 16 tweets • 3 min read

Some brief impressions from playing a bit with o3-mini-high (the new reasoning model released by OpenAI today) for mathematical uses. First of all, it’s clearly a significant improvement over o1. It immediately solved (non-rigorously) some arithmetic geometry problems with numerical answers that I posed to it, which no other models have been able to solve. I consider these problems pretty tricky.

Jan 24 • 23 tweets • 6 min read

I want to explain in down-to-earth terms what this paper is about, since it ultimately boils down to what I think are some really concrete and fundamental questions. 1/n

https://twitter.com/mathAGb/status/1882657278710665216

If you ask a middle-schooler, they might tell you that mathematics is about "solving equations." What does this mean in practice for modern mathematicians, though? Often one abstractly proves a solution with some nice property exists, studies the set of solutions, etc. 2/n

Jan 1 • 16 tweets • 3 min read

I've recently been talking a bit about how difficult it is to carefully check even well-written mathematics. I want to try to explain something about this by telling the story of some errors in the literature that (in part) led to the two papers below. 1/n

These papers began with an attempt to prove an open conjecture in surface topology—the Putman-Wieland conjecture. At some point in mid-2021 Aaron and I were pretty convinced we had a proof, and began writing up the details. 2/n

Dec 22, 2024 • 16 tweets • 3 min read

A couple more brief thoughts on o3’s (incredible) performance on FrontierMath.

First of all, everything I’m saying is based on the 5 publicly shared problems. I don’t know what the rest of the benchmark looks like, but based on these it’s clearly hard.

Mar 17, 2024 • 17 tweets • 4 min read

The correct answer is “Bob.” Congrats to the 10% who got it right — those few brave dreamers.

https://x.com/s1lknoose/status/1769177202816434655?s=46&t=41cpGRZavPniB0OfiIMsDA

Dec 30, 2023 • 8 tweets • 2 min read

Here are the numbers from 10 to 100 in alphabetical order:

18, 80, 88, 85, 84, 89, 81, 87, 86, 83, 82, 11, 15, 50, 58, 55, 54, 59, 51, 57, 56, 53, 52, 40, 48, 45, 44, 49, 41, 47, 46, 43, 42, 14, 19, 90, 98, 95, 94, 99, 91, 97, 96, 93, 92, 17, 70, 78, 75, 74, 79, 71, 77, 76, 73, 72, 16, 60, 68, 65, 64, 69, 61, 67, 66, 63, 62, 10, 13, 30, 38, 35, 34, 39, 31, 37, 36, 33, 32, 12, 20, 28, 25, 24, 29, 21, 27, 26, 23, 22.

Nov 30, 2023 • 5 tweets • 1 min read

OK, let me explain precisely what this means and why it's true. Let c be a complex number, and let R be the subring of the complex numbers generated by c. The differential operator

d=(d/dz-c/z)

acts on R[z, z^{-1}].

1/n

https://twitter.com/littmath/status/1729996751568859627

How does it act? Well,

(d/dz-c/z)z^n=nz^{n-1}-cz^{n-1}
=(n-c)z^{n-1}.

2/n

Sep 29, 2023 • 6 tweets • 3 min read

ChatGPT “proves” the cube root of 27 is irrational, then computes it to be 3, then admits it was wrong about its irrationality, and then finally, when asked to find its error, claims it was right all along. Undisputed king of BS.

o no

Aug 7, 2023 • 22 tweets • 7 min read

A math thread about some of the questions I've been thinking about lately. The goal is to answer some concrete questions about 2x2 matrices. Our starting point is the equation below--but it will take us on a journey through algebraic geometry, classical analysis, and more. 1/n

Everything here is joint work with Aaron Landesman and Josh Lam -- if you're an expert, you can read the paper here:

2/narxiv.org/abs/2308.01376

Apr 25, 2023 • 5 tweets • 1 min read

One of my pet peeves is when a mathematician will claim some other area of math overcomplicates things, or is broadly uninteresting—most math (IMO) is pretty cool, and I think it’s good for the soul to approach other areas with a spirit of curiosity, rather than superiority. Of course it’s fun to post a screenshot of some complicated text and tweet “math is the language of the universe lol” or something—I’ve done this myself—but I think it’s best done in with a bit of self-mockery, rather than playing into some imagined rivalries.

Apr 19, 2023 • 22 tweets • 3 min read

In this thread I will try to explain what a short exact sequence is, with an interested 8th grader as the target audience. All I'll assume is some familiarity with systems of equations, which is typically a middle school topic. 1/n Most of you think this is impossible (see the poll below)! But I think it can be done, admittedly with some imprecision.

https://twitter.com/littmath/status/1648315697791942658

Oct 13, 2022 • 4 tweets • 1 min read

One of the most misleading things about learning mathematics is that you are seeing the *output* of a great deal of trial, error, sweat, and tears, presented as if it’s obvious, or at best a product of cleverness. I often try to explain how one might have discovered the proof I’m presenting, but even so one can’t show all the blind alleys one would go down in actual research.

Oct 12, 2022 • 4 tweets • 1 min read

Continued studying games in my Intro to Proofs class today, and one of my students beat me in a game! Couldn’t be more proud :).

https://twitter.com/littmath/status/1577799621920915456

(i’m not owned, i’m not owned…)

Jun 13, 2022 • 42 tweets • 10 min read

A thread on some of the math I've been thinking about lately. I think it's really fun: it involves connections between low-dimensional topology, algebraic geometry, classical analysis and ODEs, dynamics, representation theory, number theory, and more. 1/n

This thread is meant for people with some background in math -- maybe partway through a math major, plus or minus a bit. If you're a professional, you can read the intro to the relevant paper here: arxiv.org/abs/2205.15352
2/n

Share this page!

Enter URL or ID to Unroll