🦾Meet Aristotle Agent, the world’s first autonomous mathematician — live and currently free of charge. We designed Aristotle Agent to solve and formalize the world’s most challenging mathematical research problems. It is now:
☑️#1 in Formal Math: We’re the #1 formal math model according to ProofBench, by @ValsAI, ahead of the closest competitor by 15%. Aristotle Agent can autonomously prove/formalize for up to 24 hrs without human intervention.
☑️Fully Agentic: Give it an English problem and it will prove/formalize from scratch, or it can work and edit files directly inside your Lean project / repository.
☑️Github-ready: Aristotle agent produces repo-quality code; project leads are increasingly merging Aristotle-drafted PRs with no modifications.
Many of us intuitively feel that the field of mathematics is going to change, so let's unpack the likely outcomes, without resorting to hyperbole or doomerism.
Hypothesis #1: The number of mathematical proofs and the aggregate amount of mathematical data will expand exponentially.
This one is perhaps uncontroversial. AI models, under the direction of humans, are already creating voluminous amounts of math. Now the bottleneck has gone from creating mathematics to verifying its correctness.
Aristotle, which uses @leanprover, solves the verification problem by enabling each proof to be machine-checkable.
As the technology improves, the length and complexity of the created proofs will continue to increase.
Hypothesis #2. There will be dramatically more mathematicians in 10 years than there are today.
By automating the drudgery of verification and proving simple lemmas, Aristotle can not only speed up the work of highly sophisticated professional mathematicians, but also lower the barrier to entry for others to contribute. Among our beta API users are retired software engineers, lawyers, military members, students, and plenty of other people that have mathematical ideas / ingenuity but need help with the formalism and need someone to verify their ideas. We should expect these trends to accelerate.