Damek
Dec 2, 2023 · 9 tweets
I proved a super simple helper lemma.
In English, the proof is one line.
In Lean 4, it took me around 50 lines of code.
1/n [image]
The proof took me around 5 hours (lol).
Any help shortening would be appreciated.

2/n [image]
The trickiest things:
1. Can't take basic math -- e.g., commutativity, automatic casting of naturals as reals -- for granted.
2. Tactics feel simultaneously powerful and weak. E.g., sometimes 'linarith' easily proves inequalities, sometimes not. (A small sketch of both points follows below.)
3. GPT-4 is not very helpful.
3/n
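To make points 1 and 2 concrete, here is a minimal illustrative Lean 4 / mathlib sketch (toy examples, not lines from the actual proof):

    import Mathlib

    -- 1. Casting: a fact about naturals doesn't automatically apply once the
    --    naturals are viewed as reals; you move it across the coercion explicitly.
    example (n : ℕ) (h : 0 < n) : (0 : ℝ) < n := by
      exact_mod_cast h

    -- 2. linarith dispatches purely linear goals instantly...
    example (x y : ℝ) (h1 : x ≤ y) (h2 : 0 ≤ x) : 0 ≤ y := by
      linarith

    -- ...but gives up once a product of unknowns appears, and you supply the
    --    nonlinear step yourself (or reach for nlinarith / positivity).
    example (x y : ℝ) (hx : 0 ≤ x) (hy : 0 ≤ y) : 0 ≤ x * y :=
      mul_nonneg hx hy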
Next up for me is figuring out support for convexity, differentiability, and the fundamental theorem of calculus. I'm also about 7 chapters into the Lean 4 tutorial. I'll continue working through that in parallel.
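For reference, these are the mathlib entry points I expect to need (my guesses; names to be confirmed against current mathlib):

    import Mathlib

    #check @ConvexOn        -- convexity of a function on a set
    #check @HasDerivAt      -- one-variable derivatives
    #check @gradient        -- gradients (added to mathlib in late 2023)
    -- one mathlib form of the fundamental theorem of calculus:
    #check @intervalIntegral.integral_eq_sub_of_hasDerivAt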

I'm posting my code to a GitHub repo, which I'll link below.
4/n
You can find the code above at the following link:

github.com/damek/gd-lean/…
Whoops, I forgot an assumption in the theorem statement above: the sequence a_k should be nonincreasing.
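(As an aside, a minimal sketch of how that hypothesis can be phrased in Lean 4 / mathlib, with `a` standing in for the sequence:)

    import Mathlib

    variable (a : ℕ → ℝ)

    -- "nonincreasing" as mathlib's order-theoretic predicate...
    #check Antitone a
    -- ...or via the one-step condition, which mathlib upgrades for you:
    example (h : ∀ k, a (k + 1) ≤ a k) : Antitone a :=
      antitone_nat_of_succ_le h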
A ~30-line streamlined proof, closer to my envisioned one-line proof in English.
6/n
Terence Tao kindly sent me the following 3-line Lean proof, which is exactly the proof I wanted to write! [image]
Terry’s blog on writing the proof.

More from @damekdavis

Jul 22
Update:

On the first try using ChatGPT agent, it surfaced a reference from 2015 that contained a proof of the result in question.

The paper was on an unrelated topic; the proof was hidden in the appendix and was missing many of the relevant keywords.
The bound had also been stated but not proven in a 1994 paper. The 2015 paper gave a decently simple self-contained proof.

This question had been asked of a few prominent people in this field. They didn't know the answer, so it wasn't well known; I'm sure they could have figured it out.
I didn't know this literature at all. I tried not to read it except through interaction with LLMs.

The solutions given by all the LLMs always had a flaw. Their approach didn't look like the one in the 2015 paper.
Jul 14
Update: I didn't solve the problem

Over the last 6 months, I've tried pretty hard to use LLMs to solve an (I think) not-too-difficult problem in a nearby field (learning theory).

The answers have gone from terrible to plausible, and I actually understand the problem better now.
I now have references that are highly relevant and several pathways that might work.

Unfortunately, every argument proposed has had (in retrospect) a fairly obvious hole in it.
They all produce a series of creative steps and simplifications that move us closer to solving the problem.

But when you look carefully and backtrack, you see claims that just fall apart because of unjustified 'trivial' assertions.
Dec 11, 2023
Next formalization is the "gradient inequality" for smooth convex functions.

The proof (next tweet) is a few sentences of English.

My Lean 4 proof (below) is roughly 350 lines of code.

1/n [image]
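For reference, the inequality in question is the standard one for a differentiable convex f (the attached image is not reproduced here):

\[
f(y) \;\ge\; f(x) + \langle \nabla f(x),\, y - x \rangle \qquad \text{for all } x, y.
\]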
The proof in English is short. It is a consequence of the following simple convex-analysis fact:

If f is a convex function such that f(0) = 0 and f(y) ≥ o(‖y‖) as y → 0, then f(y) ≥ 0 globally. [image]
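Spelled out (a sketch of the standard reduction, since the attached images are not reproduced): apply this fact to

\[
g(v) := f(x + v) - f(x) - \langle \nabla f(x),\, v \rangle .
\]

Then g is convex, g(0) = 0, and differentiability of f at x gives g(v) = o(\|v\|) as v → 0, so the fact yields g ≥ 0 everywhere; taking v = y - x recovers the gradient inequality. The fact itself follows from convexity: g(tv) ≤ t\,g(v) for t ∈ (0, 1], so

\[
g(v) \;\ge\; \frac{g(t v)}{t} \;\ge\; \frac{o(t\|v\|)}{t} \;\to\; 0 \quad \text{as } t \to 0.
\]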
To formalize this proof, I first had to learn whether mathlib had support for gradients. Interestingly, this was just added in October:
leanprover.zulipchat.com/#narrow/stream…
[image]
Aug 29, 2023
Cool technique. Reminds me of the Cramér-Chernoff method, which applies a Markov-type inequality and optimizes to find the "best" parameter...
[image]
Carrying out the optimization bounds the tail probability by the exponential of minus the Fenchel conjugate of the log moment generating function of the r.v. [image]
You can then use this to do the standard tensorization trick for sums of i.i.d. random variables. [image]
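In symbols (a compact restatement of the steps above, since the attached images are not reproduced): for a real random variable X, any t, and any λ > 0, Markov's inequality applied to e^{λX} gives

\[
\Pr(X \ge t) \;=\; \Pr\big(e^{\lambda X} \ge e^{\lambda t}\big) \;\le\; e^{-\lambda t}\, \mathbb{E}\, e^{\lambda X},
\]

and optimizing over λ gives

\[
\Pr(X \ge t) \;\le\; \exp\!\Big(-\sup_{\lambda > 0}\big(\lambda t - \psi(\lambda)\big)\Big),
\qquad \psi(\lambda) := \log \mathbb{E}\, e^{\lambda X},
\]

where the exponent is the Fenchel conjugate \psi^*(t) whenever t \ge \mathbb{E}X. Tensorization: for S_n = X_1 + \cdots + X_n with the X_i i.i.d., \psi_{S_n}(\lambda) = n\,\psi_{X_1}(\lambda), so for a \ge \mathbb{E}X_1,

\[
\Pr(S_n \ge n a) \;\le\; e^{-n\, \psi_{X_1}^*(a)}.
\]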