martin_casado
Aug 31
OK, here is my best guess on the state of LLMs:
- The scale increase between gpt-3 and gpt-4 was 100x
- Doing that for the next model is going to be very hard
- We're nearly out of general language tokens. So let's say we can 2x that. And perhaps get more proprietary tokens and get to 3-4x. And do a lot of data cleaning and get to 6-7x.
- A 100x training run also requires a gigawatt datacenter, which we don't have yet
- Synthetic data is great, but it's not clear how that can be used for general language. I suspect this is why both OAI and Anthropic are focusing on math and code which can be improved via various "synthetic" compute methods (simulated data, or recursive self improvement of some sort)
- In the meantime, there is focus on getting more learnings from the same data. Perhaps there is a breakthrough there but I've not heard of it
- Planning can be pushed to inference in some domains (e.g. coding) which we're starting to hear about. But again, not clear how much this buys.
- Moronic policies like SB 1047 are threatening to slow all this down.

So tl;dr I don't see where the 100x jump will come from for general language reasoning. This is why we're seeing a focus on math and code. I'm glad teams are working hard at new algorithmic unlocks.

(btw, this is pure speculation, would love to know where I'm wrong!)
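
To make the arithmetic in the data bullet concrete, here is a rough Python sketch of how far each data scenario falls short of a 100x jump. The multipliers come straight from the thread. Reading them as cumulative ("get to 3-4x", "get to 6-7x") and comparing them directly against the 100x scale figure are assumptions of this sketch, and the names `data_scenarios` and `gap_table` are purely illustrative.

```python
# Figures are the thread's own guesses; treating them as cumulative
# multipliers on available training tokens is an assumption of this sketch.
TARGET_SCALE = 100  # claimed scale increase between gpt-3 and gpt-4

# (low, high) cumulative data multiplier for each step named in the thread.
data_scenarios = {
    "more general language tokens": (2, 2),
    "plus proprietary tokens": (3, 4),
    "plus heavy data cleaning": (6, 7),
}


def remaining_gap(target, achieved):
    """Factor still missing after `achieved`x if the goal is `target`x."""
    return target / achieved


def gap_table(target=TARGET_SCALE, scenarios=data_scenarios):
    """Map each scenario to (best-case gap, worst-case gap)."""
    return {
        name: (remaining_gap(target, high), remaining_gap(target, low))
        for name, (low, high) in scenarios.items()
    }


if __name__ == "__main__":
    for name, (best, worst) in gap_table().items():
        print(f"{name}: still {best:.1f}x to {worst:.1f}x short of {TARGET_SCALE}x")
```

Even in the most optimistic data scenario, the sketch leaves a gap of roughly 14x, which is the shortfall the tl;dr is pointing at.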

More from @martin_casado

Aug 8
.@Scott_Wiener continues to falsely claim narrow opposition to SB 1047. When in reality there is massive public outcry across research, academic, public and private business and finance. Here is a mega roundup of recent announcements that fully debunk the Senator's claims 🧵
.@russellwald and @AndrewYNg from Stanford, @vishalmisra from Columbia, and Ion Stoica from Berkeley have also articulated the damage the bill would do to their research, the AI industry and to the state of California.

An open letter from the University of California community has gathered dozens of signatures from academics defending their research.


Read 12 tweets
Nov 3, 2023
1/ We’ve submitted a letter to President Biden regarding the AI Executive Order and its potential for restricting open source AI. We believe strongly that open source is the only way to keep software safe and free from monopoly. Please help amplify.


2/ The letter is undersigned by researchers, academics and founders in AI including @ylecun, @ClementDelangue, @arthurmensch, @tobi, @garrytan, @bgurley, @amasad, @AravSrinivas, @soumithchintala, @tylercowen, @ID_AA_Carmack, @NaveenGRao, @Suhail, @fenbielding, @pmarca, @bhorowitz
3/ The EO's definitions of “AI” and "dual-use models" are too broad, casting a wide net over existing and future software innovations. Worse, the EO sets up a gauntlet of reporting designed for tech giants that would be crushing for researchers, non profits or smaller companies.
Read 8 tweets
Dec 22, 2022
0/ Most first time founders screw up their board by optimizing for the wrong things. Having built multiple of my own (and screwed up) and having sat on dozens more, here are a few things to keep in mind 👇
1/ Boards are for governance, not advice: The purpose of a board is to keep everyone out of jail. There is a *lot* of governance related work on a board. So make sure your board members know what they're doing.
2/ You can't run a company from a board meeting: All too often board meetings are an attempt to have "strategic discussions". Better to report and get focused feedback. Engage your board members outside of the board room for more effective discussions on strategy.
Read 11 tweets
Dec 20, 2022
0/ Descriptions of finding product market fit, and category creation often miss the hard part of getting to $10m in ARR ...

... which is the crazy effort required to tug, pull and hammer the shit out of the product *and* the market to make them hold together 👇
1/ Navigating early markets is really hard, and softening the market (market annealing) can take years.

While there are few easy answers, here are things to keep in mind if you're in a market annealing situation ..
2/ Founder sales can easily take you to $4m+ with the help of a few senior reps to navigate the procurement process. But founders should be on the front line.

As @bhorowitz told me "you can't run a company without being piped into the nervous system of PMF"
Read 12 tweets
Jun 9, 2022
[New Post] The Cloud Killed Infra, Long Live Infra!

The application first moved to the cloud
Now it's outgrowing it
And vertical clouds are signaling a new era of software infra

Blog post here, key points below:👇

a16z.com/2022/06/09/the…
1/ The traditional cloud market has been so successful and grown so large that independent cloud services are now large enough markets to sustain viable independent companies.
2/ Of course, a strong infra team just focused on one cloud service is naturally going to out-execute a cloud provider that's trying to support hundreds -- not just in features, support & performance. But over time, they'll also have a deeper understanding of the customer need.
Read 11 tweets
May 15, 2022
0/ (more gardening content ... because @sriramk asked :)

🌱 Here is a thread of gardening projects you can do at home, and with very little space 🌱
1/ Worm bins (Vermicompost)

I don't like household waste compost bins. It's too hard to get the right mix of clippings, brown org material, keep it wet etc. Instead I use a worm bin which is like a garbage disposal for compost.
... (cont)
... cont

Mine is just two plastic bins, one inside another. The inside bin has holes. I throw paper and kitchen waste in it, and voilà .. in a few weeks, amazing worm castings!

Get worms online here :

unclejimswormfarm.com
Read 9 tweets
