swyx Profile picture
Anti-ego ideas for anti-ergodic life. Founder, @smolmodels ▹ Listen: @latentspacepod ▹ Read: @coding_career ▹ Join: @aiDotengineer
Maleph Profile picture Jin Ho Hur Profile picture Dennis Profile picture Fei Zhao Profile picture Marko Srsan Profile picture 13 subscribed
Nov 6, 2023 23 tweets 10 min read
Join @latentspacepod and @thursdai_pod live at DevDay!

Now:

spotted: “New Products Deep Dive” for 45 mins… I wonder what that will be twitter.com/i/spaces/1BRJj…

GPT4 Turbo is ~3x cheaper than GPT4!

1. OpenAI's longest ever Context length: 128k
2. Better JSON/function calling
3. Knowledge: built in RAG and April 2023 cutoff
4. Dalle3, GPT4-V, and TTS model all in API today!!!
4b. Whisper V3 open sourced (coming to API)
5. Customization: GPT3 16k, GPT4 finetuning, Custom Models services
6. Higher Rate Limits - 2x tokens per minute, request raises in account settings - plus: Copyright Shield!

"GPT4 Turbo is a smarter model than GPT4" (GPT4.5 confirmed!)


Image
Image
Image
Oct 10, 2023 17 tweets 11 min read
it’s official - I think GitHub Copilot is the first* generative AI product to publicly claim they’ve passed $100m ARR — enough to stand alone as a publicly listed company

Whenever people ask me “is AI a fad” the biggest thing I point to is “follow the money”:

- revenue, not just funding
- RECURRING, not tcosts on hype
- people publicly saying they’d pay 5x the cost

(*there’s likely a few others but none confirmed officially - see Anatomy of Autonomy post on @latentspacepod)
Image next up is @DedyKredo LIVE CODING a full test suite, making code changes, and automating commit and PR review, all assisted by @CodiumAI . audible “what the fuck” from @eugeneyan.



ends with a powerful message for Israel. we stand with you @itamar_mar. youtube.com/live/qw4PrtyvJ…
Jul 18, 2023 17 tweets 12 min read
That was fast - Llama 2 is out!

and cleared for commercial use! and *destroys* Falcon 40B on @DanHendrycks's MMLU and other top benchmarks

They really meant it when they said "imminently" lol



Scheduled a @latentspacepod at 3pm PT - join @FanaHOVA and… https://t.co/iWFLYJLCJd https://t.co/C0YKJ8snjr https://t.co/TZvfRrz5lKtwitter.com/i/spaces/1nAKE…
twitter.com/i/web/status/1…



Image
Image
Image
@DanHendrycks @latentspacepod @FanaHOVA it seems @mascobot is on top of it - you can try out llama 2 here:

they also have a Llama playground but its not currently working for me https://t.co/cao0EUYWQSreplicate.com/a16z-infra/lla…
Jun 30, 2023 5 tweets 2 min read
🆕 Essay: The Rise of the AI Engineer



Keeping up on AI is becoming a full time job.

Let's get together and define it. https://t.co/KD2lY9FTtmlatent.space/p/ai-engineer
Builders need a place to talk turpentine. This is why i'm teaming up with @benghamine to produce @aiDotEngineer, the definitive place to talk AI UX, devtools, infra, and all things AI Engineering.

500 seats.
SF/Virtual, Oct 8-10.

Join us!

Jun 20, 2023 6 tweets 6 min read
The @latentspacepod is excited to publish:

Petaflops to the People:
@realGeorgeHotz's first interview
on his new personal compute cluster company

the tiny corp.

latent.space/p/geohot

We discuss how tiny is taking on Nvidia, Google, and PyTorch with a tiny team and go deep… twitter.com/i/web/status/1… @latentspacepod @realGeorgeHotz GPT4 is 8 x 220B params = 1.7 Trillion params



ok I wasn't sure how widely to spread the rumors on GPT-4 but it seems Soumith is also confirming the same so here's the quick clip!

so yes, GPT4 is technically 10x the size of GPT3, and all the small… twitter.com/i/web/status/1…
Jun 7, 2023 5 tweets 7 min read
this is a trend I'm calling "Code is all you need"

Comparing Bard vs @OpenAI ChatGPT vs @AnthropicAI Claude on Google's own reasoning/math prompts shows the stark contrast once you make your model write and eval code to answer questions. Reminds me of @amasad and @goodside's… twitter.com/i/web/status/1… ImageImageImage @OpenAI @AnthropicAI @amasad @goodside This is part of a broader trend of us slowly discovering the special place of code in language models:

1/ Code Improves LLMs
@Francis_YAO_ et al have repeatedly found that adding code in pretraining data improves LLMs in all benchmarks ( )

2/ Code LLMs… twitter.com/i/web/status/1…
May 14, 2023 5 tweets 4 min read
Stop building the thing.
Build the thing that builds all the things.

IMO the most important thing every developer could be doing right now on nights and weekends is building a general purpose personal junior dev agent they can control and trust, that they can scale to fleets.… twitter.com/i/web/status/1… Image first thing Tony ever built wasn't a flying suit of armor, fancy weapons, or mini fusion reactor

he built the thing that builds the things (and saves his life when the other stuff fails)
Apr 25, 2023 4 tweets 4 min read
.@Replit just announced their own LLaMa style code LLM at their developer day!

replit-code-v1-3b

- 2.7b params
- 20 languages
- 525B tokens (“20x Chinchilla?”)
- beats all open source code models on HumanEval benchmark
- trained in 10 days with @NaveenGRao @MosaicML ImageImage and @amasad follows up with a finetuned version - replit-finetune-v1-3b - using @Replit data - and this catapults Replits model *ahead* of @OpenAI codex 🤯

they are matching the performance of >10B LLMs with way smoller 2.7B models

and it will be open source/freely licensed! Image
Apr 23, 2023 4 tweets 3 min read
I love seeing the birth of a new social network. unsure about its future but its cool that in early days it’s still smol enough you can hold the world “map” in your head and zoom in to see individual people

the internet was a nicer place when it was a neighborhood and not a mob ImageImageImageImage everyone out here tweeting bsky fomo, i'm in here making @chirperai bots, we are not the same Image
Apr 19, 2023 16 tweets 10 min read
🧠 The Anatomy of Autonomy 🤖

The fifth killer app of AI is Autonomous Agents.

Presenting
- Summary of #AutoGPT / @babyAGI_
- The 5 stages of "brain" development it took to get from Foundation Models to Autonomous Agents
- Why Full Autonomy is like "Full Self Driving"!

Begin: Image @babyAGI_ (this is the obligatory threadooor TLDR of my latest newsletter post, hop over if you like my long form work: )latent.space/p/agents
Apr 19, 2023 4 tweets 4 min read
Writing my recap / thoughts on AI Agent mania today for the newsletter.

- if you've used @babyAGI_ / @AutoGpt for something interesting: what's a good usecase?

- if you're highly skeptical: why?

- if you want to see more: elaborate? using chatgpt to rip apart @yoheinakajima's code haha

this feels like cheating. i cant look at any new codebase without this visualization again (cc @ShaneaLeven or @danlovesproofs maybe already has a smarter take on this) ImageImageImageImage
Mar 27, 2023 4 tweets 3 min read
Incredible how Stephen Wolfram toiled away in relative obscurity for ~15 years, only to wake up one day and find that Wolfram|Alpha is literally the perfect bridge from agentic AI to real world knowledge, errors included.

“You can't connect the dots looking forward; you can only… twitter.com/i/web/status/1… ImageImageImageImage if you had asked me in January how long it would take us to blend symbolic ai and generative ai i would have said 5 years… took 10 lines of json with the new chatgpt plugins system
Mar 23, 2023 8 tweets 6 min read
ChatGPT casually dropped an APP STORE 🤯

It can now:
- browse the web (RIP Bing waitlist, cutoff)
- write and run Python (RIP replit?)
- access org info (RIP docsearch startups)
- add third party plugins from OpenTable, Wolfram, Instacart, Zapier, etc)
- developer SDK in preview When I talked about the AI Red Wedding last year () I was talking about AI offerings undercutting existing human-based or manual business processes.

Now the AI Red Wedding is coming for companies building atop foundation model companies.

@OpenAI is… twitter.com/i/web/status/1…
Mar 22, 2023 4 tweets 5 min read
Wow. GitHub CEO @ashtom just announced GitHub Copilot X:

- Copilot Chat - "ChatGPT-like experience in your editor" powered by GPT-4
- Copilot for Pull Requests - AI-generated descriptions for pull requests on GitHub
- GitHub Copilot for Docs - chat for *any* company's repos and… twitter.com/i/web/status/1… @ashtom Microsoft goes Megahard on AI.

It's fairly clear that "Copilot for X" is now going to be Microsoft's third major strategic shift:

80s-00s: Windows and Office @BillGates @Steven_Ballmer

2010s: Azure and Bing @satyanadella @kevin_scott

2020s-30s: Copilot @ashtom ???

This is… twitter.com/i/web/status/1…
Mar 14, 2023 17 tweets 12 min read
GPT4 is live!!!

openai.com/research/gpt-4 GPT4 gets 100% accuracy on this HumanEval task.

previous iterations were all under <50%.

holy shit.
Mar 12, 2023 5 tweets 6 min read
Big Data may be dead, but looking at data is still stupendously underrated even in 2023.

Small collection of examples where looking at ✨analytics✨ changed the trajectory of a whole business: First (and most famous, but gotta acknowledge the greats), @kevin pivoted his Foursquare mobile check-in competitor after hiring @mikeyk to look at analytics.

Mike saw that out of all the features they shipped, only one got off the charts usage.

and @Instagram was born
Jan 3, 2023 6 tweets 4 min read
ChatGPT’s current killer app isn’t search, therapy, doing math, controlling browsers, emulating a virtual machine, or any of that other cherrypicked examples that come with huge disclaimers.

It’s a lot more quotidian:

Reformatting information from any format X to any format Y. “ChatGPT reformatting” requires minimal world knowledge, are instantly verifiable, and can reliably save minutes of work multiple times a day.

The reformat can include contextual inference, which saves even more time at the cost of a bit more risk:

Nov 20, 2022 7 tweets 4 min read
Convinced that all devs should work on a database as part of training.

Ever joked about DataStructures & Algorithms only being useful at interviews?

Work on a DB

Ever wondered why {{ FAVE_APP }} is slow?

Probably a DB

Prefer compilers?

allow me to introduce query planners.. Perhaps my real hot take is that databases are the CS grads abstracting away all the Hard Problems so that us bootcamp grads can cosplay being "full stack" with literally 1 day of SQL experience before getting hired to make 6 figures making rectangles on server vs on client
Nov 18, 2022 4 tweets 3 min read
This is a HUGE milestone.

I don’t think people outside SF have any idea how close we are to self driving. Most people think it’s “continually 5 years away”.

This is no joke. Cruise now has “push button, get car” autonomous taxis 24hrs/day in one of the busiest cities on earth. i got a nighttime joyride with @alexbowe recently and while we couldnt test heavy traffic, we pointed the car at the most challenging road we could think of inside the drivable zone - the incline squiggly road in Potrero Hill.

SF is a hilly city but was handled like a champ. twitter.com/i/web/status/1… Image
Oct 11, 2022 9 tweets 4 min read
Looking to improve my mobile automation:

What iOS Shortcuts do you use?

(I don’t have a smart home but would like to be smarter everywhere else) ok early doors but this is in the running to be #1 time saver
Sep 25, 2022 4 tweets 4 min read
The AI Red Wedding:

- GPT3/Jasper 🔪 Low value copywriters
- Stable Diffusion 🔪 Stock image companies
- OpenAI Whisper 🔪 Voice transcription APIs

Every month a sleepy industry that hasn't changed for years gets AI'ed.

Which is next? My thoughts on the next L-Space newsletter:

**Eigenquestions for the AI Red Wedding**

lspace.swyx.io/p/eigenquestio…

Inspired by @shishirmehrotra's interview on the @lennysan pod

I don't know what the next great AI product *is*, but I know what questions it will answer.

Preview: Image