1. Model
An AI model is a computer program loosely modeled on the human brain. You give it some input (i.e. a prompt), it does some processing, and it generates a response.
Like a child, a model “learns” by being exposed to many examples of how people typically respond or behave in different situations. As it sees more and more examples, it begins to recognize patterns, understand language, and generate coherent responses.
There are many different types of AI models. Some, which focus on language—like OpenAI o3, Claude Sonnet 4, Gemini 2.5 Pro, Meta Llama 4, Grok 3, DeepSeek, and Mistral—are known as large language models (LLMs). Others are built for video, like Google Veo 3, OpenAI Sora, and Runway Gen-4. Some models specialize in generating voice, such as ElevenLabs, Cartesia, and Suno. There are also more traditional types of AI models, such as classification models (used in tasks like fraud detection), ranking models (used in search engines, social media feeds, and ads), and regression models (used to make numerical predictions).
2. LLM (large language model)
LLMs are text-based models, designed to understand and generate human-readable text. That’s why the name includes the word “language.”
Recently, most LLMs have evolved into “multi-modal” models that can process and generate not just text but also images, audio, and other types of content within a single conversational interface. For example, the current ChatGPT models natively support text, images, and even voice. This started with GPT-4o, where “o” stands for “omni” (meaning it accepts any combination of text, audio, and image input).
3. Transformer
The transformer architecture, developed by Google researchers in 2017, is the algorithmic discovery that made modern AI (and LLMs in particular) possible.
Transformers are built around a mechanism called “attention”: instead of only being able to read text word by word, sequentially, the model is able to look at all the words at once. This helps the model understand how words relate to each other, making it far better at capturing meaning, context, and nuance than earlier techniques.
Another big advantage of the transformer architecture is that it’s highly parallelizable—it can process many parts of a sequence at the same time. This makes it possible to train much bigger and smarter models simply by scaling up the data and compute power. This breakthrough is why we suddenly went from basic chatbots to sophisticated AI assistants. Almost every major AI model today, including ChatGPT and Claude, is built on top of the transformer architecture.
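To make “looking at all the words at once” concrete, here’s a minimal sketch of scaled dot-product self-attention (the transformer’s core operation) in NumPy, using made-up dimensions and random weights:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of token vectors X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv              # project tokens into query/key/value spaces
    scores = Q @ K.T / np.sqrt(K.shape[-1])       # every token scores every other token's relevance
    scores -= scores.max(axis=-1, keepdims=True)  # subtract the row max for numerical stability
    weights = np.exp(scores) / np.exp(scores).sum(axis=-1, keepdims=True)  # softmax per token
    return weights @ V                            # each output mixes information from all tokens

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                       # 4 tokens, 8-dimensional embeddings
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)        # (4, 8): every token has now "seen" the whole sequence
```

Because every token attends to every other token in one matrix multiply, nothing has to happen sequentially, which is exactly what makes the architecture so parallelizable.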
4. Training/Pre-training
Training is the process by which an AI model learns by analyzing massive amounts of data. This data might include large portions of the internet, every book ever published, audio recordings, movies, video games, etc. Training state-of-the-art models can take weeks or months, require processing terabytes of data, and cost hundreds of millions of dollars.
For LLMs, the core training method is called “next-word prediction.” The model is shown billions of text sequences with the last word hidden, and it learns to predict what word should come next.
As it trains, the model adjusts millions (or, in modern LLMs, billions) of internal settings called “weights.” These are similar to how neurons in the human brain strengthen or weaken their connections based on experience. When the model makes a correct prediction, those weights are reinforced. When it makes an incorrect one, they’re adjusted. Over time, this process helps the model improve its understanding of facts, grammar, reasoning, and how language works in different contexts. Here’s a quick visual explanation: youtube.com/watch?v=rEDzUT…
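Here’s a toy version of that loop in PyTorch, assuming a tiny made-up vocabulary and a deliberately miniature model (a real LLM does the same thing with billions of weights and trillions of words):

```python
import torch
import torch.nn.functional as F

# Toy vocabulary and one training sentence: "the cat sat on the mat"
vocab = {"the": 0, "cat": 1, "sat": 2, "on": 3, "mat": 4}
ids = torch.tensor([0, 1, 2, 3, 0, 4])
inputs, targets = ids[:-1], ids[1:]     # each token's "label" is simply the next token

# A deliberately tiny "language model": embed the current token, score every next token
emb = torch.nn.Embedding(len(vocab), 16)
head = torch.nn.Linear(16, len(vocab))
opt = torch.optim.SGD(list(emb.parameters()) + list(head.parameters()), lr=0.5)

for step in range(200):
    logits = head(emb(inputs))               # predicted scores over the whole vocabulary
    loss = F.cross_entropy(logits, targets)  # penalty for wrong next-word guesses
    opt.zero_grad()
    loss.backward()                          # work out how to nudge every weight
    opt.step()                               # nudge them

print(round(loss.item(), 3))  # the loss shrinks as the model learns the sequence
```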
If you’re skeptical of next-word prediction generating novel insights and super-intelligent AI systems, here’s @ilyasut (co-founder of OpenAI) explaining why it’s deceptively powerful: youtu.be/YEUclZdj_Sc
5. Supervised learning
Supervised learning refers to when a model is trained on “labeled” data—meaning the correct answers are provided. For example, the model might be given thousands of emails labeled “spam” or “not spam” and, from that, learn to spot the patterns that distinguish spam from non-spam. Once trained, the model can then classify new emails it’s never seen before.
Most modern language models, including ChatGPT, use a subtype called “self-supervised learning.” Instead of relying on human-labeled data, the model creates its own labels, generally by hiding the last word of a sentence and learning to predict it. This allows it to learn from massive amounts of raw text without manual annotation.
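A minimal sketch of both ideas, using scikit-learn and made-up emails (the data and labels are hypothetical):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Supervised learning: the labels ("spam"/"not spam") are provided up front
emails = ["win a free prize now", "meeting moved to 3pm",
          "claim your free reward", "lunch tomorrow?"]
labels = ["spam", "not spam", "spam", "not spam"]

vectorizer = CountVectorizer()
X = vectorizer.fit_transform(emails)        # turn each email into word counts
clf = LogisticRegression().fit(X, labels)   # learn which words separate the classes
print(clf.predict(vectorizer.transform(["free prize waiting"])))  # likely ['spam']

# Self-supervised learning: the labels come from the data itself
sentence = "the cat sat on the mat".split()
inp, label = sentence[:-1], sentence[-1]    # input = all but the last word, label = the last word
print(inp, "->", label)
```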
6. Unsupervised learning
Unsupervised learning is the opposite: the model is given data without any labels or answers. Its job is to discover patterns or structure on its own, like grouping similar news articles together or detecting unusual patterns in a dataset. This method is often used for tasks like anomaly detection, clustering, and topic modeling, where the goal is to explore and organize information rather than make specific predictions.
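Here’s a quick sketch of unsupervised clustering with scikit-learn, using a few made-up headlines and no labels at all:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

# No labels here: the model has to find the structure on its own
articles = [
    "stocks rally as markets hit record highs",
    "central bank holds interest rates steady",
    "local team wins championship in overtime",
    "star striker signs record transfer deal",
]
X = TfidfVectorizer().fit_transform(articles)  # articles as word-importance vectors

# Ask for two clusters and see which articles get grouped together
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)  # e.g. [0 0 1 1]: finance stories vs. sports stories
```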
7. Post-training
Post-training refers to all of the additional steps taken after training is complete to make the model even more useful. This includes steps like “fine-tuning” and “RLHF.”
8. Fine-tuning
Fine-tuning is a post-training technique where you take a trained model and do additional training on specific data that’s tailored to what you want the model to be especially good at. For example, you would fine-tune a model on your company’s customer service conversations to make it respond in your brand’s specific style, or on medical literature to make it better at answering healthcare questions, or on educational content for specific grade levels to create a tutoring assistant that explains concepts in age-appropriate ways.
This additional training tweaks the model’s internal weights to specialize its responses for your specific use case, while preserving the general knowledge it learned during pre-training.
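Here’s a rough sketch of what that looks like mechanically, in PyTorch, with stand-in data and a toy model in place of a real pretrained checkpoint:

```python
import torch
import torch.nn.functional as F

# Stand-in for a pretrained model (in practice you'd load a real checkpoint,
# e.g. from Hugging Face, rather than this randomly initialized toy)
vocab_size, dim = 100, 16
model = torch.nn.Sequential(torch.nn.Embedding(vocab_size, dim),
                            torch.nn.Linear(dim, vocab_size))

# Hypothetical domain data: token IDs from, say, your support conversations
domain_batch = torch.randint(0, vocab_size, (8, 12))
inputs, targets = domain_batch[:, :-1], domain_batch[:, 1:]

# Fine-tuning is the same next-word-prediction training, just on specialized
# data and typically with a small learning rate so general knowledge survives
opt = torch.optim.AdamW(model.parameters(), lr=1e-5)
logits = model(inputs)                                    # (8, 11, vocab_size)
loss = F.cross_entropy(logits.reshape(-1, vocab_size), targets.reshape(-1))
loss.backward()
opt.step()  # the pretrained weights get gently nudged toward the new domain
```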
Here’s an awesome technical deep dive into how fine-tuning works: youtu.be/eC6Hd1hFvos
9. RLHF (reinforcement learning from human feedback)
RLHF is a post-training technique that goes beyond next-word prediction and fine-tuning by teaching AI models to behave the way humans want them to—making them safer, more helpful, and aligned with our intentions. RLHF is the key method used for what’s referred to as “alignment.”
This process works in two stages: First, human evaluators compare pairs of outputs and choose which is better, training a “reward model” that learns to predict human preferences. Then, the AI model learns through reinforcement learning—a trial-and-error process where it receives “rewards” from the reward model (not directly from humans) for generating responses the reward model predicts humans would prefer. In this second stage, the model is essentially trying to “game” the reward model to get higher scores.
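Here’s a simplified sketch of the first stage (the reward model’s preference loss) in PyTorch. Everything here is a stand-in: a linear layer over made-up response embeddings plays the role of the reward model.

```python
import torch
import torch.nn.functional as F

# Stage 1: train a reward model on human preference pairs
dim = 32
reward_model = torch.nn.Linear(dim, 1)
opt = torch.optim.Adam(reward_model.parameters(), lr=1e-3)

chosen = torch.randn(4, dim)    # embeddings of 4 responses humans preferred
rejected = torch.randn(4, dim)  # embeddings of the 4 responses they liked less

# Bradley-Terry-style loss: push each preferred response's score above its rival's
loss = -F.logsigmoid(reward_model(chosen) - reward_model(rejected)).mean()
loss.backward()
opt.step()

# Stage 2 (not shown) runs reinforcement learning (commonly PPO), using this
# reward model's scores as the reward signal that updates the LLM itself.
```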
10. Prompt engineering
Prompt engineering is the art and science of crafting inputs (i.e. “prompts”) for AI models that result in better and more useful responses. Just as with people, the way you phrase your question shapes the answer you get: the same AI model will give very different responses depending on how you craft your prompt.
There are two categories of prompts:
1. Conversational prompts: What you send ChatGPT/Claude/Gemini when you’re having a conversation with it
2. System/product prompts: The behind-the-scenes instructions that developers bake into products to shape how the product behaves (see the sketch below)
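To make the distinction concrete, here’s the chat-message format most LLM APIs accept, shown as plain data. “Acme Corp” and the instructions are hypothetical:

```python
messages = [
    {
        "role": "system",  # system/product prompt: baked in by the developer, invisible to users
        "content": "You are a friendly support agent for Acme Corp. "
                   "Answer in two sentences max and never discuss competitors.",
    },
    {
        "role": "user",    # conversational prompt: what the end user actually types
        "content": "How do I reset my password?",
    },
]
print(messages)
```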
Here’s a podcast episode from just last week where we cover this and much more: youtu.be/eKuFqQKYRrA
11. RAG (retrieval-augmented generation)
RAG is a technique that gives models access to additional information at run-time that they weren’t trained on. It’s like giving the model an open-book test instead of having it answer from memory.
When you ask a question like “How do this month’s sales compare to last month?” a retrieval system is able to search through your databases, documents, and knowledge repos to find pertinent information. This retrieved data is then added as context to your original prompt, creating an enriched prompt that the model then processes. This leads to a much better, more accurate answer.
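Here’s a toy sketch of that flow in Python. Real systems use embeddings and a vector database for retrieval; simple word overlap is used here just to show the shape of the technique, and the documents are made up:

```python
documents = [
    "October sales totaled 1.2 million across 310 deals",
    "September sales totaled 0.9 million across 270 deals",
    "The employee handbook covers vacation policy and benefits",
]

def retrieve(query, docs, k=2):
    """Rank documents by how many words they share with the query."""
    q = set(query.lower().split())
    scored = sorted(docs, key=lambda d: len(q & set(d.lower().split())), reverse=True)
    return scored[:k]

question = "How do this month's sales compare to last month?"
context = "\n".join(retrieve(question, documents))

# The enriched prompt the model actually sees: retrieved facts + the question
enriched_prompt = f"Answer using this context:\n{context}\n\nQuestion: {question}"
print(enriched_prompt)
```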
If the model doesn’t have the context it needs to answer your question—context that RAG can supply—that’s when “hallucinations” are most likely to happen (see more below).
Broadly, to summarize:
- Pre-training: Teaches the model general knowledge (and language)
- Fine-tuning: Specializes the model for specific tasks
- RLHF: Aligns the model with human preferences
- Prompt engineering: The skill of crafting better inputs to guide the model toward the most useful outputs
- RAG: A technique that retrieves additional relevant information from external sources at run-time to give the model up-to-date or task-specific context it wasn’t trained on
1. Remote jobs are shrinking fast (down 35% from peak)
2. There’s been a shift to hiring more senior candidates
The chart below shows the proportion of open PM jobs by level over time.
If you look at the light blue and dark blue segments below (i.e. Senior and Lead/Senior++ roles), you can see that their share of open roles has clearly grown since early 2023, with Lead/Senior++ roles growing their share the fastest. Meanwhile, the share of Entry/Mid-level roles (the pink segment) has decreased the most since early 2023.
3. More than one in five open PM roles is based in the San Francisco Bay Area. The share grew from 15.4% to over 20% in the past two years, and it appears to be growing further.
The rise of product management over the past 25 years.
Huge growth for 20+ years, followed by a plateau over the past couple of years.
This tells us the PM role isn’t going through the hypergrowth it saw earlier this decade, but it’s also not shrinking. This seems like a good and healthy thing all around.
Numbers-wise, there are about 450,000 active PMs in the U.S. right now, and 2,500 to 4,500 are being hired each month.
Here are the top hirers of PM roles over the past few years:
As a comparison, here’s the engineering role over that same time frame—similar growth trajectory, also a bit of a slowdown in the past one or two years, though not as much of a slowdown as PMs. Again, this seems right and healthy.
In most hiring processes, you’re lucky to get 45 minutes to chat with a candidate before having to make a thumbs-up or thumbs-down decision.
How do you use that precious time to get the most important information about the candidate?
For over a year now, I’ve been asking my illustrious podcast guests to share their favorite interview questions (nearly 150 guests now!), and the collection of questions that’s emerged is like nothing I’ve seen elsewhere. These are not just great questions—they are exceptionally good at pulling out the essential insights about the candidate in the least amount of time.
Below, I'll share some of my favorite high-signal-to-noise interview questions, including what to look for in a great answer, grouped by theme. To see the full list, don't miss today's newsletter post (link below).
How to learn the most about a candidate from a single interview question—High-signal-to-noise interview questions inspired by my 150+ podcast guests
Question 1: Talk me through your biggest product flop. What happened and what did you do about it?
“I look for people being brutally honest about how bad it was and why it failed. The rest of the interview, they’re trying to tell you all the wonderful things they did and all the accomplishments they had. And so I think the rawer the answer in terms of how bad it was and why, the better.”
—Annie Pearl, corporate vice president at @Microsoft, ex-CPO at @Calendly
Every startup can be distilled into a simple equation.
And until you can express yours as one, you don’t fully understand your business.
Having this equation gives you a map of your biggest growth drivers and your key inputs and outputs. Once your teams are aligned behind it and the equation is operationalized, you’ll experience a huge force multiplier, because every team will be focusing its energy on the same (high-leverage) levers.
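As a purely hypothetical illustration, here’s what this might look like for a simple subscription business, with made-up numbers:

```python
# A hypothetical growth equation for a simple subscription business:
#   revenue = visitors x signup_rate x paid_conversion x avg_revenue_per_subscriber
visitors = 500_000      # monthly site visitors
signup_rate = 0.05      # share of visitors who create a free account
paid_conversion = 0.10  # share of signups who become paying subscribers
arpu = 30               # average monthly revenue per subscriber ($)

revenue = visitors * signup_rate * paid_conversion * arpu
print(f"${revenue:,.0f}/month")  # $75,000/month
```

Written this way, the levers are explicit: doubling any one input doubles revenue, so every team can ask which lever is cheapest for them to move.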
I teamed up with @danhockenmaier to collect the detailed equations for the eight most common tech business models:
Some of my biggest surprises when researching paths to PMF for top B2B companies:
1. If you build it, they *will* come—if you have strong product-market fit.
Though it often takes years to find initial PMF, once you do, a common pattern across top startups is strong (and explosive) organic growth—primarily seen as cold inbound and word-of-mouth growth.
This was true for Segment, Loom, Dropbox, Canva, Sprig, Stytch, and most others.
2. Stop thinking of product-market fit as a single moment.
It can be, but it almost always isn't.
Instead, think of finding PMF as an ongoing process of finding stronger fit with more and more segments of the market.
Though there will never be a foolproof formula for finding product-market fit, here’s my best attempt at a guide for B2B startups, one that’ll save you a lot of time and pain.
It's based on interviews and research into the PMF journeys of 25 top B2B startups.
Here's a peek:
Here's the full post: A guide for finding product-market fit in B2B
Inside:
1. A framework for finding PMF
2. Signs that you’re approaching PMF
3. What to do if you aren’t