Aakash Gupta Profile picture
Jul 14 8 tweets 2 min read Read on X
Context engineering is the new prompt engineering.

Here's everything you need to know: Image
1. Etymology

It's unclear who coined the term, but two folks have been particularly important in its rise:

Andrej Karpathy and Dex Horthy.
2. What It Is

It's all about thinking beyond the prompt - thinking about ALL of the tools you have to drive success.

5 are the most important to understand:

a. RAG
b. Memory
c. State/ History
d. Prompt Engineering
e. Structured Outputs
3. Why It Matters

While it's somewhat important for LLM product features, it's incredibly important for AI agents.

Here's why:

a. RAG

When the agent doesn’t “know” enough, you dynamically inject relevant content
b. Memory

So the agent remembers what you've told it before, you store key information from previous conversations

c. State/History

To keep track of multi-step tasks, you maintain context about what's been done and what comes next
e. Structured Outputs

In order to get consistent outputs from the agent...

You tell it exactly how to structure the output, kind of like code (eg, id + query + source_type)
4. How to Implement It

Start by mapping your agent's workflow. Identify where context gaps occur, then layer in these tools.

Build your first agent.
Experiment in small prototypes.
Layer in these techniques as needed.
Create a great evals system to "hill climb."
Hope this helps 💜
Aakash

P.S. What agents have you been building?

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Aakash Gupta

Aakash Gupta Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @aakashg0

Jul 11
Which is it: use LLMs to improve the prompt, or is that over-engineering?

By now, we've all seen a 1000 conflicting prompt guides.

So, I wanted to understand:

• What do actual studies say?
• What do experts at OpenAI, Anthropic, & Google say?

Here are the answers: Image
I spent the past month in Google Scholar, figuring it out.

I firmed up the learnings with Miqdad Jaffer at OpenAI.

Some of my favorite takeaways from the research:
1. It's not just revenue, but cost

You have to realize that APIs charge by number of input and output tokens.

An engineered prompt can deliver the same quality with 76% cost reduction.

We're talking $3,000 daily vs $706 daily for 100k calls.
Read 10 tweets
Jul 5
If you're preparing for PM interviews in 2025, there's one question type you cannot afford to mess up: Metrics.

Here's the history of how it over took product hiring and why it's the silent killer of PM dreams: Image
PMs have been getting hit with questions like these...

And while they’re not as sexy as product sense or design...

They’ve quietly become non-negotiable in most interviews. Image
So, what happened?

In the late 2000s, Big Tech needed a way to simulate real PM work...

- Something they could ask in 45 minutes
- That showed judgment under pressure
- And gave them clean signals across a large candidate pool.

Metrics interviews were perfect.
Read 9 tweets
Jun 26
Jira Product Discovery (JPD) just launched their biggest update yet.

It will take them from 18K → 36K customers.

Here’s why I think so: Image
1. From Teams → Teams of Teams

I covered Jira Product Discovery (JPD) earlier this year.

They were really good for a product team.

Now, with this month’s launch of Premium, they are really good for multiple product teams.

This is a huge unlock…
2. The Lack of Standardization Problem

For a Head of Product or CPO that has multiple things they are taking care of, it’s hard to get a central view of everything.

Everyone has a slightly different roadmap template.

JPD allows you to centralize all of that!
Read 9 tweets
Apr 21
AI prototyping has changed what it means to be a PM, designer, and engineer in forward-thinking organizations.

Here's how: Image
The Old Way

Here’s what most product development lifecycles look like:

1. Ideation

Most teams barely prototype at the idea stage.

A rare few exceptional designers and PMs do (~5%)
2. Planning

Here, more teams use prototypes, but it still is an exception few (~15%), while sketches and mockups are much more common (>75%)

3. Discovery

In more empowered companies, many teams would test prototypes in the discovery phase (~50%)
Read 10 tweets
Apr 17
OpenAI released that there will be 5 levels of AGI.

If you want to build the future of AI, you should deeply understand it.

We are just crossing step 2 of 5 to AGI.

Yet, somehow, teams are still building like we are in levels 1 or 3.

Let me explain: Image
LEVEL 1: CONVERSATIONAL AI

Remember those awkward chatbots from 2019?

They sounded human...until they didn’t.

You’d ask for help…
They’d return gibberish.
That’s Level 1.

Might be useful but it can't be your strategic moat.
LEVEL 2: REASONING AI

This is where we are right now and it’s the real unlock.

Today’s top models (like GPT-4.1, released this week) can:

→ Break down complex problems
→ Think like PhDs
→ Make sense of ambiguity
→ Power analytics, personalization & decision support
Read 7 tweets
Feb 24
I've seen my fair share of product development processes.

JPD's approach stands out as particularly principled and well thought out.

Here are the five most important things about how they build product: Image
𝗙𝗮𝗰𝗲𝘁 𝗢𝗻𝗲 - 𝗧𝗵𝗲 𝗟𝗶𝗴𝗵𝘁𝗵𝗼𝘂𝘀𝗲 𝗣𝗵𝗶𝗹𝗼𝘀𝗼𝗽𝗵𝘆

As Catalin Bridinel, Head of Design, explains:

"The product is a ship, and the user is a lighthouse that gives you direction."

This is more than a cute metaphor - it's a fundamental operating principle.
It manifested, for instance, in the early access program stages:

Step 1 - Deep dive with 10 carefully selected customers
Step 2 - Expand to 100 customers for broader validation
Step 3 - Then 1000 and GA

And it does in a million little other ways.
Read 13 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(