Aadit Sheth Profile picture
Jan 21 11 tweets 4 min read Read on X
ChatGPT was just step 1.

Google dropped a whitepaper on the next evolution of AI: Agents.

Now, everyone—from Satya Nadella to Jensen Huang—is talking about them.

Here’s why AI agents are the next big thing. Let's break it down. 🧵 Image
Image
Agents are like supercharged AI assistants.

Unlike standalone models, they:

• Use tools to access real-world info
• Make decisions autonomously
• Execute tasks in a loop until goals are achieved Image
Key difference: Agents vs. Models

• Models: Limited to training data, one-off predictions.
• Agents: Use tools for real-time data, manage ongoing tasks, and continuously adapt to user needs. Image
Agents have a "brain" called the orchestration layer that helps them make decisions.

It's like a conductor leading an orchestra, making sure everything works together perfectly! Image
Just like we use tools, agents need tools to interact with the world.

These can be:

1. Extensions

They are like a bridge between AI agents and APIs.

They:

• Teach agents how to use APIs with examples.
• Simplify complex tasks like booking flights or fetching live data. Image
2. Functions

These are like mini-programs that AI agents can run to do specific tasks.

They're handy tools that calculate distances, translate languages, or even analyze images.

With functions, agents can get things done efficiently. Image
3. Data Stores

Data stores are like vast libraries of information that AI agents can access.

They hold everything from flight details to movie reviews, and news articles to scientific research.

With data stores, agents can tap into a world of knowledge. Image
Now, how do agents "think"?

They follow a cycle:

1. First, they gather info.

2. Reason with frameworks like ReAct (logic) or Chain-of-Thought (step-by-step).

3. Execute actions.

4. Refine based on results. Image
Practical example:

Imagine a travel agent AI.

When you ask, "I want to book a flight from Austin to Zurich."

It uses an extension to connect to the airline's booking system (API).

The extension acts as a bridge, allowing the AI to understand your request and send the correct information to the API to find your flights!Image
If you want to build agents.

You can use tools like LangChain or Google Vertex AI.

• Define goals
• Add reasoning steps
• Test & refine

Why does this matter?

In the future, agents unlock complex real-world applications.

They bring AI into our daily lives, from managing customer support to handling financial workflows.

So to keep an eye on these advancements, follow me @aaditsh for more.Image
If you found this valuable, retweet below to share with your friends ♻️

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Aadit Sheth

Aadit Sheth Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @aaditsh

Jan 29
AI is reshaping the way we work, live, and create.

This year, we curated the Prompts Daily AI Awards 2025:

The best tools across Business, Productivity, Content Creation, and more.

Here's the full list: Image
1. Best in AI Assistants Image
@perplexity_ai @AnthropicAI @AgentDotAi 2. Best in Work Productivity Image
Read 10 tweets
Jan 24
Imagine an Al that books your dinner, orders groceries, and schedules meetings.

Now, stop imagining. It's here.

Meet Operator—OpenAl's first Al agent that sees, clicks, and acts like a real assistant.

Here's what it can do (and how it changes everything): 🧵 Image
What is Operator?

- It’s an AI system that uses a remote browser (in the cloud) to accomplish tasks for you.

- It can “see” the webpage like we do, click buttons, fill out forms, and more (no specialized API required).
Now, here are 4 top use cases everyone’s talking about!

1. Book a Restaurant

In the demo, Operator made a reservation via OpenTable.

It literally sees the webpage, fills out time/date forms, and confirms the booking.
Read 13 tweets
Jan 9
Day 1 of CES 2025 is amazing!

Here are the 14 wild releases so far (Mega thread)

1. The world's first stretchable screen by Samsung.
2. A 360° health mirror that scans your body, syncs data, and connects you to care teams.
3. A futuristic electric minivan with a foldable drone.
Read 15 tweets
Jan 3
Can AI really automate our jobs?

New research tested seven LLMs: Claude, GPT-4, Gemini, Amazon Nova, Meta Llama, and Alibaba Qwen.

Here's the jobs it could replace and the ones it could not 🧵: Image
For the context, this is how it works:

Imagine creating a virtual company where AI assistants have to do real jobs, from writing code to managing projects.

That's exactly what these researchers at Carnegie Mellon University did! Image
They gave the AIs 175 different tasks you'd find in a typical workplace.

Things like:
• Writing and fixing code
• Managing team projects
• Dealing with HR stuff
• Handling company finances Image
Read 10 tweets
Jan 1
2025 is the year of AI.

Here are the companies to watch out for: Image
1. Anthropic (Founded 2021)

Focus: “Constitutional AI” and safer large language models.

Why it matters: Their Claude model aims to make AI more transparent and aligned with human values.

Recommended for: Tech leaders focused on ethical AI development.
2. Cohere (Founded 2019)

Focus: Enterprise-grade LLMs and NLP solutions with privacy-focused APIs.

Why it matters: By delivering stable, customizable models, they help businesses integrate AI smoothly without big-tech lock-in.

Recommended for: IT managers and developers in need of secure, customizable AI solutions.Image
Read 13 tweets
Dec 12, 2024
The entertainment industry will never be the same.

OpenAI just dropped Sora, a text-to-video tool that feels like magic.

Here’s how it’s about to shake up how we create and consume stories 🧵
For starters, Sora provides creators with these capabilities:

• Generate videos from text prompts
• Create content from single-source images
• Edit videos using natural language
• Produce up to 20-second professional-quality clips
• More in the video below 👇
Now let's dive deep into some of the important features:

1. Remix

Replace, remove, or reimagine elements in your video.

Creators can now edit videos through simple language instructions.

No professional editing background is required.
Read 13 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(