Shubham Saboo Profile picture
Daily tips and tutorials on LLMs, RAG and AI Agents | Author of books on GPT-3 & Neural Search in Production | DM open for collaboration
12 subscribers
Jun 10 5 tweets 2 min read
This is crazy!

Meet the world's first fully Agentic Download Agent and AI Drive.

It can literally search, download, and organize any files instantly in the AI drive from just a single prompt.

Works with PDFs, images, videos, music, and Office documents.

3 wild examples: 1. Batch Download Images Instantly:

The Download Agent searches trusted platforms like Pixabay and Unsplash, gathers your images, and places them in a dedicated folder in your AI Drive.

One prompt does it all—no more tedious clicking or sorting.
Jun 7 14 tweets 5 min read
Build an Agentic RAG systems that can Reason using Claude 4 and OpenAI Embedder (step-by-step instructions): 1. Install the necessary Python Libraries

Run the following commands from your terminal to install the required libraries: Image
Jun 5 6 tweets 2 min read
OpenAI Operator costs $200 per month.

But this AI web agent literally blew my mind. I have been testing it for a while and it has automated hours of boring work for me.

5 amazing use cases:

1. Get top 10 AI stories on Hacker News in the last 24 hours and add it to my Google Doc. 2. Find top job openings in Austin that match my resume and create a Google Sheet with Company, Title, and Salary columns.
May 28 6 tweets 2 min read
I vibe coded Instagram clone with this AI Agent using Claude 4 in less than 5 minutes.

Ship full-stack apps using Claude 4 without writing a single line of code.

Let that sink in. Emergent is an agentic coding platform to build web apps, games, SaaS, Chrome extensions, and everything else.

It doesn't just generate code snippets, it builds entire systems with databases, APIs, authentication, and infrastructure, all with just simple prompts.
May 27 8 tweets 4 min read
5 Autonomous General Purpose AI Agents that can literally do your work.

1. II-agent can think, code, reason and browse in a single loop just like humans.

Outperforms other AI Agent frameworks like Manus AI, Genspark AI, and OpenAI Deep Research.

100% opensource. 2. Skywork Super Agents can generate documents, slides, Excel sheets, webpages, and podcasts with deep research, using multi-agents.

These Super Agents can complete 8 hours of office work in just 8 minutes.
May 4 13 tweets 4 min read
10 MCP servers to supercharge your AI Agents. 1. Firecrawl MCP Server

Turns any site, even those rendered by JavaScript into ready‑to‑parse HTML or Markdown for your AI agent.
Apr 18 4 tweets 2 min read
I built an MCP AI Agent using Gemini Flash 2.5 with access to AirBnB and Google Maps in just 30 lines of Python Code.

100% Opensource Code. I have created 50+ AI Agents and RAG tutorials, 100% free and opensource.

Two simple steps to get started:
1. Subscribe to Unwind AI (for free): theunwindai.com
2. Star the repo: github.com/Shubhamsaboo/a…

New AI Agents and RAG tutorials added every week.
Apr 16 13 tweets 4 min read
OpenAI just launched o3 and o4-mini with agentic tool use.

It's been just a few hours, and the internet is filled with incredible AI examples.

10 wild examples:

1. o3 creates a movie without video tools by sketching each frame of an otter and airplane scene and stitching them into a GIF, all in one shot.

Apr 16 4 tweets 2 min read
Web Action AI Agent that doesn’t just scrape but finds the data you need.

Firecrawl launched FIRE-1, an AI agent that navigates complex websites, interacts with buttons, fills forms, and gathers data beyond traditional scraping.

No manual steps required. FIRE-1 AI Agent is available to use on Firecrawl starting today.

Learn more about it here: firecrawl.dev/blog/launch-we…
Apr 14 8 tweets 3 min read
Let's build & deploy production grade Gemini AI Agents in 3 simple steps using Google Cloud Vertex AI Engine.

Works with LangChain, LlamaIndex, and other Agent frameworks. Image 1. Install Vertex AI SDK for Python

Install the latest version of the Vertex AI SDK for Python as well as extra dependencies related to Agent Engine and LangChain: Image
Apr 4 12 tweets 5 min read
Huge AI agent updates from Anthropic, PayPal, Windsurf, Cognition AI, and more.

Multiple agents working in parallel was the highlight.

1. Devin 2.0 is here with a new agent-native IDE experience. It lets you run parallel Devins to take on multiple tasks at once. Starts at $20. 2. Genspark is a general-purpose superagent that can think, plan, act, and use tools to handle all your everyday tasks. Rather than using a computer in a sandboxed VM, it uses its in-house system to directly call APIs whenever needed.

Outperforms Manus AI and OpenAI Operator.
Mar 29 8 tweets 3 min read
Multi-agent team need long-term memory to maintain context and learn continuously.

5 frameworks for AI agent memory.

100% opensource. 1. Mem0 is an intelligent memory layer for AI agents, enabling them to actively learn from and adapt to user interactions over time.

It combines LLMs with vector storage:
- LLMs process key information from conversations
- Vector storage performs semantic search Image
Mar 28 13 tweets 5 min read
10 awesome MCP servers to supercharge your workflows on Claude, Cursor, Cline, and Windsurf.

100% opensource. 1. Firecrawl MCP server lets clients scrape, crawl, search, extract, and deep research websites. It can also convert websites into LLM-ready text right in your editor.
Mar 20 5 tweets 2 min read
OpenAI just released Voice AI Agents with GPT-4o Audio model.

You can now build amazing voice AI Agents using OpenAI Agents SDK in a few lines of Python code. Image Quickstart code to build Voice AI Agents with function calling in Python. Image
Mar 20 7 tweets 4 min read
5 AI agent frameworks to build multi-agent applications.

100% opensource.

They aren't LangChain, Crew AI, or OpenAI Agents SDK.

1. Motia is an AI agent framework is built for Software Engineers. Build agents in Python, TypeScript, JavaScript or Ruby.

github.com/MotiaDev/motiaImage 2. Agno is a lightweight Python library for building Multimodal Agents with memory, knowledge, and tools.

• ~10,000x faster than LangGraph
• Build teams of specialized agents
• Make Agents respond with structured data

github.com/agno-agi/agnoImage
Mar 11 7 tweets 3 min read
OpenAI just released Agents SDK for building multi-agent apps in Python.

It comes with 3 key features:
1. Handoffs: Allows an agent to delegate tasks to another agent
2. Guardrails: Safety checks for input & output validation
3. Tracing: Built-in visual tracking of agent runs. Image Handoffs allow an agent to delegate tasks to another agent.

It's particularly useful in scenarios where different agents specialize in distinct areas. Image
Mar 11 5 tweets 3 min read
Opensource alternative to Manus AI Agent is blowing up on GitHub.

OWL is an Autonomous AI Agent framework that can research, browse and code with multi-agent collaboration.

Works with Claude Sonnet 3.7, DeepSeek, GPT-4o and even local LLMs with Ollama.

100% Opensource. OWL ranks 1 on the GAIA Benchmark among open-source projects with a 58.18 avg score.

Try it out yourself: github.com/camel-ai/owl
Feb 22 6 tweets 3 min read
Train your own DeepSeek R1 like reasoning model with just 5GB of RAM.

It enables training custom reasoning models with 10x longer context while using 90% less VRAM.

100% Opensource. Image I have created 50+ AI Agents and RAG tutorials, 100% free and opensource.

Two simple steps to get started:
1. Subscribe to Unwind AI (for free): theunwindai.com
2. Star the repo: github.com/Shubhamsaboo/a…

New AI Agents and RAG tutorials added every week. Image
Feb 8 4 tweets 2 min read
AI Agents can now do Object Detection

AI agents can now detect objects through pure reasoning - just provide text prompt & image.

Zero training data needed. No labelling required. Find all the awesome LLM Apps with AI Agents and RAG in the following Github Repo.

P.S: Don't forget to star the repo to show your support 🌟

github.com/Shubhamsaboo/a…
Feb 5 8 tweets 4 min read
4 alternatives to OpenAI’s $200/month Deep Research AI Agent.

100% Opensource and $0 subscription fees. 1. DeepResearch by Jina AI

It uses Gemini Flash, Brave/DuckDuckGo, and Jina Reader, to mimic OpenAI's agentic search, read, and reasoning process.

100% Opensource.

github.com/jina-ai/node-D…
Feb 1 7 tweets 3 min read
3 alternatives to OpenAI's $200/month Operator like Agent.

100% free and Opensource. 1. Browser Use

It lets your AI Agent use a web browser like a human in a few lines of Python Code.

Works with Claude Sonnet 3.5, gpt-4o and Llama 3. 100% Opensource.