Shubham Saboo
Daily tips and tutorials on AI Agents, RAG & LLMs | Author of books on GPT-3 & Neural Search in Production | DM open for collaboration
13 subscribers
Aug 15 15 tweets 10 min read
I started building Opensource AI Agents and RAG Apps 8 months ago.

It turned out to be the fastest way to learn AI engineering.

If you want to start today, here’s a step-by-step roadmap:

1. Python Programming for AI

• Harvard CS50's Introduction to AI with Python
pll.harvard.edu/course/cs50s-i…

• DeepLearning AI's AI Python for Beginners
deeplearning.ai/short-courses/…
Aug 12 4 tweets 2 min read
LangChain literally reverse-engineered Claude Code and Manus AI to build Deep Agents.

It's a Python library to turn any LLM into a deep thinking agent with MCP tools.

100% opensource.

100+ free step-by-step tutorials with code covering:

🚀 AI Agents
📀 RAG Systems
🗣️ Voice AI Agents
🌐 MCP AI Agents
🤝 Multi-agent Teams
🎮 Autonomous Game Playing Agents

P.S: Don't forget to subscribe for FREE to access future tutorials.

theunwindai.com
Aug 10 4 tweets 2 min read
Google just released the LangExtract Python library.

It can extract structured data from unstructured docs with precise sources in just a few lines of code.

100% Opensource.
Aug 2 7 tweets 2 min read
Let's compare Kimi K2, Qwen3 Coder, DeepSeek R1, GLM-4.5 and Claude Sonnet 4 for AI code generation.

1. Kimi K2

2. Qwen3 Coder by @AlibabaGroup

3D grid of particles animated by a passing gravitational wave.
Jul 23 9 tweets 3 min read
This new Claude 4-level coding model from China outperforms DeepSeek R1, Kimi K2, Gemini 2.5 Pro and OpenAI GPT-4.1.

Meet Qwen3-Coder - a 480B MoE model with 35B active parameters that excels in both coding and agentic tasks.

And it's 100% Opensource. Let that sink in.

Plus, you can use this with the newly launched opensource Qwen Code.

A terminal-based AI workflow tool inspired by Gemini CLI.

GitHub Repo: github.com/QwenLM/qwen-co…
Jul 11 10 tweets 4 min read
After DeepSeek R1, there's a new Claude 4-level model from China that outperforms DeepSeek V3, Qwen and OpenAI GPT-4.1.

Meet Kimi K2 - a 1-trillion-parameter model purpose-built for agentic workflows with native MCP integration.

100% Opensource and FREE to try. Let that sink in.

Kimi K2 by @Kimi_Moonshot is a SOTA AI coding model that gives amazing one-shot results on coding prompts.

Jul 10 4 tweets 2 min read
Grok 4 builds an AI Investment Agent team just by going through the documentation.

Replit lets me run, test and deploy that agent team directly from the browser.

All of this in less than 2 mins.

I have created 100+ AI Agents and RAG tutorials, 100% free and opensource.

P.S: Don't forget to star the repo to show your support 🌟
github.com/Shubhamsaboo/a…
Jul 6 9 tweets 3 min read
Build a Customer Support Ticket Agent with Structured output using Google Agent Development Kit.

100% Opensource code with step-by-step tutorial:

1. Install the necessary Python Libraries

Run the following commands to install the required libraries: [Image: installation commands]
Jul 5 8 tweets 3 min read
Build an AI Agent with Structured Output using Google ADK (step-by-step instructions):

What Are Structured Outputs?

Think of structured outputs as a strict dress code for your AI agent's responses.

Instead of getting random, unpredictable text, you get perfectly formatted JSON every single time.
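
To make the "dress code" idea concrete, here is a minimal sketch using only the Python standard library. It is not the Google ADK API and not the code from this tutorial: the TicketSummary schema, its field names, and the allowed priority values are all assumptions made up for illustration. It shows the general pattern of declaring the shape you expect and rejecting any model reply that does not match it.

```python
import json
from dataclasses import dataclass

# Hypothetical schema for illustration only; not taken from the tutorial.
ALLOWED_PRIORITIES = {"low", "medium", "high"}
EXPECTED_KEYS = {"title", "priority", "tags"}


@dataclass
class TicketSummary:
    title: str
    priority: str
    tags: list


def parse_ticket(raw: str) -> TicketSummary:
    """Parse a model reply and reject anything that breaks the schema."""
    data = json.loads(raw)  # raises ValueError (JSONDecodeError) on non-JSON text
    if not isinstance(data, dict):
        raise ValueError("reply must be a JSON object")
    if set(data) != EXPECTED_KEYS:
        raise ValueError(f"expected keys {sorted(EXPECTED_KEYS)}, got {sorted(data)}")
    if not isinstance(data["title"], str):
        raise ValueError("title must be a string")
    if not isinstance(data["priority"], str) or data["priority"] not in ALLOWED_PRIORITIES:
        raise ValueError("priority must be one of low, medium, high")
    tags = data["tags"]
    if not isinstance(tags, list) or not all(isinstance(t, str) for t in tags):
        raise ValueError("tags must be a list of strings")
    return TicketSummary(**data)


# A reply that follows the schema parses into a typed object.
ticket = parse_ticket('{"title": "Login fails", "priority": "high", "tags": ["auth"]}')
print(ticket)
```

Free-form text, missing fields, or an unexpected priority all raise ValueError, so downstream code only ever sees a well-formed TicketSummary.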

Let's build our AI Agent with structured output 👇 [Image]
Jun 21 4 tweets 2 min read
This Chinese AI model just changed document OCR forever.

It can parse complex documents with text, tables, formulas and figures in parallel using task-specific prompts.

100% opensource.

Dolphin uses a smart two-stage parsing approach.

Stage 1: Analyzes page layout and generates element sequence in natural reading order.

Stage 2: Parses all document elements simultaneously using task-specific prompts.
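
The two stages described above can be sketched roughly as follows. This is not Dolphin's actual code or API: the element dictionaries, the PROMPTS table, and the function names are hypothetical stand-ins, and no model is called. It only illustrates the control flow: order the elements first, then parse them concurrently with a prompt chosen by element type.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical task-specific prompts, keyed by element type.
PROMPTS = {
    "text": "Read the text in this region.",
    "table": "Parse this table into rows and columns.",
    "formula": "Transcribe this formula as LaTeX.",
}


def analyze_layout(page):
    """Stage 1 stand-in: order elements top to bottom, then left to right."""
    return sorted(page, key=lambda el: (el["top"], el["left"]))


def parse_element(element):
    """Stage 2 stand-in: pair one element with the prompt for its type.

    A real system would send the prompt and the cropped region to a model.
    """
    return {"kind": element["kind"], "prompt": PROMPTS[element["kind"]]}


def parse_page(page):
    ordered = analyze_layout(page)
    # After stage 1 the elements are independent, so they can be parsed concurrently.
    # executor.map yields results in input order, which preserves the reading order.
    with ThreadPoolExecutor(max_workers=4) as executor:
        return list(executor.map(parse_element, ordered))


page = [
    {"kind": "table", "top": 300, "left": 40},
    {"kind": "text", "top": 50, "left": 40},
    {"kind": "formula", "top": 180, "left": 40},
]
results = parse_page(page)
print([r["kind"] for r in results])
```

Splitting layout analysis from element parsing is what makes the second stage parallelizable: once the reading order is fixed, no element depends on another.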
Jun 10 5 tweets 2 min read
This is crazy!

Meet the world's first fully Agentic Download Agent and AI Drive.

It can literally search, download, and organize any files instantly in the AI drive from just a single prompt.

Works with PDFs, images, videos, music, and Office documents.

3 wild examples:

1. Batch Download Images Instantly:

The Download Agent searches trusted platforms like Pixabay and Unsplash, gathers your images, and places them in a dedicated folder in your AI Drive.

One prompt does it all—no more tedious clicking or sorting.
Jun 7 14 tweets 5 min read
Build an Agentic RAG system that can reason using Claude 4 and OpenAI Embedder (step-by-step instructions):

1. Install the necessary Python Libraries

Run the following commands from your terminal to install the required libraries: [Image: installation commands]
Jun 5 6 tweets 2 min read
OpenAI Operator costs $200 per month.

But this AI web agent literally blew my mind. I have been testing it for a while and it has automated hours of boring work for me.

5 amazing use cases:

1. Get the top 10 AI stories on Hacker News in the last 24 hours and add them to my Google Doc.

2. Find top job openings in Austin that match my resume and create a Google Sheet with Company, Title, and Salary columns.
May 28 6 tweets 2 min read
I vibe coded an Instagram clone with this AI Agent using Claude 4 in less than 5 minutes.

Ship full-stack apps using Claude 4 without writing a single line of code.

Let that sink in.

Emergent is an agentic coding platform to build web apps, games, SaaS, Chrome extensions, and everything else.

It doesn't just generate code snippets, it builds entire systems with databases, APIs, authentication, and infrastructure, all with just simple prompts.
May 27 8 tweets 4 min read
5 Autonomous General Purpose AI Agents that can literally do your work.

1. II-agent can think, code, reason and browse in a single loop just like humans.

Outperforms other AI Agent frameworks like Manus AI, Genspark AI, and OpenAI Deep Research.

100% opensource.

2. Skywork Super Agents can generate documents, slides, Excel sheets, webpages, and podcasts with deep research, using multi-agents.

These Super Agents can complete 8 hours of office work in just 8 minutes.
May 4 13 tweets 4 min read
10 MCP servers to supercharge your AI Agents.

1. Firecrawl MCP Server

Turns any site, even those rendered with JavaScript, into ready‑to‑parse HTML or Markdown for your AI agent.
Apr 18 4 tweets 2 min read
I built an MCP AI Agent using Gemini 2.5 Flash with access to Airbnb and Google Maps in just 30 lines of Python code.

100% Opensource Code.

I have created 50+ AI Agents and RAG tutorials, 100% free and opensource.

Two simple steps to get started:
1. Subscribe to Unwind AI (for free): theunwindai.com
2. Star the repo: github.com/Shubhamsaboo/a…

New AI Agents and RAG tutorials added every week.
Apr 16 13 tweets 4 min read
OpenAI just launched o3 and o4-mini with agentic tool use.

It's been just a few hours, and the internet is filled with incredible AI examples.

10 wild examples:

1. o3 creates a movie without video tools by sketching each frame of an otter and airplane scene and stitching them into a GIF, all in one shot.

Apr 16 4 tweets 2 min read
Web Action AI Agent that doesn’t just scrape but finds the data you need.

Firecrawl launched FIRE-1, an AI agent that navigates complex websites, interacts with buttons, fills forms, and gathers data beyond traditional scraping.

No manual steps required.

FIRE-1 AI Agent is available to use on Firecrawl starting today.

Learn more about it here: firecrawl.dev/blog/launch-we…
Apr 14 8 tweets 3 min read
Let's build & deploy production-grade Gemini AI Agents in 3 simple steps using Google Cloud Vertex AI Agent Engine.

Works with LangChain, LlamaIndex, and other Agent frameworks.

1. Install Vertex AI SDK for Python

Install the latest version of the Vertex AI SDK for Python as well as extra dependencies related to Agent Engine and LangChain: [Image: installation command]
Apr 4 12 tweets 5 min read
Huge AI agent updates from Anthropic, PayPal, Windsurf, Cognition AI, and more.

Multiple agents working in parallel was the highlight.

1. Devin 2.0 is here with a new agent-native IDE experience. It lets you run parallel Devins to take on multiple tasks at once. Starts at $20.

2. Genspark is a general-purpose superagent that can think, plan, act, and use tools to handle all your everyday tasks. Rather than using a computer in a sandboxed VM, it uses its in-house system to directly call APIs whenever needed.

Outperforms Manus AI and OpenAI Operator.