Latest Twitter Threads by @Saboo_Shubham_ on Thread Reader App

Aug 15 • 15 tweets • 10 min read

I started building Opensource AI Agents and RAG Apps 8 months ago.

It turned out to be the fastest way to learn AI engineering.

If you want to start today, here’s a step-by-step roadmap: 1. Python Programming for AI

• Harvard CS50's Introduction to AI with Python
pll.harvard.edu/course/cs50s-i…

• DeepLearning AI's AI Python for Beginners
deeplearning.ai/short-courses/… DeepLearning.AI

Aug 12 • 4 tweets • 2 min read

LangChain literally reverse-engineered Claude Code and Manus AI to build Deep Agents.

It's a Python library to turn any LLM into a deep thinking agent with MCP tools.

100% opensource.

100+ free step-by-step tutorials with code covering:

🚀 AI Agents
📀 RAG Systems
🗣️ Voice AI Agents
🌐 MCP AI Agents
🤝 Multi-agent Teams
🎮 Autonomous Game Playing Agents

P.S: Don't forget to subscribe for FREE to access future tutorials.

theunwindai.com

Aug 10 • 4 tweets • 2 min read

Google just released LangExtract Python library.

It can extract structured data from unstructured docs with precise sources in just a few lines of code.

100% Opensource.

Aug 2 • 7 tweets • 2 min read

Let's compare Kimi k2, Qwen3 Coder, DeepSeek R1, GLM-4.5 and Claude Sonnet 4 for AI code generation.

1. Kimi K2

2. Qwen3 Coder by @AlibabaGroup

3D grid of particles animated by a passing gravitational wave.

Jul 23 • 9 tweets • 3 min read

This new Claude 4 level coding model from China outperforms DeepSeek R1, Kimi k2, Gemini 2.5 Pro and OpenAI GPT-4.1.

Meet Qwen3-Coder - a 480B MoE model with 35B active parameters that excels in both coding and agentic tasks.

And it's 100% Opensource. Let that sink in.

Plus, you can use this with the newly launched opensource Qwen Code.

A terminal based AI workflow tool inspired by Gemini CLI.

GitHub Repo: github.com/QwenLM/qwen-co…

Jul 11 • 10 tweets • 4 min read

After DeepSeek R1, there's new Claude 4 level model from China that outperforms DeepSeek v3, Qwen and OpenAI GPT-4.1

Meet Kimi k2 - 1 trillion parameter model purpose-built for agentic workflows with native MCP integration.

100% Opensource and FREE to try. Let that sink in.

Kimi k2 by @Kimi_Moonshot is a SOTA AI coding model that gives amazing 1 shot results at coding prompts.

Jul 10 • 4 tweets • 2 min read

Grok 4 builds an AI Investment Agent team just by going through the documentation.

Replit lets me run, test and deploy that agent team directly from the browser.

All of this in less than 2 mins.

I have created 100+ AI Agents and RAG tutorials, 100% free and opensource.

P.S: Don't forget to star the repo to show your support 🌟
github.com/Shubhamsaboo/a…

Jul 6 • 9 tweets • 3 min read

Build a Customer Support Ticket Agent with Structured output using Google Agent Development Kit.

100% Opensource code with step-by-step tutorial: 1. Install the necessary Python Libraries

Run the following commands to install the required libraries:

Jul 5 • 8 tweets • 3 min read

Build an AI Agent with Structured Output using Google ADK (step-by-step instructions): What Are Structured Outputs?

Think of structured outputs as a strict dress code for your AI agent's responses.

Instead of getting random, unpredictable text, you get perfectly formatted JSON every single time.

Let's build our AI Agent with structured output 👇

Jun 21 • 4 tweets • 2 min read

This Chinese AI model just changed document OCR forever.

It can parse complex documents with text, tables, formulas and figures in parallel simultaneously using task-specific prompts.

100% opensource.

Dolphin uses a smart two-stage parsing approach.

Stage 1: Analyzes page layout and generates element sequence in natural reading order.

Stage 2: Parses all document elements simultaneously using task-specific prompts.

Jun 10 • 5 tweets • 2 min read

This is crazy!

Meet the world's first fully Agentic Download Agent and AI Drive.

It can literally search, download, and organize any files instantly in the AI drive from just a single prompt.

Works with PDFs, images, videos, music, and Office documents.

3 wild examples:

1. Batch Download Images Instantly:

The Download Agent searches trusted platforms like Pixabay and Unsplash, gathers your images, and places them in a dedicated folder in your AI Drive.

One prompt does it all—no more tedious clicking or sorting.

Jun 7 • 14 tweets • 5 min read

Build an Agentic RAG systems that can Reason using Claude 4 and OpenAI Embedder (step-by-step instructions): 1. Install the necessary Python Libraries

Run the following commands from your terminal to install the required libraries:

Jun 5 • 6 tweets • 2 min read

OpenAI Operator costs $200 per month.

But this AI web agent literally blew my mind. I have been testing it for a while and it has automated hours of boring work for me.

5 amazing use cases:

1. Get top 10 AI stories on Hacker News in the last 24 hours and add it to my Google Doc.

2. Find top job openings in Austin that match my resume and create a Google Sheet with Company, Title, and Salary columns.

May 28 • 6 tweets • 2 min read

I vibe coded Instagram clone with this AI Agent using Claude 4 in less than 5 minutes.

Ship full-stack apps using Claude 4 without writing a single line of code.

Let that sink in.

Emergent is an agentic coding platform to build web apps, games, SaaS, Chrome extensions, and everything else.

It doesn't just generate code snippets, it builds entire systems with databases, APIs, authentication, and infrastructure, all with just simple prompts.

May 27 • 8 tweets • 4 min read

5 Autonomous General Purpose AI Agents that can literally do your work.

1. II-agent can think, code, reason and browse in a single loop just like humans.

Outperforms other AI Agent frameworks like Manus AI, Genspark AI, and OpenAI Deep Research.

100% opensource.

2. Skywork Super Agents can generate documents, slides, Excel sheets, webpages, and podcasts with deep research, using multi-agents.

These Super Agents can complete 8 hours of office work in just 8 minutes.

May 4 • 13 tweets • 4 min read

10 MCP servers to supercharge your AI Agents. 1. Firecrawl MCP Server

Turns any site, even those rendered by JavaScript into ready‑to‑parse HTML or Markdown for your AI agent.

Apr 18 • 4 tweets • 2 min read

I built an MCP AI Agent using Gemini Flash 2.5 with access to AirBnB and Google Maps in just 30 lines of Python Code.

100% Opensource Code.

I have created 50+ AI Agents and RAG tutorials, 100% free and opensource.

Two simple steps to get started:
1. Subscribe to Unwind AI (for free): theunwindai.com
2. Star the repo: github.com/Shubhamsaboo/a…

New AI Agents and RAG tutorials added every week.

Apr 16 • 13 tweets • 4 min read

OpenAI just launched o3 and o4-mini with agentic tool use.

It's been just a few hours, and the internet is filled with incredible AI examples.

10 wild examples:

1. o3 creates a movie without video tools by sketching each frame of an otter and airplane scene and stitching them into a GIF, all in one shot.

https://x.com/emollick/status/1912597487287705965

Apr 16 • 4 tweets • 2 min read

Web Action AI Agent that doesn’t just scrape but finds the data you need.

Firecrawl launched FIRE-1, an AI agent that navigates complex websites, interacts with buttons, fills forms, and gathers data beyond traditional scraping.

No manual steps required.

FIRE-1 AI Agent is available to use on Firecrawl starting today.

Learn more about it here: firecrawl.dev/blog/launch-we…

Apr 14 • 8 tweets • 3 min read

Let's build & deploy production grade Gemini AI Agents in 3 simple steps using Google Cloud Vertex AI Engine.

Works with LangChain, LlamaIndex, and other Agent frameworks.

1. Install Vertex AI SDK for Python

Install the latest version of the Vertex AI SDK for Python as well as extra dependencies related to Agent Engine and LangChain:

Apr 4 • 12 tweets • 5 min read

Huge AI agent updates from Anthropic, PayPal, Windsurf, Cognition AI, and more.

Multiple agents working in parallel was the highlight.

1. Devin 2.0 is here with a new agent-native IDE experience. It lets you run parallel Devins to take on multiple tasks at once. Starts at $20.

2. Genspark is a general-purpose superagent that can think, plan, act, and use tools to handle all your everyday tasks. Rather than using a computer in a sandboxed VM, it uses its in-house system to directly call APIs whenever needed.

Outperforms Manus AI and OpenAI Operator.

Share this page!

Enter URL or ID to Unroll