Shubham Saboo
Daily tips and tutorials on LLMs, RAG and AI Agents | Author of books on GPT-3 & Neural Search in Production | DM open for collaboration
Apr 4 12 tweets 5 min read
Huge AI agent updates from Anthropic, PayPal, Windsurf, Cognition AI, and more.

Multiple agents working in parallel was the highlight.

1. Devin 2.0 is here with a new agent-native IDE experience. It lets you run parallel Devins to take on multiple tasks at once. Starts at $20.

2. Genspark is a general-purpose superagent that can think, plan, act, and use tools to handle all your everyday tasks. Rather than using a computer in a sandboxed VM, it uses its in-house system to directly call APIs whenever needed.

Outperforms Manus AI and OpenAI Operator.
Mar 29 8 tweets 3 min read
Multi-agent teams need long-term memory to maintain context and learn continuously.

5 frameworks for AI agent memory.

100% opensource.

1. Mem0 is an intelligent memory layer for AI agents, enabling them to actively learn from and adapt to user interactions over time.

It combines LLMs with vector storage:
- LLMs process key information from conversations
- Vector storage performs semantic search
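
To make the "store, then search semantically" idea concrete, here is a minimal sketch in plain Python. It is not Mem0's API: `ToyMemory`, `embed`, and `cosine` are hypothetical names, and the "embedding" is just a bag-of-words count vector standing in for a real embedding model. The LLM extraction step is only noted in a comment.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy 'embedding': a bag-of-words count vector (stand-in for a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyMemory:
    """Hypothetical memory layer: store facts, retrieve the most similar ones."""

    def __init__(self) -> None:
        self._items = []  # list of (fact, vector) pairs

    def add(self, fact: str) -> None:
        # In a real system an LLM would first extract the key fact
        # from the conversation; here the fact is passed in directly.
        self._items.append((fact, embed(fact)))

    def search(self, query: str, k: int = 1) -> list:
        q = embed(query)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [fact for fact, _ in ranked[:k]]


memory = ToyMemory()
memory.add("User prefers vegetarian food")
memory.add("User lives in Berlin")
memory.add("User is learning Rust")
top = memory.search("Which food does the user like?")
print(top)
```

With a real embedding model the search would also match paraphrases that share no words with the stored fact, which this word-overlap stand-in cannot do.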
Mar 28 13 tweets 5 min read
10 awesome MCP servers to supercharge your workflows on Claude, Cursor, Cline, and Windsurf.

100% opensource.

1. Firecrawl MCP server lets clients scrape, crawl, search, extract, and deep research websites. It can also convert websites into LLM-ready text right in your editor.
Mar 20 5 tweets 2 min read
OpenAI just released Voice AI Agents with the GPT-4o Audio model.

You can now build amazing voice AI Agents using the OpenAI Agents SDK in a few lines of Python code.

Quickstart code to build Voice AI Agents with function calling in Python.
Mar 20 7 tweets 4 min read
5 AI agent frameworks to build multi-agent applications.

100% opensource.

They aren't LangChain, Crew AI, or OpenAI Agents SDK.

1. Motia is an AI agent framework built for software engineers. Build agents in Python, TypeScript, JavaScript or Ruby.

github.com/MotiaDev/motia

2. Agno is a lightweight Python library for building Multimodal Agents with memory, knowledge, and tools.

• ~10,000x faster than LangGraph
• Build teams of specialized agents
• Make Agents respond with structured data

github.com/agno-agi/agno
Mar 11 7 tweets 3 min read
OpenAI just released Agents SDK for building multi-agent apps in Python.

It comes with 3 key features:
1. Handoffs: Allows an agent to delegate tasks to another agent
2. Guardrails: Safety checks for input & output validation
3. Tracing: Built-in visual tracking of agent runs.

Handoffs allow an agent to delegate tasks to another agent.

It's particularly useful in scenarios where different agents specialize in distinct areas.
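
The handoff pattern can be illustrated without any SDK. The sketch below is plain Python and deliberately does not use the OpenAI Agents SDK's real classes: `ToyAgent`, its `topics` field, and the agent names are all hypothetical, and the "answer" is a formatted string standing in for an LLM call.

```python
from dataclasses import dataclass, field


@dataclass
class ToyAgent:
    """Hypothetical agent: answers topics it knows, hands off the rest."""
    name: str
    topics: set
    handoffs: list = field(default_factory=list)

    def run(self, topic: str, question: str) -> str:
        if topic in self.topics:
            # Stand-in for an LLM call made by this agent.
            return f"{self.name} answers: {question}"
        for other in self.handoffs:
            if topic in other.topics:
                # Handoff: delegate the whole task to the specialist.
                return other.run(topic, question)
        return f"{self.name}: no agent available for '{topic}'"


billing = ToyAgent("BillingAgent", {"billing"})
support = ToyAgent("SupportAgent", {"technical"})
triage = ToyAgent("TriageAgent", set(), handoffs=[billing, support])

result = triage.run("billing", "Why was I charged twice?")
print(result)
```

Here the triage agent knows nothing itself and only decides who should take over, which is the scenario where specialized agents pay off. In a real agent framework the routing decision would be made by the model rather than by a topic label.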
Mar 11 5 tweets 3 min read
Opensource alternative to Manus AI Agent is blowing up on GitHub.

OWL is an Autonomous AI Agent framework that can research, browse and code with multi-agent collaboration.

Works with Claude Sonnet 3.7, DeepSeek, GPT-4o and even local LLMs with Ollama.

100% Opensource.

OWL ranks #1 on the GAIA Benchmark among open-source projects with a 58.18 avg score.

Try it out yourself: github.com/camel-ai/owl
Feb 22 6 tweets 3 min read
Train your own DeepSeek R1-like reasoning model with just 5GB of RAM.

It enables training custom reasoning models with 10x longer context while using 90% less VRAM.

100% Opensource.

I have created 50+ AI Agents and RAG tutorials, 100% free and opensource.

Two simple steps to get started:
1. Subscribe to Unwind AI (for free): theunwindai.com
2. Star the repo: github.com/Shubhamsaboo/a…

New AI Agents and RAG tutorials added every week.
Feb 8 4 tweets 2 min read
AI Agents can now do Object Detection

AI agents can now detect objects through pure reasoning: just provide a text prompt and an image.

Zero training data needed. No labelling required.

Find all the awesome LLM Apps with AI Agents and RAG in the following Github Repo.

P.S: Don't forget to star the repo to show your support 🌟

github.com/Shubhamsaboo/a…
Feb 5 8 tweets 4 min read
4 alternatives to OpenAI’s $200/month Deep Research AI Agent.

100% Opensource and $0 subscription fees.

1. DeepResearch by Jina AI

It uses Gemini Flash, Brave/DuckDuckGo, and Jina Reader to mimic OpenAI's agentic search, read, and reasoning process.

100% Opensource.

github.com/jina-ai/node-D…
Feb 1 7 tweets 3 min read
3 alternatives to OpenAI's $200/month Operator-like Agent.

100% free and Opensource.

1. Browser Use

It lets your AI Agent use a web browser like a human in a few lines of Python Code.

Works with Claude Sonnet 3.5, GPT-4o and Llama 3.

100% Opensource.
Jan 26 4 tweets 2 min read
After DeepSeek R1, there's a new OpenAI o1-level model from China that outperforms Claude Sonnet 3.5 & GPT-4o.

Meet Kimi k1.5 - a multimodal model with advanced chain-of-thought reasoning and real-time web search.

And it's 100% FREE with unlimited chats. Let that sink in.
Jan 21 4 tweets 2 min read
DeepSeek R1 is 100% Opensource and 96.4% cheaper than OpenAI o1 while delivering similar performance.

OpenAI o1: $60.00 per 1M output tokens
DeepSeek R1: $2.19 per 1M output tokens
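
The percentage follows directly from the two listed prices. A quick check in Python, using only the figures quoted above (the variable names are illustrative, and only output-token pricing is compared):

```python
O1_PRICE = 60.00  # USD per 1M output tokens, as quoted in the post
R1_PRICE = 2.19   # USD per 1M output tokens, as quoted in the post

# Relative saving: (old - new) / old
savings_pct = (O1_PRICE - R1_PRICE) / O1_PRICE * 100
# How many times more expensive o1 is per output token
price_ratio = O1_PRICE / R1_PRICE

print(f"R1 is {savings_pct:.2f}% cheaper, about {price_ratio:.1f}x less per output token")
```

The saving works out to roughly 96.35%, which the post rounds up to 96.4%.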

People with a $200 ChatGPT subscription, let that sink in.

I will be adding more AI Agent apps using DeepSeek R1 in the future.

Jan 16 4 tweets 2 min read
Most advanced Agentic AI Researcher by Google Gemini.

A team of AI Agents work together to:
- Draft a plan
- Search the web
- Analyze results
- Create a well-researched report

All in under 2 minutes.
Jan 3 4 tweets 2 min read
Build production-ready multi-agent AI workflows with full observability in Python.

Works with Claude Sonnet 3.5 and gpt-4o.

100% Opensource.
Dec 31, 2024 4 tweets 2 min read
Awesome LLM Apps just crossed 10k+ stars on GitHub

Greatest time to build in public and opensource.

50+ LLM apps, AI Agents, and RAG tutorials with step-by-step instructions.

100% free and Opensource code.
Dec 28, 2024 4 tweets 2 min read
AI Agents, RAG, and LLMs just had their biggest week.

And the releases are mind-blowing.

Here's everything you need to know:

1. DeepSeek V3 Base model shows impressive performance
→ 671B parameter Mixture-of-Experts model
→ 37B parameters activated per token
→ Outperforms Claude 3.5 Sonnet, GPT-4, and Llama 3

2. OASIS launches social media simulator
→ Model up to 1 million AI agent interactions
→ Study behavior across Twitter and Reddit
→ Analyze information spread and group dynamics

3. Cheshire Cat AI is an opensource framework for production AI agents
→ Built with Docker at its core
→ Process documents and connect to external APIs
→ Supports both commercial and open-source LLMs

4. Humanloop releases comprehensive RAG evaluation suite
→ Automated evaluations through code or UI
→ Human feedback collection capabilities
→ Real-time performance monitoring

5. Hercules emerges as first opensource testing agent
→ Converts Gherkin steps to end-to-end tests
→ No coding skills required
→ Handles complex testing automation

Get the full scoop in my AI newsletter:

P.S: Don't forget to subscribe for FREE to get these updates directly in your inbox.

theunwindai.com/p/turn-ai-agen…
Dec 27, 2024 4 tweets 2 min read
I built a RAG Routing Agent that knows exactly where to look.

It automatically routes queries to the right knowledge base and falls back to web search when needed.
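
The routing idea can be sketched in a few lines of Python. This is not the tutorial's code: the knowledge base names, their keywords, and the `route` function are hypothetical, and simple keyword overlap stands in for whatever the real agent uses to pick a knowledge base.

```python
import re

# Hypothetical knowledge bases, each described by a few keywords.
KNOWLEDGE_BASES = {
    "products": {"price", "product", "feature", "plan"},
    "support": {"error", "bug", "install", "crash"},
    "finance": {"invoice", "refund", "payment", "tax"},
}


def route(query: str, min_overlap: int = 1) -> str:
    """Return the best-matching knowledge base, or 'web_search' as a fallback."""
    words = set(re.findall(r"[a-z]+", query.lower()))
    scores = {name: len(words & keywords) for name, keywords in KNOWLEDGE_BASES.items()}
    best = max(scores, key=scores.get)
    # Fall back to web search when no knowledge base matches well enough.
    return best if scores[best] >= min_overlap else "web_search"


kb_choice = route("How do I get a refund for my last invoice?")
fallback_choice = route("Who won the chess world championship?")
print(kb_choice, fallback_choice)
```

The threshold is the important design choice: without it, every query would be forced into some knowledge base even when none is relevant.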

100% Opensource Code with step-by-step tutorial.

Step-by-step tutorial with Code:

theunwindai.com/p/build-a-rag-…
Dec 24, 2024 4 tweets 2 min read
I built a multimodal AI medical image diagnosis agent using Gemini 2.0

This AI Agent can:
• Analyze medical scans
• Detect abnormalities
• Search the web
• Generate patient-friendly reports

100% Opensource Code with step-by-step tutorial.

STRICTLY for educational purposes. Not for any kind of medical advice.

Dec 19, 2024 4 tweets 2 min read
Drag & Drop to build AI Agents.

And deploy as an API in one click, no code needed.

100% Opensource.

Try it out now: pyspur.dev

Find all the awesome LLM Apps tutorials with RAG and AI agents in my AI newsletter for developers.

P.S: Don't forget to subscribe for FREE to access future tutorials.

theunwindai.com/archive?tags=A…
Dec 14, 2024 4 tweets 2 min read
I built an automated AI design team with multi-agents.

It has 3 multimodal AI agents working together as a team:

1. Visual Design Agent
2. UX Analysis Agent
3. Market Analysis Agent
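
One way such a team could be coordinated is to run the three agents on the same input concurrently and merge their reports. The sketch below assumes the agents are independent of each other, which the post does not state; the function names are hypothetical and each body is a placeholder for a multimodal LLM call. The tutorial's actual orchestration may differ.

```python
from concurrent.futures import ThreadPoolExecutor


def visual_design_agent(brief: str) -> str:
    # Placeholder for a multimodal LLM call.
    return f"visual design review of {brief}"


def ux_analysis_agent(brief: str) -> str:
    # Placeholder for a multimodal LLM call.
    return f"UX analysis of {brief}"


def market_analysis_agent(brief: str) -> str:
    # Placeholder for an LLM call with web search.
    return f"market analysis of {brief}"


AGENTS = {
    "visual": visual_design_agent,
    "ux": ux_analysis_agent,
    "market": market_analysis_agent,
}


def run_team(brief: str) -> dict:
    """Run every agent on the same brief concurrently and collect the reports."""
    with ThreadPoolExecutor(max_workers=len(AGENTS)) as pool:
        futures = {name: pool.submit(agent, brief) for name, agent in AGENTS.items()}
        return {name: future.result() for name, future in futures.items()}


reports = run_team("landing page mockup")
for name, report in reports.items():
    print(f"{name}: {report}")
```

Threads suit this case because real agent calls spend most of their time waiting on network responses.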

100% Opensource Code with step-by-step tutorial.

Step-by-step tutorial with Code:

theunwindai.com/p/build-a-mult…