Daily tips and tutorials on LLMs, RAG and AI Agents | Author of books on GPT-3 & Neural Search in Production | DM open for collaboration
12 subscribers
Apr 4 • 12 tweets • 5 min read
Huge AI agent updates from Anthropic, PayPal, Windsurf, Cognition AI, and more.
Multiple agents working in parallel was the highlight.
1. Devin 2.0 is here with a new agent-native IDE experience. It lets you run parallel Devins to take on multiple tasks at once. Starts at $20.
2. Genspark is a general-purpose superagent that can think, plan, act, and use tools to handle all your everyday tasks. Rather than using a computer in a sandboxed VM, it uses its in-house system to directly call APIs whenever needed.
Outperforms Manus AI and OpenAI Operator.
Mar 29 • 8 tweets • 3 min read
Multi-agent team need long-term memory to maintain context and learn continuously.
5 frameworks for AI agent memory.
100% opensource.
1. Mem0 is an intelligent memory layer for AI agents, enabling them to actively learn from and adapt to user interactions over time.
It combines LLMs with vector storage:
- LLMs process key information from conversations
- Vector storage performs semantic search
Mar 28 • 13 tweets • 5 min read
10 awesome MCP servers to supercharge your workflows on Claude, Cursor, Cline, and Windsurf.
100% opensource.
1. Firecrawl MCP server lets clients scrape, crawl, search, extract, and deep research websites. It can also convert websites into LLM-ready text right in your editor.
Mar 20 • 5 tweets • 2 min read
OpenAI just released Voice AI Agents with GPT-4o Audio model.
You can now build amazing voice AI Agents using OpenAI Agents SDK in a few lines of Python code.
Quickstart code to build Voice AI Agents with function calling in Python.
Mar 20 • 7 tweets • 4 min read
5 AI agent frameworks to build multi-agent applications.
100% opensource.
They aren't LangChain, Crew AI, or OpenAI Agents SDK.
1. Motia is an AI agent framework is built for Software Engineers. Build agents in Python, TypeScript, JavaScript or Ruby.
github.com/MotiaDev/motia2. Agno is a lightweight Python library for building Multimodal Agents with memory, knowledge, and tools.
• ~10,000x faster than LangGraph
• Build teams of specialized agents
• Make Agents respond with structured data
OpenAI just released Agents SDK for building multi-agent apps in Python.
It comes with 3 key features: 1. Handoffs: Allows an agent to delegate tasks to another agent 2. Guardrails: Safety checks for input & output validation 3. Tracing: Built-in visual tracking of agent runs.
Handoffs allow an agent to delegate tasks to another agent.
It's particularly useful in scenarios where different agents specialize in distinct areas.
Mar 11 • 5 tweets • 3 min read
Opensource alternative to Manus AI Agent is blowing up on GitHub.
OWL is an Autonomous AI Agent framework that can research, browse and code with multi-agent collaboration.
Works with Claude Sonnet 3.7, DeepSeek, GPT-4o and even local LLMs with Ollama.
100% Opensource.
OWL ranks 1 on the GAIA Benchmark among open-source projects with a 58.18 avg score.
AI Agents, RAG, and LLMs just had their biggest week.
And the releases are mind-blowing.
Here's everything you need to know:
1. DeepSeek V3 Base model shows impressive performance
→ 671B parameter Mixture-of-Experts model
→ 37B parameters activated per token
→ Outperforms Claude 3.5 Sonnet, GPT-4, and Llama 3
2. OASIS launches social media simulator
→ Model up to 1 million AI agent interactions
→ Study behavior across Twitter and Reddit
→ Analyze information spread and group dynamics
3. Cheshire Cat AI is an opensource framework for production AI agents
→ Built with Docker at its core
→ Process documents and connect to external APIs
→ Supports both commercial and open-source LLMs
4. Humanloop releases comprehensive RAG evaluation suite
→ Automated evaluations through code or UI
→ Human feedback collection capabilities
→ Real-time performance monitoring
5. Hercules emerges as first opensource testing agent
→ Converts Gherkin steps to end-to-end tests
→ No coding skills required
→ Handles complex testing automation
Get the full sccop in my AI newsletter: