Simplifying LLMs, AI Agents, RAGs and Machine Learning for you! β’ Co-founder @dailydoseofds_β’ BITS Pilani β’ 3 Patents β’ ex-AI Engineer @ LightningAI
24 subscribers
Apr 30 β’ 9 tweets β’ 3 min read
Let's fine-tune DeepMind's latest Gemma 3 (100% locally):
Before we begin, here's what we'll be doing.
We'll fine-tune our private and locally running Gemma 3.
To do this, we'll use:
- @UnslothAI for efficient fine-tuning.
- @ollama to run it locally.
Let's begin!
Apr 26 β’ 16 tweets β’ 5 min read
Let's build an MCP-powered multi-agent deep researcher (100% local):
Before we dive in, here's a quick demo of what we're building!
Tech stack:
- @Linkup_platform for deep web research
- @crewAIInc for multi-agent orchestration
- @Ollama to locally server DeepSeek
- @cursor_ai as MCP host
Let's go! π
Apr 21 β’ 10 tweets β’ 4 min read
Transformer vs. Mixture of Experts in LLMs, clearly explained (with visuals):
Mixture of Experts (MoE) is a popular architecture that uses different "experts" to improve Transformer models.
The visual below explains how they differ from Transformers.
Let's dive in to learn more about MoE!
Apr 17 β’ 12 tweets β’ 5 min read
10 MCP, AI Agents, and RAG projects for AI Engineers (with code):
1οΈβ£ Real-time Voice RAG Agent
In this project you'll learn how to build a real-time Voice RAG Agent.
You will also learn how to clone your voice in just 5 seconds.
MCP vs A2A (Agent2Agent) protocol, clearly explained:
Agentic applications require both A2A and MCP.
- MCP provides agents with access to tools.
- While A2A allows agents to connect with other agents and collaborate in teams.
Today, I'll clearly explain what A2A is and how it can work with MCP.
Apr 13 β’ 8 tweets β’ 3 min read
Traditional RAG vs. Graph RAG, clearly explained (with visuals):
top-k retrieval in RAG rarely works.
Imagine you want to summarize a biography where each chapter details a specific accomplishment of an individual.
Traditional RAG struggles because it retrieves only top-k chunks while it needs the entire context.
Apr 11 β’ 12 tweets β’ 4 min read
MCP is on fire.
AI agents can now talk to real world tools, apps and actually get stuff done.
This changes everything.
Here are 10 amazing examples:
1οΈβ£ WhatsApp MCP
Exchange images, videos, and voice notes on WhatsApp!
Pair it with the ElevenLabs MCP server for AI-powered transcription & audio messages with 3,000+ voices.
Check this outπ
Apr 10 β’ 14 tweets β’ 5 min read
Let's build a multi-agent brand monitoring system using DeepSeek-R1 (100% local):
Today, we're building a brand monitoring app that scraps web mentions and produces insights about a company.
Tech stack:
- Bright Data to scrape data at scale
- @crewAIInc for orchestration
- @ollama to serve DeepSeek locally
Let's go! π
Apr 6 β’ 13 tweets β’ 4 min read
Let's compare Llama 4 and DeepSeek-R1 using RAG:
Today, we're building a Streamlit app to compare MetaAI's Llama 4 against DeepSeek-R1 using RAG.
Tech stack:
- @Llama_Index workflows for orchestration
- @Cometml Opik for evaluation
- @GroqInc for blazing-fast inference (FREE)
Let's go! π
Mar 28 β’ 15 tweets β’ 4 min read
Let's build our own reasoning LLM using Reinforcement fine-tuning:
RFT allows you to transform any open-source LLM into a reasoning powerhouse.
No labeled data needed.
Today we'll use:
- @predibase for reinforcement fine-tuning (RFT)
- @Alibaba_Qwen-2.5:7b as the base model
Let's go! π
Mar 23 β’ 7 tweets β’ 2 min read
Eigenvalues and eigenvectors, clearly explained:
The concept of eigenvalues & eigenvectors is widely known yet poorly understood!
Today, I'll clearly explain their meaning & significance.
Let's go! π
Mar 21 β’ 4 tweets β’ 2 min read
Effortlessly trace and monitor multi-agent LLM applications!
With just two lines of code, Opik tracks everything happening inside your AI application, including costs. See the CrewAI example below.
100% open-source, self-hosted.
Opik offers integrations for nearly all popular frameworks.
But if you don't find what you need, don't worry!
A single decorator will handle it for you.
Check this outπ
Mar 18 β’ 9 tweets β’ 3 min read
Multithreading in Python clearly explained:
Ever felt like your Python code could run fasterβ
Multithreading might be the solution you're looking for!
Today, I'll simplify it for you in this step-by-step guide.
Let's go! π
Mar 13 β’ 11 tweets β’ 3 min read
Model Context Protocol (MCP), clearly explained:
MCP is like a USB-C port for your AI applications.
Just as USB-C offers a standardized way to connect devices to various accessories, MCP standardizes how your AI apps connect to different data sources and tools.
Let's dive in! π
Mar 12 β’ 11 tweets β’ 4 min read
Let's build a RAG app using Google DeepMind's Gemma 3 (100% local):
Google just dropped a multilingual and multimodal open-source LLM.
Today, we're building a RAG app powered by @GoogleDeepMind's Gemma3.
Tech stack:
- @Llama_Index for orchestration
- @Ollama to locally serve Gemma 3
- @Streamlit for the UI
Let's go! π
Mar 10 β’ 12 tweets β’ 4 min read
Let's build a Multimodal RAG app over complex webpages using DeepSeek's Janus-Pro (running locally):
The video depicts a multimodal RAG built with a SOTA tech stack.
We'll use:
- ColiVara's SOTA document understanding and retrieval to index webpages.
- @firecrawl_dev for reliable scrapping.
- @huggingface transformers to locally run DeepSeek Janus
Let's build it!
Mar 8 β’ 11 tweets β’ 3 min read
Object oriented programming in Python, clearly explained:
We break it down to 6 important concepts:
Bayes' Theorem clearly explained:
Bayes' Theorem is a cornerstone of probability theory!
It calculates the probability of an event, given that another event has occurred.
It's like updating your guess with fresh information!
Before we delve into the details, let's take a quick look at its formula:
Mar 6 β’ 13 tweets β’ 4 min read
Let's build a corrective RAG (CRAG) agentic workflow, step-by-step (100% local):
Before we dive in, here's a quick demo of our agentic workflow!
Tech stack:
- @Llama_Index workflows for orchestration
- @Linkup_platform for deep web search
- @Cometml's Opik to trace and monitor
- @Qdrant_engine to self-host vectorDB
Let's go! π
Mar 5 β’ 12 tweets β’ 6 min read
AI Engineering Hub just crossed 3k stars on GitHub!
Itβs 100% open-source, packed with 30+ hands-on tutorials that many would charge $1,000+ for...
Here's a small sample of what you get for free:
1οΈβ£ Multi-agent YouTube Trend Analysis App
Learn to gather trending topics using agents & transform data into actionable insights.