- Ideogram Character
- Gemini’s Deep Think
- ChatGPT Study Mode
- Alibaba Wan 2.2 AI Video
- FLUX.1 Krea Photorealistic
- Hunyuan 3D World AI Model
- Microsoft Copilot Mode Browser
- Zai SOTA Open-Source Agentic AI
Here’s EVERYTHING your need to know:
1. Ideogram releases Character, the first character consistency model that works with just one reference image.
Now available free to all users, it renders infinite variations of your characters with striking fidelity.
Jul 30 • 8 tweets • 3 min read
You can now vibe code 3D websites and mobile apps with this AI.
It lets you build fully functional apps, hit games, and even interactive sites.
- Hedra Live Avatars
- DeepMind’s Aeneas
- US New AI Action Plan
- Composite Browser Agent
- Google AI Math Gold Medal
- GitHub Spark Coding Agent
- Runway Aleph Context Model
- Alibaba Qwen3 Sota Model & CLI
Here’s EVERYTHING you need to know:
1. Hedra launches Live Avatars, the world's most advanced real-time streaming avatar model with sub-100ms response times.
You can create lifelike digital characters for live streaming, virtual meetings, and customer service.
Jul 18 • 10 tweets • 4 min read
What a crazy week in AI 🤯
- ChatGPT Agents
- Runway’s Act-Two
- Grok AI Companions
- Claude New Directory
- Mistral AI Voxtral Speech
- Amazon Coding Agent Kiro
- Google Search New AI Features
- First Live-Stream Diffusion Model
Here’s EVERYTHING you need to know:
1. OpenAI launches ChatGPT Agent, combining Operator, Deep Research, and conversational AI into one unified system.
Users can now ask it to handle complex tasks like analyzing calendars, booking travel, and creating presentations autonomously.
Jul 17 • 8 tweets • 3 min read
🚨 BREAKING: Top AI voice clone model just launched.
It’s called EVI 1 by Hume AI and is a speech-to-speech model that captures emotion no other text-to-speech can.
This model clones any speaking style from just 15-20s of audio.
It’s now fully available. Link to try free below
You can try it here:
You’ll see two main options:
• Design a voice, where you create a unique voice by describing it
• Clone your voice by uploading your own audio
This agentic browser connects to your apps and does everything you want autonomously.
10 powerful use cases👇:
1. Summarize and provide me all the links of this video
2. Travel Guide
Jul 11 • 10 tweets • 4 min read
What a crazy week in AI 🤯
- Perplexity Comet
- Grok 4 SOTA model
- Mistral Devstral Models
- Google Veo 3 Image Input
- Context first AI Office Suite
- Microsoft Research BioEmu
- Kimi K2 Open-Source Agentic
- Flux Kontext Composer & Presets
Here’s EVERYTHING you need to know:
1. Perplexity launches Comet, its first AI-powered web browser designed to challenge Google Chrome.
Comet integrates Perplexity's AI search and includes an AI assistant that can summarize emails, manage tabs, and navigate web pages automatically.
Jul 7 • 7 tweets • 3 min read
Emergent 2.0 just dropped.
You can now build apps, games, MCPs and extensions in 1 prompt.
It now has a security review, scalability, and design agent which solves the flaws in Vibe coding platforms.
5 crazy examples + how to try free 👇:
1. GitHub Live Repo Visualizer
2. Create an MCP server that uses Gemini to browse through my files and organize them
Jul 4 • 10 tweets • 4 min read
What a crazy week in AI 🤯
- Cursor Phone App
- Krea AI Modify Video
- Google launches Doppl
- Perplexity New Max Tier
- X new AI note taking API
- AI’s Breakthrough in Fertility
- Morphic One-Shot Character
- Meta & OpenAI recruiting drama
Here’s EVERYTHING you need to know:
1. Cursor launches web app for mobile, bringing AI coding agents to your phone.
Developers can now manage AI coding agents directly from their browser on desktop or mobile, with agents that can write code, fix bugs, and complete tasks autonomously.
Jul 2 • 6 tweets • 3 min read
🚨Recraft AI has released Advanced Style Controls.
You can now explore infinite styles and mix them with your own images for perfect brand consistency.
Here’s how:
Recraft is a top image generation and editing tool, and now includes:
- Midjourney V1 Video
- ChatGPT Record Mode
- Higgsfield new AI Canvas
- Claude Code MCP Servers
- Google Search Live AI Mode
- MIT Study ChatGPT’s Impact
- MiniMax M1 model & AI Agent
- Tencent open-source 3D model
Here’s EVERYTHING you need to know:
1. Midjourney enters the video game with their first AI video model V1 that turns any image into 5-second clips.
Users can also extend videos up to 20s and it's available on their $10/month plan starting now.
Jun 18 • 7 tweets • 4 min read
🚨MiniMax open-sources the world’s longest context window reasoning AI model.
It’s called MiniMax M1 and supports 1M token inputs / 80k outputs, letting you analyze huge documents and follow deep reasoning without context loss.
Powerful use cases + link to try free below:
Key features from paper:
- World’s Longest Context Window (1M token input and 80k outputs)
- SOTA Agentic Use among Open-Source Models
- $534,700 RL training in 3 weeks
Director is a tool that helps you build web automation scripts using natural language prompts.
Think of Operator but free and you also get the code behind it.
Here’s how:
There are three major components:
• Director is the application to turn ideas into workflows.
• Stagehand is the SDK to control a browser.
• Browserbase is the infrastructure to power your browsers.
Jun 15 • 7 tweets • 3 min read
Genspark AI just released AI Browser.
You can now have an AI agent in your browser to automate browsing, planning, and interacting with the web for you.
5 powerful use cases + how to try👇:
1. Download all papers talked in a YouTube video
2. Checking X feed and create a podcast with all the info
Jun 13 • 11 tweets • 4 min read
What a crazy week in AI 🤯
- OpenAI o3-Pro
- Google AI Extract
- Krea 1 Image Model
- Midjourney AI Video
- Topaz Video Upscaler
- Dia AI-first web browser
- Mistral Reasoning Models
- Scouts Web Monitor Agents
- SkyReels Open-Source AI Video
Here’s EVERYTHING you need to know:
1. OpenAI launches o3-Pro, their most capable reasoning model yet, now available to ChatGPT Pro and Team users.
Replaces o1-Pro with enhanced performance in science, education, and programming, scoring 64% win rate vs o3 on human evaluations.