TODAY'S AI NEWS: OpenAI just released o3 and o4-mini, its smartest models yet
Plus, more news from Google, Anthropic, Cohere, Microsoft, Kling AI, and more.
Here's everything you need to know:
OpenAI dropped the new o3 and o4-mini reasoner models
o3 pushes SOTA performance across coding, math, science, and multimodality, while o4-mini offers fast, cost-efficient performance
Both have agentic access to ChatGPT tools and can "think with images"
Google is expanding Gemini Live's Project Astra capabilities—to all Android users!
This will enable users to engage with real-time visual AI and have multilingual conversations about anything seen and heard via their phone's camera or screen share
TODAY'S AI NEWS: OpenAI just dropped three GPT-4.1 models for devs—API-only and packing a jaw-dropping 1M-token context window.
Plus, more news from Google, Hugging Face, ByteDance, SSI, and LM Arena.
Here's everything you need to know:
OpenAI released GPT-4.1, 4.1 Mini, and the ultra-fast 4.1 Nano—all designed for devs
— Each model beats GPT-4o and 4o mini on dev tasks
— 1M token context windows
— GPT-4.1 scored 55% on SWE-Bench Verified
— Starting at $0.10/0.40 per million I/O tokens
Google released DolphinGemma, an AI that can generate dolphin vocalizations
The system analyzes dolphin whistles and sounds to identify patterns and predict subsequent sounds!
TODAY'S AI NEWS: Amazon just dropped a new voice model that beats OpenAI
Plus, more news from Google, Nvidia, Deep Cogito, Stanford, and more.
Here's everything you need to know:
Amazon launched Nova Sonic speech-to-speech AI for human-like interactions
—Outperforms OpenAI's voice models with ~ 80% less cost
—4.2% word error rate across languages
— 46.7% better accuracy than GPT-4o for noisy environments
—On Amazon Bedrock
Amazon also dropped an upgraded Nova Reel 1.1 video model
—Delivers improved quality, style consistency
—Extends generations to 2 min via automated and manual, shot-by-shot modes
—Also available on Amazon Bedrock
TODAY'S AI NEWS: Google just dropped Gemini 2.5 Pro, its most intelligent AI model to date.
Plus, more news from OpenAI, Figure, ByteDance, Otter, and Perplexity.
Here's everything you need to know:
Google released Gemini 2.5 Pro Experimental, the first model in its Gemini 2.5 family
—#1 on the LMArena
—SOTA capabilities across benchmarks for coding, math, science, and more
—Visual reasoning
—1M token context window (2M coming soon!)
OpenAI added native image generation within GPT-4o and Sora
—A fully integrated system for creating visuals via ChatGPT
—Excels at menus, diagrams, and infographics
—Edits images with text prompts
—Rolling out to Plus, Pro, Team, and Free users