China's UBTECH unveiled Walker S2, its next-gen humanoid for industrial use cases
The company claims it's the first humanoid that can swap its battery on its own to ensure continuous operation around the clock
Hugging Face's Pollen Robotics team open-sourced a fully 3D-printed robotic hand that costs less than $200
Weighing 400g, "The Amazing Hand" uses a soft TPU shell and promises 8 DoF with dual hobby servos per finger
Agility Robotics released a clip of its Digit robot, showing how it recovers from disturbances, like when someone pulls a rug out from under its feet
The company is targeting industrial tasks with its robots, working with giants like Amazon
Hume AI launched EVI-3, a new speech-to-speech model
It doesn't just mimic the voice of the speaker but also their tone, speaking style, nuances, language, and personality.
All it needs is 15-20s of audio
Ex-OpenAI CTO Mira Murati's Thinking Machines Lab announced $2B in seed funding
Murati said the company is building multimodal AI and will launch its first product in the "next couple of months" with a major open-source component
So, I summarized everything from Microsoft, Google, Meta, Perplexity, HeyGen, Chai Discovery, Replit, LimX Dynamics, K-Scale Labs, DJI, Cursor, Sakana AI, and more.
Here's everything you need to know and how to make sense out of it:
Microsoft introduced MAI-DxO, what the company is calling a "step towards medical superintelligence"
The system correctly diagnosed 85.5% of 304 cases vs. just 20% for experienced physicians
It also delivered greater cost savings than human doctors
Google dropped two big updates this week:
—Veo 3 AI video generator is now available globally for Gemini users with the AI Pro plan
—Gemini Live now integrates directly with Google Workspace apps like Calendar and Gmail
Significant progress in AI and Robotics this week.
So, I summarized everything from Google DeepMind, ElevenLabs, Anthropic, Meta, Microsoft, Apptronik, Mistral, Berkeley AI, Galbot, and more.
Google dropped major AI models this week, starting with Gemini Robotics On-Device
The VLA model runs locally and enables robots to perform a range of two-handed tasks without an internet connection
It can also learn new skills from a few dozen demos
Google DeepMind also released:
—AlphaGenome: An AI that predicts how gene mutations cause disease
—Gemma 3n: On-device models with multimodal understanding
—Gemini CLI: Open-source terminal agent
—Magenta RealTime: Live, open music AI
So, I summarized everything from Google, Midjourney, Figure, Cover, OpenAI, 1X, MiniMax, Hexagon, Sunrise Robotics, Gecko Robotics, and more.
Google dropped a bunch of AI updates this week:
—New Search Live with voice capability in AI Mode for U.S. users
—Preview of the new Gemini 2.5 Flash-Lite, the fastest and most affordable model in the 2.5 family
—General availability of Gemini 2.5 Pro and 2.5 Flash
Midjourney launched its first AI video generator, V1, an image-to-video system delivering:
—Four 5s clips from an image, which can be extended up to 20s
—Motion/action controls via prompts
—Signature Midjourney aesthetic
—25x lower cost than rivals
Significant progress in AI and Robotics this week.
So, I summarized everything from OpenAI, Google DeepMind, Mistral, Meta, 1X, Sakana AI, Nvidia, Infinite Machine, Snap, Bytedance, and more.
OpenAI dropped several updates this week, including:
—New o3 Pro, which thinks longer and beats rivals on PhD-level math & science
—80% price cut for o3
—More natural Advanced Voice that keeps translating across multiple turns
—Projects with support for deep research and voice mode
So, I summarized everything from OpenAI, Microsoft, Google, Figure, Unitree, BIGAI, UCR Robotics, Agility Robotics, Agibot, and more.
OpenAI dropped a series of updates this week, including:
—Custom and ready-to-use ChatGPT data connectors (Google Drive and more)
—Record mode to turn meetings into transcripts with insights
—Internet access for Codex
—Lightweight memory for free users
Microsoft added a free Video Creator to the Bing mobile app, powered by OpenAI's Sora
The company also added native shopping to Copilot, with features like review summaries, price tracking, and deal alerts