We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with @HHMIJanelia. 🪰
Our computerized insect replicates realistic motion, and can even use its eyes to control its actions.
Here’s how we developed it – and what it means for science. 🧵
To create it, we turned to MuJoCo, our open-source physics simulator – created for robotics and biomechanics – and added features such as:
▪️simulating fluid forces on the flapping wings, enabling flight
▪️adhesion actuators – mimicking the gripping force of insect feet
We then trained an artificial neural network on real fly behavior from recorded videos 🎥 and let it control the model in MuJoCo.
This enables it to learn how to move the virtual insect in the most realistic way.
Here, the fly shows accurate movement along complex natural flight trajectories.
Watch how it’s instructed to follow a real path marked by the blue dots. 🔵🪰🔵
So why build these kinds of models?
We believe they could help scientists better understand how the brain, body and environment drive specific behaviors in an animal – finding connections that labs can't always measure. 🥼
We’ve already applied this approach to multiple organisms – a virtual rodent, and now a fruit fly.
So what comes next for neuroscientists? The zebrafish – a widely studied creature which shares 70% of its protein-coding genes with humans. 🐠
We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video.
It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵
Omni brings together an improved understanding of physics with Gemini's knowledge of history, biology, and culture, bridging the gap from photorealism to meaningful storytelling.
Actions have consequences, environments respond to events, and narratives evolve logically.
🔘 Define a character once - then place them in any scene, and they’ll stay consistent across locations, actions and lighting.
🔘 Apply styles, motion, or effects by using input references, or just describe it with natural language.
We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️
These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵
With an AI-enabled pointer, help is always available where you’re working - without having to detour to additional apps. 📲
Point at a PDF and request bullet points for an email, hover over a table to ask for a pie chart, or highlight a recipe and simply say: "double these ingredients."
Current models require precise instructions, but our AI-enabled pointer removes that burden. 💡
By "seeing" what’s under your cursor, it instantly understands the specific word, image, or code block you need help with.
Weather affects everything and everyone. Our latest AI model developed with @GoogleResearch is helping us better predict it. ⛅
WeatherNext 2 is our most advanced system yet, able to generate more accurate and higher-resolution global forecasts. Here’s what it can do - and why it matters 🧵
A core challenge in weather prediction is capturing the full range of outcomes.
With WeatherNext 2, we can explore hundreds of possibilities in less than a minute from a single starting point. This would require hours on a supercomputer using physics-based models.
The model’s improved performance is enabled by a new approach called a Functional Generative Network, which can generate the full range of possible forecasts in a single step.
We added targeted randomness directly into the architecture, allowing it to explore a wide range of sensible weather scenarios.
SIMA 2 is our most capable AI agent for virtual 3D worlds. 👾🌐
Powered by Gemini, it goes beyond following basic instructions to think, understand, and take actions in interactive environments – meaning you can talk to it through text, voice, or even images. Here’s how 🧵
Advanced reasoning 🧠
We trained SIMA 2 to achieve high-level goals in a wide array of games – allowing it to perform complex reasoning and independently plan how to accomplish tasks.
It acts like a collaborative partner that can explain its intentions and answer questions about its behavior.
Generalization ☂️
SIMA 2 is now far better at carrying out detailed instructions, even in worlds it's never seen before.
It can transfer learned concepts like “mining” in one game and apply it to “harvesting” in another – connecting the dots between similar tasks.
It even navigated unseen environments created in real-time by our Genie 3 model.
We’re announcing a research collaboration with @CFS_energy, one of the world’s leading nuclear fusion companies.
Together, we’re helping speed up the development of clean, safe, limitless fusion power with AI. ⚛️
Fusion powers the sun, but here on Earth, one approach involves controlling a super-hot, ionized gas called plasma inside a tokamak machine.
To predict power generation, we need to simulate how heat, electric current and matter flow through the core of a plasma and interact with systems around it. This is where TORAX comes in.
TORAX is our open-source plasma simulator allowing CFS to run millions of virtual experiments to test plans for their tokamak, SPARC.
Using reinforcement learning, we’re now rapidly identifying the most efficient paths for it to generate more power than it consumes - a landmark achievement known as crossing "breakeven."
We’re rolling out Veo 3.1, our updated video generation model, alongside improved creative controls for filmmakers, storytellers, and developers - many of them with audio. 🧵
🎥 Introducing Veo 3.1
It brings a deeper understanding of the narrative you want to tell, capturing textures that look and feel even more real, and improved image-to-video capabilities.
🖼️ Ingredients to video
Give multiple reference images with different people and objects, and watch how Veo integrates these into a fully-formed scene - complete with sound.