Latest Twitter Threads by @GoogleDeepMind on Thread Reader App

Jul 23 • 4 tweets • 2 min read

Our new state-of-the-art AI model Aeneas transforms how historians connect the past. 📜

Ancient inscriptions often lack context – it's like solving a puzzle with 90% of the pieces lost to time. It helps researchers interpret and situate inscriptions in their past context. 🧵

By transforming each ancient text into a unique historical fingerprint, Aeneas can identify similarities across 176,000 Latin inscriptions.

In our study, historians found these ‘parallels’ to be helpful research starting points 9 out of 10 times - improving their confidence by 44%.

Jun 24 • 5 tweets • 3 min read

We’re bringing powerful AI directly onto robots with Gemini Robotics On-Device. 🤖

It’s our first vision-language-action model to help make robots faster, highly efficient, and adaptable to new tasks and environments - without needing a constant internet connection. 🧵

What makes this new model unique?

🔵 It has the generality and dexterity of Gemini Robotics - but it can run locally on the device
🔵 It can handle a wide variety of complex, two-handed tasks out of the box
🔵 It can learn new skills with as few as 50-100 demonstrations

Jun 17 • 5 tweets • 3 min read

Hot Gemini updates off the press. 🚀

Anyone can now use 2.5 Flash and Pro to build and scale production-ready AI applications. 🙌

We’re also launching 2.5 Flash-Lite in preview: the fastest model in the 2.5 family to respond to requests, with the lowest cost too. 🧵

2.5 Flash-Lite now supports:

🔹Thinking: improving performance and transparency through step-by-step reasoning
🔹Tool-use: including Search, code execution and 1 million token context window - similar to 2.5 Flash and Pro

May 14 • 6 tweets • 3 min read

Introducing AlphaEvolve: a Gemini-powered coding agent for algorithm discovery.

It’s able to:

🔘 Design faster matrix multiplication algorithms
🔘 Find new solutions to open math problems
🔘 Make data centers, chip design and AI training more efficient across @Google. 🧵

Our system uses:
🔵 LLMs: To synthesize information about problems as well as previous attempts to solve them - and to propose new versions of algorithms
🔵 Automated evaluation: To address the broad class of problems where progress can be clearly and systematically measured.
🔵 Evolution: Iteratively improving the best algorithms found, and re-combining ideas from different solutions to find even better ones.

Apr 30 • 4 tweets • 3 min read

We’re helping robots self-improve with the power of LLMs. 🤖

Introducing the Summarize, Analyze, Synthesize (SAS) prompt, which analyzes how they perform tasks based on previous actions and then suggests ways for them to get better using the medium of table tennis. 🏓

Large language models like Gemini have an inherent ability to problem solve, without needing to retrain for specific jobs.

Robots can use these models to improve how they operate over time, by interacting with the world, and learning from those interactions. 🦾 goo.gle/4jVFsoE

Apr 23 • 7 tweets • 3 min read

We built an AI model to simulate how a fruit fly walks, flies and behaves – in partnership with @HHMIJanelia. 🪰

Our computerized insect replicates realistic motion, and can even use its eyes to control its actions.

Here’s how we developed it – and what it means for science. 🧵

To create it, we turned to MuJoCo, our open-source physics simulator – created for robotics and biomechanics – and added features such as:

▪️simulating fluid forces on the flapping wings, enabling flight
▪️adhesion actuators – mimicking the gripping force of insect feet

Dec 16, 2024 • 4 tweets • 4 min read

Today, we’re announcing Veo 2: our state-of-the-art video generation model which produces realistic, high-quality clips from text or image prompts. 🎥

We’re also releasing an improved version of our text-to-image model, Imagen 3 - available to use in ImageFX through @LabsDotGoogle. → goo.gle/veo-2-imagen-3

Veo 2 is able to:
▪️ Create videos at resolutions up to 4k
▪️ Understand camera controls in prompts, such as wide shot, POV and drone shots
▪️ Better recreate real-world physics and realistic human expression

In head-to-head comparisons of outputs by human raters, it was preferred over other top video generation models. → goo.gle/veo-2

Dec 4, 2024 • 7 tweets • 3 min read

Today in @Nature, we’re presenting GenCast: our new AI weather model which gives us the probabilities of different weather conditions up to 15 days ahead with state-of-the-art accuracy. ☁️⚡

Here’s how the technology works. 🧵 goo.gle/49trAOv

Weather affects almost everything - from our daily lives 🏠 to agriculture 🚜 to producing renewable energy 🔋 and more.

Forecasting traditionally uses physics based models which can take hours on a huge supercomputer.

We want to do it in minutes - and better.

Nov 20, 2024 • 7 tweets • 3 min read

Introducing AlphaQubit: our AI-based system that can more accurately identify errors inside quantum computers. 🖥️⚡

This research is a joint venture with @GoogleQuantumAI, published today in @Nature → goo.gle/3ZflWMn

The possibilities in quantum computing are compelling. ♾️

They can solve certain problems in a few hours, which would take a classical computer billions of years. This can help lead to advances in areas like drug discovery to material design.

But building a stable quantum system is a challenge.

Oct 23, 2024 • 6 tweets • 3 min read

Our latest generative technology is now powering MusicFX DJ in @LabsDotGoogle - and we’ve also updated Music AI Sandbox, a suite of experimental music tools which can streamline creation. 🎵

This will make it easier than ever to make music in real-time with AI. ✨goo.gle/4eTg28Z

MusicFX DJ lets you input multiple prompts and include details on instruments, genres and vibes to create music. 🎛️

We’ve updated and improved the interface using feedback from @YouTube’s Music AI Incubator.

Sep 5, 2024 • 6 tweets • 3 min read

We’re presenting AlphaProteo: an AI system for designing novel proteins that bind more successfully to target molecules. 🧬

It could help scientists better understand how biological systems function, save time in research, advance drug design and more. 🧵 dpmd.ai/3XuMqbX

Protein binders are promising tools in drug development and biotech.

They’ve demonstrated potential in:
🌀 binding cancer targets
🌀 blocking viral infections
🌀 modulating immune response

But traditional ways of identifying effective protein binders involve extensive lab work.

Aug 8, 2024 • 9 tweets • 3 min read

Meet our AI-powered robot that’s ready to play table tennis. 🤖🏓

It’s the first agent to achieve amateur human level performance in this sport. Here’s how it works. 🧵

Robotic table tennis has served as a benchmark for this type of research since the 1980s.

The robot has to be good at low level skills, such as returning the ball, as well as high level skills, like strategizing and long-term planning to achieve a goal.

Aug 2, 2024 • 7 tweets • 3 min read

AI systems can be powerful but opaque "black boxes" - even to researchers who train them. ⬛

Enter Gemma Scope: a set of open tools made up of sparse autoencoders to help decode the inner workings of Gemma 2 models, and better address safety issues. → dpmd.ai/gemma-scope

Language models turn your text input into a series of ‘activations’ - which map the relationships between the words you’ve entered to help it write its answer. 💬

Activations at different layers in its neural network represent increasingly advanced concepts, known as ‘features’.

Jul 31, 2024 • 4 tweets • 2 min read

We’re welcoming a new 2 billion parameter model to the Gemma 2 family. 🛠️

It offers best-in-class performance for its size and can run efficiently on a wide range of hardware.

Developers can get started with 2B today → dpmd.ai/4d0MKEH

We’re also introducing ShieldGemma: a series of state-of-the-art safety classifiers designed to filter harmful content. 🛡️

These target hate speech, harassment, sexually explicit material and more, both in the input and output stages.

Jul 25, 2024 • 8 tweets • 4 min read

We’re presenting the first AI to solve International Mathematical Olympiad problems at a silver medalist level.🥈

It combines AlphaProof, a new breakthrough model for formal reasoning, and AlphaGeometry 2, an improved version of our previous system. 🧵 dpmd.ai/imo-silver

Our system had to solve this year's six IMO problems, involving algebra, combinatorics, geometry & number theory. We then invited mathematicians @wtgowers and Dr Joseph K Myers to oversee scoring.

It solved 4️⃣ problems to gain 28 points - equivalent to earning a silver medal. ↓

Jun 17, 2024 • 5 tweets • 2 min read

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥

It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more.

Here are 4 examples - turn your sound on. 🧵🔊 dpmd.ai/v2a

✍️ Prompt for audio: “Wolf howling at the moon.”

May 21, 2024 • 6 tweets • 2 min read

Our video generation model Veo gives more control over the camera. 📹

You can prompt for:
🔘 Extreme close up
🔘 Slow-motion crane shots
🔘 Timelapses

And more. 🧵

✍️ Prompt: “Timelapse of the northern lights dancing across the Arctic sky, stars twinkling, snow-covered landscape.”

✍️ Prompt: “A panning shot of a waterfall cascading down a rocky cliff, lush greenery surrounding the falls, mist rising from the crashing water.”

May 14, 2024 • 10 tweets • 4 min read

Introducing Veo: our most capable generative video model. 🎥

It can create high-quality, 1080p clips that can go beyond 60 seconds.

From photorealism to surrealism and animation, it can tackle a range of cinematic styles. 🧵 #GoogleIO

✍️ Prompt: “Many spotted jellyfish pulsating under water. Their bodies are transparent and glowing in deep ocean.”

May 8, 2024 • 6 tweets • 3 min read

Announcing AlphaFold 3: our state-of-the-art AI model for predicting the structure and interactions of all life’s molecules. 🧬

Here’s how we built it with @IsomorphicLabs and what it means for biology. 🧵 dpmd.ai/3URDiNo

AlphaFold 3 can generate the 3D structures of proteins, DNA, RNA, and smaller molecules, while also revealing how they fit together. 🧩

It can also model chemical changes to them that control the healthy functioning of cells - and when disrupted, could lead to disease.

Mar 19, 2024 • 6 tweets • 3 min read

We're announcing TacticAI: an AI assistant capable of offering insights to football experts on corner kicks. ⚽

Developed with @LFC, it can help teams sample alternative player setups to evaluate possible outcomes, and achieves state-of-the-art results. 🧵 dpmd.ai/49PGq1b

📊 Corner kicks can be challenging for AI to model due to the limited availability of data - @premierleague matches only average about 10 a game.

TacticAI uses a geometric deep learning approach to tackle this problem. → dpmd.ai/43p5Gcc

Feb 15, 2024 • 9 tweets • 4 min read

Introducing Gemini 1.5: our next-generation model with dramatically enhanced performance. It also achieves a breakthrough in long-context understanding.

The first release is 1.5 Pro, capable of processing up to 1 million tokens of information. 🧵 dpmd.ai/3SEbw4p

Gemini 1.5 was designed using a new Mixture–of-Experts (MoE) architecture, making it much more efficient to train and serve.

When tested on a set of text, code, image, audio and video evaluations, 1.5 Pro outperforms 1.0 Pro on 87% of benchmarks used for developing our LLMs.

Share this page!

Enter URL or ID to Unroll