Brett Adcock
May 11 · 17 tweets · 6 min read
Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Google, Meta, Microsoft, FutureHouse, Mistral, Unitree, Stanford, UC Berkeley, Hugging Face, and more.

Here's everything you need to know and how to make sense out of it:
OpenAI ditched its for-profit push, saying it will convert its existing for-profit arm into a public benefit corporation (PBC) but keep its non-profit in control with a majority stake

This comes after pressure from several ex-employees and an ongoing legal battle
OpenAI also launched a GitHub connector for ChatGPT

The feature will allow users to connect their repos and use ChatGPT's Deep Research to read and search source code and PRs, creating a detailed report with citations
Google updated two key models:

—Gemini 2.5 Pro Preview (I/O Edition), with video understanding and improvements for UI, code, and agentic workflows

—Gemini 2.0 Flash image generation with improved quality, text rendering, and fewer content restrictions
Meta dropped two new models:

—Perception Language Model, an open AI for visual tasks like extracting details of a subject's actions at a given time

—Locate 3D, an object localization AI, aimed at helping robots understand and interact with surroundings
Microsoft updated its Copilot with "Pages," a ChatGPT Canvas-like feature

It allows users to collaborate with Copilot, asking the assistant to tweak, expand, or polish its responses

Notably, it doesn't seem to have coding capabilities like Canvas
Microsoft also announced it's adopting Google's Agent2Agent (A2A) framework, launching it soon on Azure AI Foundry and Copilot Studio

The move will enable enterprises to develop AI agents that interoperate across platforms by design
Ex-Google CEO Eric Schmidt-backed FutureHouse dropped five 'AI Scientist' agents:

—Crow for general research
—Falcon for deep literature reviews
—Owl for identifying previous research
—Phoenix for chemistry workflows
—Finch for discovery in biology
Mistral released two big products:

—Medium 3, a multimodal AI that matches or surpasses Claude 3.7 Sonnet, GPT-4o, and Llama 4 Maverick at 8x lower cost

—Le Chat Enterprise, an agentic AI assistant for businesses with tools like Google Drive and agent building
Unitree is teaming up with SF-based Reborn to co-develop advanced AI to make its robots smarter, more adaptable, and capable of complex tasks

It will use multiple Reborn offerings, including its Roboverse simulator, motion datasets, and developer tools
Stanford researchers debuted a Teleoperated Whole-Body Imitation System (TWIST)

It enables coordinated, versatile, whole-body movements of humanoids, using a single neural network

This will enable functional general-purpose robots in different domains!
UC Berkeley researchers announced VideoMimic, a real-to-sim-to-real pipeline that trains robots with mobile videos

It mines videos, reconstructs the humans and the environment, and produces policies for humanoids, enabling skills like climbing stairs
Hugging Face released Open Computer Agent, an open-source AI agent for automating web tasks — similar to OpenAI's Operator

It is free to use via web browsers, but is reported to be slow and capable of handling only basic multi-step tasks
Anthropic released web search capabilities in the API

The feature allows web developers to build applications that can search the web for up-to-date information and provide grounded answers with relevant citations
UC Berkeley researchers also introduced PyRoki, a modular, extensible, and cross-platform toolkit for kinematic optimization

It solves inverse kinematics, trajectory optimization, and motion retargeting for a wide range of robots, including humanoids
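
For readers new to the term, inverse kinematics means finding the joint angles that put a robot's end effector at a desired position. The sketch below is the textbook closed-form solution for a two-link planar arm. It is not PyRoki's API or its method; the function names and the default link lengths are made up for illustration.

```python
import math


def forward_kinematics(theta1, theta2, l1=1.0, l2=1.0):
    """Return the (x, y) position of the arm tip for the given joint angles."""
    x = l1 * math.cos(theta1) + l2 * math.cos(theta1 + theta2)
    y = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
    return x, y


def inverse_kinematics(x, y, l1=1.0, l2=1.0):
    """Return joint angles (theta1, theta2) that reach the target (x, y).

    Two solutions usually exist; this returns the one whose elbow
    angle theta2 is non-negative.
    """
    # Law of cosines gives the elbow angle from the target distance.
    cos_theta2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2 * l1 * l2)
    if not -1.0 <= cos_theta2 <= 1.0:
        raise ValueError("target is out of reach")
    theta2 = math.acos(cos_theta2)
    # Shoulder angle: direction to the target minus the offset from link 2.
    theta1 = math.atan2(y, x) - math.atan2(
        l2 * math.sin(theta2), l1 + l2 * math.cos(theta2)
    )
    return theta1, theta2


# Round trip: solve for the angles, then check where the tip lands.
angles = inverse_kinematics(1.2, 0.5)
print(forward_kinematics(*angles))
```

Humanoids have far more joints than this, so closed-form answers are rarely available and the problem is typically posed as a numerical optimization instead.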
We're hiring for hundreds of roles @Figure_robot:

> AI Engineers (many)
> Staff Security Engineer
> HMI Design Lead
> System Integration & Test (many)
> Legal (many)
> Manufacturing (many)

Apply here: figure.ai/careers x.com/adcock_brett/s…
That's it for this week's AI and Robotics breakdown.

I share the latest research every week, so follow me @adcock_brett for more.

If you found this valuable, consider a like/retweet to spread the word.


More from @adcock_brett

May 4
Significant progress in AI and Robotics this week.

So, I summarized everything from Meta, OpenAI, Microsoft, Amazon, Google, DeepSeek, Alibaba, Baidu, and more.

Here's everything you need to know and how to make sense out of it:
Meta hosted its first LlamaCon developers conference and made a ton of announcements, including:

—Llama API free preview
—ChatGPT-like Meta AI app with "Discover" feed
—Llama Guard 4 (12B), LlamaFirewall, and Prompt Guard
—Collaborations with Groq and Cerebras
OpenAI pushed two key product updates:

A personality update for GPT-4o, which made the AI "sycophant-y" and was eventually reversed

And improvements for ChatGPT Search with new shopping UX, better citations, and trending and autocompleted searches
Read 17 tweets
Apr 27
Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Perplexity, Anthropic, Nari Labs, Character AI, Kortix AI, Physical Intelligence, Unitree, and more.

Here's everything you need to know and how to make sense out of it:
OpenAI dropped gpt-image-1 in the API, the model behind ChatGPT's viral image gen

This allows devs to integrate image creation, with text rendering and editing features, into third-party apps

Companies like Adobe and Figma are already using it
Perplexity released an agentic Voice Assistant

It uses web browsing and multi-app actions to book reservations, send emails and calendar invites, play podcasts/videos, and more

Currently available in the Perplexity app, but only on iOS
Read 18 tweets
Apr 20
Significant progress in AI and Robotics this week.

So, I summarized everything from OpenAI, Google, Scout AI, Microsoft, xAI, Hugging Face, Kling AI, Anthropic, ByteDance, and more.

Here's everything you need to know and how to make sense out of it:
OpenAI revealed its smartest reasoning AI models yet with o3 and o4-mini

While o4-mini focuses on cost, o3 delivers SOTA performance across benchmarks

Both can do visual analysis in their CoT and use all ChatGPT tools, including image gen
OpenAI also released API-only GPT-4.1, 4.1 Mini, and 4.1 Nano for devs

Each model beats GPT-4o and 4o mini on dev tasks with up to 1M context windows

GPT-4.1 scored 55% on SWE-Bench Verified and is priced at $2 per million input tokens and $8 per million output tokens
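
To make the pricing concrete, here is a rough sketch of the arithmetic. It assumes the $2 figure is per million input tokens and the $8 figure is per million output tokens; the helper name and the token counts are hypothetical.

```python
from decimal import Decimal

# Assumed reading of the quoted prices: USD per one million tokens.
INPUT_USD_PER_MILLION = Decimal("2")
OUTPUT_USD_PER_MILLION = Decimal("8")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    million = Decimal(1_000_000)
    input_cost = Decimal(input_tokens) / million * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / million * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Example: a 200,000-token prompt with a 5,000-token reply.
print(estimate_cost_usd(200_000, 5_000))
```

Under these assumptions, that example request comes to about 44 cents, with most of the cost coming from the long prompt.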
Read 16 tweets
Apr 13
Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Google, Samsung, Meta, Nvidia, Amazon, Microsoft, Midjourney, Runway, Clone Robotics, and more.

Here's everything you need to know and how to make sense out of it:
OpenAI rolled out an important update to ChatGPT’s memory

It can now automatically remember and reference information across all user conversations

The start to a more personalized and useful experience in chat
Google made several AI announcements at Cloud Next 2025:

—Firebase Studio: A Gemini-powered, cloud-based agentic dev environment
—Ironwood TPU, Google's most powerful AI chip ever
—Faster and cheaper Gemini 2.5 Flash
—Improvements to Veo 2 and Imagen 3
Read 18 tweets
Apr 6
Significant progress in AI and Robotics this week.

So, I summarized everything from OpenAI, Figure, Amazon, Sanctuary AI, xAI, Runway, Physical Intelligence, Google, Meta, Anthropic, and more.

Here's everything you need to know and how to make sense out of it:
As GPT-4o's Ghibli-style image generation went viral, OpenAI closed a $40B round of funding (the largest private funding round in history)

The investment gives OpenAI a post-money valuation of $300B
Figure Update: We just demonstrated Figure 02's fully autonomous operation on BMW's production line

Our robots are performing real work, advancing Helix AI and strengthening our end-to-end autonomy

They've been permanently deployed at the BMW plant!
Read 17 tweets
Mar 30
Another huge week of AI and robotics news.

So, I summarized everything from Google, OpenAI, Figure, Qwen, Ideogram, Reve, Microsoft, Perplexity, Tencent, DeepSeek, and more.

Here's everything you need to know and how to make sense out of it:
Google released its most intelligent AI model, Gemini 2.5 Pro Experimental

The new thinking model ranks #1 on the LMArena and delivers SOTA performance across coding, math, science, and more

Also supports visual reasoning with a 1M token context window
OpenAI added native image generation to GPT-4o and Sora

The move creates an integrated system, where ChatGPT understands conversation context and creates/edits visuals with precision

Rolling out to Free, Plus, Pro, and Team users
Read 15 tweets
