Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Brett Adcock

@adcock_brett

May 11 • 17 tweets • 6 min read • Read on X

Scrolly

Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Google, Meta, Microsoft, FutureHouse, Mistral, Unitree, Stanford, UC Berkeley, Hugging Face, and more.

Here's everything you need to know and how to make sense out of it:

OpenAI ditched its for-profit push, saying it will convert its existing for-profit arm into a PBC but keep its non-profit in control with a majority stake

This comes after pressure from several ex-employees and an ongoing legal battle

OpenAI also launched a GitHub connector for ChatGPT

The feature will allow users to connect their repos and use ChatGPT's Deep Research to read and search source code and PRs, creating a detailed report with citations

Google updated two key models:

—Gemini 2.5 Pro Preview (I/O Edition), with video understanding and improvements for UI, code, and agentic workflows

—Gemini 2.0 Flash image generation with improved quality, text rendering, and fewer content restrictions

Meta dropped two new models:

—Perception Language Model, an open AI for visual tasks like extracting details of a subject's actions at a given time

—Locate 3D, an object localization AI, aimed at helping robots understand and interact with surroundings

Microsoft updated its Copilot with "Pages," a ChatGPT Canvas-like feature

It allows users to collaborate with Copilot, asking the assistant to tweak, expand, or polish its responses

Notable it doesn't seem to have coding capabilities like Canvas

Microsoft also announced it's adopting Google's Agent2Agent (A2A) framework, launching it soon on Azure AI Foundry and Copilot Studio

The move will enable enterprises to develop AI agents that interoperate across platforms by design

Ex-Google CEO Eric Schmidt-backed FutureHouse dropped five 'AI Scientist' agents:

—Crow for general research
—Falcon for deep literature reviews
—Owl for identifying previous research
—Phoenix for chemistry workflows
—Finch for discovery in biology

Mistral released two big products:

—Medium 3, a multimodal AI that matches or surpasses 3.7 Sonnet, GPT-4o, and Llama 4 Maverick at 8x less cost

—Le Chat Enterprise, an agentic AI assistant for businesses with tools like Google Drive and agent building

Unitree is teaming up with SF-based Reborn to co-develop advanced AI to make its robots smarter, more adaptable, and capable of complex tasks

It will use multiple Reborn offerings, including its Roboverse simulator, motion datasets, and developer tools

Stanford researchers debuted a Teleoperated Whole-Body Imitation System (TWIST)

It enables coordinated, versatile, whole-body movements of humanoids, using a single neural network

This will enable functional general-purpose robots in different domains!

UC Berkeley researchers announced VideoMimic, a real-to-sim-to-real pipeline that trains robots with mobile videos

It mines videos, reconstructs the humans and the environment, and produces policies for humanoids, enabling skills like climbing stairs

Hugging Face released Open Computer Agent, an open-source AI agent for automating web tasks — similar to OpenAI's Operator

It is free to use via web browsers, but is reported to be slow and capable of handling only basic multi-step tasks

Anthropic released web search capabilities in the API

The feature allows web developers to build applications that can search the web for up-to-date information and provide grounded answers with relevant citations

UC Berkeley researchers also introduced PyRoki, a modular, extensible, and cross-platform toolkit for kinematic optimization

It solves inverse kinematics, trajectory optimization, and motion retargeting for a wide range of robots, including humanoids

We're hiring for hundreds of roles @Figure_robot:

> AI Engineers (many)
> Staff Security Engineer
> HMI Design Lead
> System Integration & Test (many)
> Legal (many)
> Manufacturing (many)

Apply here: figure.ai/careers x.com/adcock_brett/s…

https://x.com/adcock_brett/status/1921596920520131068

@Figure_robot That's it for this week's AI and Robotics breakdown.

I share the latest research every week, so follow me @adcock_brett for more.

If you found this valuable, consider a like/retweet to spread the word.

https://x.com/adcock_brett/status/1921596920520131068

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @adcock_brett

Brett Adcock

@adcock_brett

May 4

Significant progress in AI and Robotics this week.

So, I summarized everything from Meta, OpenAI, Microsoft, Amazon, Google, DeepSeek, Alibaba, Baidu, and more.

Here's everything you need to know and how to make sense out of it:

Meta hosted its first LlamaCon developers conference and made a ton of announcements, including:

—Llama API free preview
—ChatGPT-like Meta AI app with "Discover" feed
—Lama Guard 4 (12B), LlamaFirewall, and Prompt Guard
—Colab with Groq and Cerebras

OpenAI pushed two key product updates:

A personality update for GPT-4o, which made the AI "sycophant-y" and was eventually reversed

And improvements for ChatGPT Search with new shopping UX, better citations, and trending and autocompleted searches

Read 17 tweets

Brett Adcock

@adcock_brett

Apr 27

Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Perplexity, Anthropic, Nari Labs, Character AI, Kortix AI, Physical Intelligence, Unitree, and more.

Here's everything you need to know and how to make sense out of it:

OpenAI dropped gpt-image-1 in the API, the model behind ChatGPT's viral image gen

This allows devs to integrate image creation, with text rendering and editing features, into third-party apps

Companies like Adobe and Figma are already using it

Perplexity released an agentic Voice Assistant

It uses web browsing and multi-app actions to book reservations, send emails and calendar invites, play podcasts/videos, and more

Currently available in the Perplexity app, but only on iOS

Read 18 tweets

Brett Adcock

@adcock_brett

Apr 20

Significant progress in AI and Robotics this week.

So, I summarized everything from OpenAI, Google, Scout AI, Microsoft, xAI, Hugging Face, Kling AI, Anthropic, ByteDance, and more.

Here's everything you need to know and how to make sense out of it:

OpenAI revealed its smartest reasoning AI models yet with o3 and o4-mini

While o4-mini focuses on cost, o3 delivers SOTA performance across benchmarks

Both can do visual analysis in their CoT and use all ChatGPT tools, including image gen

OpenAI also released API-only GPT-4.1, 4.1 Mini, and 4.1 Nano for devs

Each model beats GPT-4o and 4o mini on dev tasks with up to 1M context windows

GPT-4.1 scored 55% on SWE-Bench Verified—with prices starting at $2 and $8 per million I/O tokens

Read 16 tweets

Brett Adcock

@adcock_brett

Apr 13

Another huge week of AI and robotics news.

So, I summarized everything from OpenAI, Google, Samsung, Meta, Nvidia, Amazon, Microsoft, Midjourney, Runway, Clone Robotics, and more.

Here's everything you need to know and how to make sense out of it:

OpenAI rolled out a important update to ChatGPT’s memory

It can now automatically remember and reference information across all user conversations

The start to a more personalized and useful experience in chat

Google made several AI announcements at Cloud Next 2025:

—Firebase Studio: A Gemini-powered, cloud-based agentic dev environment
—Ironwood TPU, Google's most powerful AI chip ever
—Faster and cheaper Gemini 2.5 Flash
—Improvements to Veo 2 and Imagen 3

Read 18 tweets

Brett Adcock

@adcock_brett

Apr 6

Significant progress in AI and Robotics this week.

So, I summarized everything from OpenAI, Figure, Amazon, Sanctuary AI, xAI, Runway, Physical Intelligence, Google, Meta, Anthropic, and more

Here's everything you need to know and how to make sense out of it:

As GPT-4o's Ghibli-style image generation went viral, OpenAI closed a $40B round of funding (the largest ever private deal in history)

The investment gives OpenAI a post-money valuation of $300B

Figure Update: We just demonstrated Figure 02's fully autonomous operation on BMW's production line

Our robots are performing real work, advancing Helix AI and strengthening our end-to-end autonomy

They've been permanently deployed at the BMW plant!

Read 17 tweets

Brett Adcock

@adcock_brett

Mar 30

Another huge week of AI and robotics news.

So, I summarized everything from Google, OpenAI, Figure, Qwen, Ideogram, Reve, Microsoft, Perplexity, Tencent, DeepSeek, and more.

Here's everything you need to know and how to make sense out of it:

Google released its most intelligent AI model, Gemini 2.5 Pro Experimental

The new thinking model ranks #1 on the LMArena and delivers SOTA performance across coding, math, science, and more

Also supports visual reasoning with a 1M token context window

OpenAI added native image generation to GPT-4o and Sora

The move creates an integrated system, where ChatGPT understands conversation context and creates/edits visuals with precision

Rolling out to Free, Plus, Pro, and Team users