Rowan Cheung Profile picture
Mar 14, 2024 9 tweets 3 min read Read on X
AI NEWS: Google DeepMind just introduced an AI agent that plays video games with you.

Plus, more developments from OpenAI Sora, Figure, Anthrophic, Devin, Oracle, and Elon Musk.

Here's everything going on in AI right now:
Google DeepMind just introduced SIMA, an AI agent that can follow natural language instructions to perform tasks across video games.

SIMA is a glimpse into the future of gaming, where AI agents will become dynamic sidekicks/companions rather than just opponents.
OpenAI CTO Mira Murati revealed in an interview with the WSJ that Sora was trained on publicly available and licensed data.

She also revealed that the model will be released later this year.
Figure just unveiled a new demo integrating with an OpenAI vision-language model.

The robot can now engage in natural conversations, visually interpret its surroundings, and autonomously execute tasks.
Anthropic released Claude 3 Haiku, "the fastest and most affordable model in its intelligence class"

It's available via API for devs at a million tokens for just $0.25.

World-class AI models just got 90% cheaper.
Speaking of Claude, @dr_cintas found a way to generate animations from code generated directly from the chatbot.

He's doing a full tutorial in The Rundown tomorrow!

Get it for free:
therundown.ai/subscribe
@dr_cintas Cognition AI just introduced Devin, an autonomous AI agent that can write entire software projects based on prompts.

If it lives up to hype, Devin could be the first true look into a future where anyone can spawn an AI worker, no coding skills required.
@dr_cintas Oracle CEO Larry Ellison said on the company’s earnings call that he and Elon Musk are collaborating.

The duo will develop an AI-powered application for farms, helping to plan, predict, and increase agricultural output. Image
@dr_cintas That's it for this week's news in the world of AI.

I share everything going on in AI, follow me @rowancheung for more.

If you found this helpful, spare me a like/retweet to support my content 👇

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Rowan Cheung

Rowan Cheung Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @rowancheung

May 8
AI NEWS: Mistral just launched a new multimodal AI promising SOTA performance at much lower cost

Plus, more news from Meta, Anthropic, Microsoft, Google, Figma, and Netflix.

Here's everything you need to know:
Mistral just made two big reveals:

—Medium 3, a multimodal AI matching or surpassing 3.7 Sonnet, GPT-4o, and Llama 4 Maverick across benchmarks with 8x lower costs

—Le Chat Enterprise with corporate tools like Google Drive, agent building, and more
Meta AI dropped Meta Perception Language Model, an open & reproducible vision-language AI for challenging visual tasks

It can watch videos and extract details like what a person is doing in the content and how they are doing it
Read 10 tweets
May 7
AI NEWS: Google just released a new version of Gemini 2.5 Pro, taking coding and web development to the next level

Plus, more news from HeyGen, Lighttricks, Windsurf, FutureHouse, Ace Studio, and Recraft.

Here's everything you need to know:
Google released an early preview of Gemini 2.5 Pro I/O Edition, delivering:

—Enhanced performance for frontend and UI dev, code editing, and agentic workflows
—Top score on LM Arena and WebDev Arena, beating Claude 3.7 Sonnet
—Video understanding features
HeyGen dropped Avatar IV, an AI for expressive animations

With one photo and voice script, its audio to expression engine captures tone, rhythm, and emotion to generate facial motion

Also supports different subjects, camera shots, and formats
Read 10 tweets
Apr 28
AI NEWS: China's Baidu just dropped ultra-affordable "Turbo" models, taking on OpenAI and DeepSeek

Plus, more news from OpenAI, Liquid AI, xAI, and Moonshot AI!

Here's everything you need to know:
Baidu dropped faster and cheaper ERNIE 4.5 Turbo and X1 Turbo models

4.5 Turbo, with upgraded multimodality, costs 11c and 44c per million I/O tokens (0.2% of GPT-4.5).

X1 Turbo comes at 14c/55c per million I/O tokens, topping DeepSeek R1 and V3
OpenAI released a new lightweight version of Deep Research to meet user demand

—Powered by o4-mini and ‘nearly as intelligent’
—Expanded usage limits but shorter responses
—Available on all tiers, including free users
Image
Read 9 tweets
Apr 22
TODAY'S AI NEWS: Google DeepMind CEO Demis Hassabis just said AI can end all disease!

Plus, more news from Anthropic, Windsurf, ElevenLabs, Play AI, and TextArena.

Here's everything you need to know:
Demis Hassabis appeared on 60 Minutes, sharing new insights into AI:

—AGI in 5-10 years, with conscious AI likely in the future
—AI fast-tracking drug discovery, potentially eradicating all disease within the next decade
—AI ensuring radical abundance
Anthropic just mapped Claude's morality!

The co analyzed 300K+ chats and found that the AI exhibits 5 key values, including practical and protective

This will improve our understanding of AI as it becomes more and more crucial to real-world decisions
Image
Read 9 tweets
Apr 17
TODAY'S AI NEWS: OpenAI just released o3 and o4-mini, its smartest models yet

Plus, more news from Google, Anthropic, Cohere, Microsoft, Kling AI, and more.

Here's everything you need to know:
OpenAI dropped the new o3 and o4-mini reasoner models

o3 pushes SOTA performance across coding, math, science, and multimodality, while o4-mini offers fast, cost-efficient performance

Both have agentic access to ChatGPT tools and can "think with images"
Google is expanding Gemini Live's Project Astra capabilities—to all Android users!

This will enable users to engage with real-time visual AI and have multilingual conversations about anything seen and heard via their phone's camera or screen share
Read 11 tweets
Apr 15
TODAY'S AI NEWS: OpenAI just dropped three GPT-4.1 models for devs—API-only and packing a jaw-dropping 1M-token context window.

Plus, more news from Google, Hugging Face, ByteDance, SSI, and LM Arena.

Here's everything you need to know:
OpenAI released GPT-4.1, 4.1 Mini, and the ultra-fast 4.1 Nano—all designed for devs

— Each model beats GPT-4o and 4o mini on dev tasks
— 1M token context windows
— GPT-4.1 scored 55% on SWE-Bench Verified
— Starting at $0.10/0.40 per million I/O tokens
Google released DolphinGemma, an AI that can generate dolphin vocalizations

The system analyzes dolphin whistles and sounds to identify patterns and predict subsequent sounds!

Set to be open-sourced soon!
Read 9 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(