Rowan Cheung Profile picture
Founder of the world’s most read daily AI newsletter @therundownai. Sharing the latest developments in the world of artificial intelligence.
108 subscribers
Jan 23 16 tweets 6 min read
I got early access to ChatGPT Operator.

It's OpenAI's new AI agent that autonomously takes action across the web on your behalf.

The 9 most impressive use cases I’ve tried (videos sped up):

1. Ordering dinner ingredients based on a picture and a recipe 2. Planning a weekend trip based on hidden gems off Reddit, my budget and interests

Notice how at 0:06, ChatGPT Operator was blocked from Reddit but then decided to just do a Bing search with "Reddit" at the end

Very impressive decision-making
Jan 8 12 tweets 4 min read
That's a wrap for day 2 of the world's largest consumer tech event, CES 2025.

The top 10 tech and gadget reveals from day 2:

1. A stretchable Micro LED display that turns 2D into 3D by Samsung 2. A multitasking household robot that does everything from vacuuming, organization, air purification, monitoring pets, and even delivering you food while you sit on the couch by SwitchBot
Jan 7 12 tweets 4 min read
It's only been 1 day of CES 2025, and the announcements have already been incredible.

The 10 most impressive reveals of CES 2025 so far:

1. A 360° AI-powered body scanning health mirror that can scan your heart, weight, and metabolic health 2. Roborock's Saros Z70: A robotic vacuum that has a mechanical arm for picking up objects in the way of cleaning the floor
Dec 16, 2024 8 tweets 4 min read
Google just released Veo 2, a new state-of-the-art AI video model.

In testing, Veo beat OpenAI Sora in BOTH quality and prompt adherence.

The video compilation below is 100% created by AI (more details in thread): Veo can generate 8 second videos in up to 4K resolution (720p at launch).

The model also features:

— Better understanding of physics for more natural movement, lighting, etc.
— Enhanced clarity and sharpness of outputs
— Reduced hallucinated objects and details
Dec 11, 2024 7 tweets 3 min read
I've been an early tester + had in-person demos for most of Google’s AI projects announced today.

I found several practical use cases that will benefit everyday people.

12 use cases of Project Astra/Deep Research Agents/Project Mariner (beyond the hype): Project Astra: Google's AI agent that can 'see the world' using your phone camera

Use cases that stood out to me:

> Summarizing a book page in seconds and chatting with it for follow-ups on complex topics (professor-in-your-pocket)
> Identifying a rash: just seasonal hives or something more serious?
> Real-time translation of languages, sign language, and books (worked great for Japanese writing → English summary).
> Locating landscapes in photos and estimating their distance using the Google Maps integration.
> Remembering cookbook recipes and recommending wine pairings based on the recipe and budget.
> Summarizing thousands of Amazon/Airbnb reviews in seconds using mobile screen sharing, with highlights of any negative feedback.
Dec 11, 2024 11 tweets 4 min read
AI NEWS: OpenAI just rolled out 'ChatGPT Canvas' to all users.

Plus new AI coding agents, Sora, Google WIllow, xAI's Grok image generator, Amazon's AGI lab, Lindy agent phone calls, and Meta COCONUT.

Here's what you need to know: OpenAI just made Canvas, the collaborative split-screen writing and coding interface available to all users.

The feature also now has:
-Native integration with gpt-4o
-Python integration for direct code execution
-Integration within custom GPTs
Dec 6, 2024 10 tweets 4 min read
AI NEWS: Meta just unexpectedly dropped Llama 3.3—a 70B model that's ~25x cheaper than GPT-4o.

Plus, Google released a new Gemini model, OpenAI reinforcement finetuning, xAI's Grok is available for free, Copilot Vision, ElevenLabs GenFM, and more.

Here's what you need to know: Meta just dropped Llama 3.3 — a 70B open model that offers similar performance to Llama 3.1 405B, but significantly faster and cheaper.

It's also ~25x cheaper than GPT-4o.

Text only for now, and available to download at llama .com/llama-downloads
Image
Nov 26, 2024 6 tweets 2 min read
OpenAI’s Sora video model appears to have leaked.

And it's even more advanced than the February demo.

A group from the early beta testers just dropped access to what looks to be Sora's ‘turbo’ variant on Hugging Face, citing concerns about OpenAI's early access program and artist compensation.Image The leaked version generates 1080p 10-second clips and seems to be processing WAY faster than the previously reported 10-minute render times.
Nov 20, 2024 5 tweets 2 min read
NEW: ChatGPT’s ‘Live Camera’ video features were just found in the code of the latest beta.

Six months after their initial demo, OpenAI's visual AI could be ready for testing — and Advanced Voice Mode could soon be getting ‘eyes’

The code discovered in v1.2024.317 reveals:
—Live camera functionality
—Real-time processing
—Voice mode integration
—Visual recognition capabilities

The tech was initially showcased in an OpenAI demo in May, with Advanced Voice Mode interacting with a dog in real time: @AndroidAuth first spotted the code in the latest beta:

Beta
Tap the camera icon to let ChatGPT view and chat about your surroundings.
Live camera
Don't use for live navigation or decisions that may impact your health or safety.Image
Sep 25, 2024 8 tweets 4 min read
Meta just announced a ton of new AI announcements across Meta AI, Llama, Ray-Bans, and more.

Here’s everything important announced live from here @ Meta Connect:

1. Meta AI is getting its own voice mode!

2. Meta AI can now ‘see’ images!

Similar to ChatGPT, you can now share photos and have Meta AI reply to any photo in chat.

But where Meta is going a step further is by allowing users to actually edit photos, like removing an object, adding a hat, or changing backgrounds, etc., all within the chat.

Rolling out in the US only for now (unlike voice mode, which is rolling out to US, Canada, Australia, and New Zealand over the next month)Image
Sep 23, 2024 10 tweets 4 min read
Sam Altman and Jony Ive just confirmed they are working on a secret, new AI-powered hardware device.

Plus, more developments from Microsoft, Electronic Arts, Disney, Pudu Robotics, and Alibaba.

Here's everything that happened in AI over the weekend: Jony Ive is back with his first major tech project since leaving Apple, teaming up with Sam Altman for an AI-powered hardware device.

The project already raised private funding and plans to raise up to $1 billion by the end of the year.

iPhone meets o1? Image
Sep 20, 2024 14 tweets 5 min read
AI NEWS: Sam Altman just confirmed o1 has reached level 2 of OpenAI's 5-tier path to AGI.

Plus, more developments from Apple, Amazon, YouTube, Meta, Google, Alibaba, and Nvidia.

Here's everything going on in AI right now: In a recent interview with T-Mobile, Sam Altman compared o1’s current state to the ‘GPT-2 stage’ of reasoning models

He also revealed that the development of o1 unlocks a much quicker path to fully capable AI agents

Hear it from the man himself:
Sep 18, 2024 12 tweets 5 min read
AI NEWS: Snap just announced new AR/AI glasses and an AI video generator.

Plus, more developments from 1X, Google, Microsoft, Sakana, OpenAI, and Perplexity.

Here's everything going on in AI right now: Snap's Spectacles feature a suite of cameras and sensors for multimodal AI and contextual understanding.

They're powered by AI and can create shared immersive experiences with people nearby.

BUT the battery only lasts for 45 minutes, and they cost $99/m.
Sep 17, 2024 11 tweets 4 min read
AI NEWS: A smarter, faster version of Microsoft's AI assistant, Copilot, is coming to Excel and Word.

Plus, more developments from Slack, Groq, Google, Luma Labs, Runway, OpenAI, and Star Wars.

Here's everything going on in AI right now: Microsoft just unveiled its next wave of Copilot, a smarter, faster AI assistant.

Copilot is coming to Excel, PowerPoint, Teams, Outlook, Word, and OneDrive soon.

It's 2x faster and 3x better thanks to its GPT-4 integration, according to early users
Sep 16, 2024 10 tweets 4 min read
AI NEWS: The "Godmother of AI" just launched World Labs, teaching AI to create and understand 3D worlds.

Plus, more developments from OpenAI, Tencent, Runway, Google, and Meta.

Here's everything that happened in AI over the weekend: Fei-Fei Li announced a new startup, World Labs, to develop AI models capable of understanding and crafting 3D environments.

Instead of focusing on LLMs, World Labs will build LWMs (Large World Models).

This could revolutionize AR, VR, and robotics.
Sep 9, 2024 9 tweets 4 min read
Apple just announced its Apple Intelligence features for iPhone 16.

The 8 most impressive demos:

1. Apple Intelligence accessing the iPhone's camera for 'Visual Intelligence' on any surroundings
2. Another look at Apple's Visual Intelligence.

The iPhone 16 will have the ability to learn instantly about anything you see.

Like learning about dog breeds and getting restaurant reviews of the new spot around the corner just by taking a picture.


Sep 9, 2024 10 tweets 4 min read
AI NEWS: A new robot butler that can fold laundry is coming next year.

Plus, more developments from Tesla, Google, YouTube, OpenAI, and DeepSeek.

Here's everything that happened in AI over the weekend: Waeve revealed Isaac, a versatile personal robot designed to help you with household chores.

It responds to voice or text commands and can be programmed via an app.

Weave plans on selling Isaac for $59,000 to its first customers in 2025.
Sep 6, 2024 10 tweets 4 min read
Replit just launched an AI agent that codes and deploys full apps from prompts.

Building your own app has never been easier.

Here are the 7 coolest projects I've seen built so far: Ever wanted to share LLM prompts and vote on which ones are the best?

@martinbowling built an app that does that with Replit Agent in under 10 minutes.

He didn't write a single line of code 🤯
Sep 5, 2024 10 tweets 4 min read
AI NEWS: OpenAI co-founder Ilya Sutskever just raised a $1B seed round to develop 'Safe Superintelligence'

Plus, more developments from Groq, Anthropic, GPT-5, AI Film Festival, and Google.

Here's everything going on in AI right now: Ilya Sutskever's new startup, SSI, just raised $1B to build "a straight shot to safe superintelligence"

The team will spend the next few years on R&D before launching any AI products

This reportedly values the startup at $5B -- with only 10 employees

Image
Sep 4, 2024 9 tweets 3 min read
AI NEWS: CEO of OpenAI Japan just said an AI called 'GPT-Next' is coming, and it's 100x better than GPT-4.

Plus, more developments from Altera, Anthropic, Nvidia, and Google DeepMind.

Here's everything going on in AI right now: OpenAI Japan CEO revealed that GPT-Next will be 100x more powerful than GPT-4 at KDDI Summit 2024

The model will reportedly use a smaller version of OpenAI's mysterious Project Strawberry

The boost in performance comes from better architecture, not a ton of computing resources Image
Aug 29, 2024 9 tweets 4 min read
AI NEWS: Google just developed an AI system that recreated the popular DOOM game at 20 fps.

Plus, more developments from OpenAI, Bland AI, Gemini, Midjourney, and XPENG

Here's everything going on in AI right now: Google researchers just developed GameNGen, an AI that can simulate DOOM at over 20 fps

It works by predicting each frame in real time with a diffusion model

At scale this could mean AI will be able to create games on the fly, personalized to each player