Rowan Cheung Profile picture
Dec 6, 2023 10 tweets 4 min read Read on X
Google just revealed Gemini and will directly integrate the AI into Google apps.

The GPT-4 competitor comes in 3 models — Ultra, Pro, and Nano.

Here's a thread of EVERYTHING you need to know: Image
Gemini is multimodal and can recognize images and speak in real-time.

With a score of 90%, Gemini Ultra is the FIRST AI model to outperform human experts on the MMLU benchmark.

This demo is incredible.
Gemini has next-generation capabilities such as sophisticated reasoning, multimodality, and advanced coding.

The model is also advanced in math and coding, as compared to ChatGPT (GPT-4), which cannot perform math.

Check out this demo of them solving physics.
Gemini has an incredible understanding of science.

It can find and extract research across 1000's of research papers.

Because Gemini is multimodal, it can not only understand text but also graphs through images!
Gemini comes in three sizes — Ultra for complex tasks, Pro for scaling across a range of tasks, and Nano for efficient on-device tasks.

-Pro will be in Google products through Bard starting today.
-Ultra will be rolling out early next year.
-Nano will be available on Pixel. Image
Gemini Ultra’s performance beats current state-of-the-art results in 30 of 32 benchmarks used in LLM research & development.
Image
Image
Gemini Pro will be available for free in Bard and across Google apps today.

In six out of eight benchmarks, Gemini Pro outperformed GPT-3.5, making it 'the most powerful free chatbot on the market today'. Image
Gemini Nano now powers on-device generative AI features for Pixel 8 Pro.

New features include:
-Summarize in Recorder
-Smart Reply in Gboard
-Cutting-edge video
-Enhanced photography and image editing
I shared all the info on Gemini in my newsletter this morning.

Click here to join 400k+ readers, and you'll never miss a thing in AI ever again: therundown.ai/subscribe
Thanks to @GoogleDeepMind for an invitation to the early press conference invite, allowing me to share the news live.

I do these rundowns daily, follow me @rowancheung
for more.

If you found this helpful, spare me a like/retweet to support my content 👇

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Rowan Cheung

Rowan Cheung Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @rowancheung

Mar 26
TODAY'S AI NEWS: Google just dropped Gemini 2.5 Pro, its most intelligent AI model to date.

Plus, more news from OpenAI, Figure, ByteDance, Otter, and Perplexity.

Here's everything you need to know:
Google released Gemini 2.5 Pro Experimental, the first model in its Gemini 2.5 family

—#1 on the LMArena
—SOTA capabilities across benchmarks for coding, math, science, and more
—Visual reasoning
—1M token context window (2M coming soon!)
OpenAI added native image generation within GPT-4o and Sora

—A fully integrated system for creating visuals via ChatGPT
—Excels at menus, diagrams, and infographics
—Edits images with text prompts
—Rolling out to Plus, Pro, Team, and Free users
Read 9 tweets
Mar 24
TODAY'S AI NEWS: Researchers just developed an AI that detects certain cancers with 99% accuracy!

Plus, more news from Tencent, Anthropic, Perplexity, Zapier, and more.

Here's everything you need to know:
A new game-changing AI, ECgMLP, identifies endometrial cancer with 99.26% accuracy

It uses microscopic tissue images and
outperforms humans and other automated methods

Also works across colorectal, breast, and oral cancers with 97%+ accuracy!
Image
Tencent released Hunyuan T1, a reasoning AI based on industry's first Transformer-Mamba architecture

—Matches or surpasses DeepSeek R1 and OpenAI’s o1 and GPT 4.5
—2x faster with reduced compute demands
—Priced at $0.14 and $0.55 per million I/O tokens
Image
Read 9 tweets
Mar 20
It's been 4 days of NVIDIA GTC 2025, and the announcements have been incredible.

The 10 most important reveals so far:

1. Blue: A Star Wars-inspired robot powered by a new physics engine with real-time intelligence and movement
2. Newton: An open-source physics engine to simulate robotic movements in the real world — developed jointly by Nvidia, DeepMind, and Disney Research.

(This is the physics engine that powers the Blue robot!)
3. Blackwell Ultra: The next generation of Blackwell with 1.5x computational power — coming in the second half of 2025
Read 12 tweets
Mar 18
TODAY'S AI NEWS: Roblox just made 3D content creation as simple as a text command!

Plus, more news from Mistral, xAI, Zoom, Tencent, Perplexity, Baidu, and more.

Here's everything you need to know:
Roblox announced Cube 3D, an open-source AI for generating 3D objects and scenes from text prompts

Trained on native 3D data, Cube uses 3D tokenization to create functional 3D outputs in seconds

Support for image inputs is also on the way!
Mistral AI released Small 3.1, a SOTA multilingual and multimodal LLM

—24B (can run on a laptop)
—128k token context window
—Outperforms Gemma 3 and GPT-4o Mini on most benchmarks
—Inference speed of 150 tokens/sec
—Open-source under Apache 2.0 license
Image
Read 11 tweets
Mar 7
I think China's second DeepSeek moment is here.

This AI agent called 'Manus' is going crazy viral in China right now.

Probably only a matter of time until it hits the US.

It's like Deep Research + Operator + Claude Computer combined, and it's REALLY good.
We noticed Manus gaining some traction @TheRundownAI and wrote about it in the newsletter this morning

Shortly after publishing, one of the cofounders reached out with an invitation code. Thanks @peakji!

So I dropped my work for the morning (emails can wait) and tested it out:
@TheRundownAI @peakji For my first test, I asked Manus to create a biography on Rowan Cheung and deploy a website based on that biography

Insanely impressive watching it go through my social channels, browse articles, and deploy the site

And it was 100% accurate, info up to date as of today
Read 7 tweets
Mar 6
TODAY'S AI NEWS: OpenAI is reportedly planning a $20,000 subscription for specialized AI agents

Plus, more news from Google, Alibaba, Scale AI, Codeium, Luma Labs, and Turing.

Here's everything you need to know:
OpenAI is reportedly planning specialized AI agents for tasks like Ph.D.-level research

Three agent tiers expected:

—Business professionals ($2k/mo)
—Advanced devs ($10k/mo)
—PhD-level researchers ($20k/mo)

OAI charges $200/mo for its Operator agent:
Google debuted AI Mode, a Search Labs experiment that turns traditional search into a conversational experience

—Powered by custom Gemini 2.0 model
—Runs parallel searches across diverse sources
—Gives detailed yet well-reasoned responses
Read 10 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(