Rowan Cheung Profile picture
Dec 12 10 tweets 4 min read Twitter logo Read on Twitter
Massive day for open-source AI.

Plus, developments from Humanoid Robots, Meta, GPT-4V, Google, Stanford Research, and 9 new AI tools.

Here's the rundown of everything going on in AI right now:
French startup Mistral AI just released Mixtral, an open-source 45B parameter AI model.

Mixtral matches or outperforms LLaMA 2 and GPT-3.5 on most benchmarks while running 6x faster.

Did a full in-depth breakdown in the newsletter going out in ~8 hours: therundown.ai/subscribe
Image
Alongside the massive open-source model reveal, Mistral also announced an cool $415M Series A round that values the company at $2B.

This is ~6 months after a $113m seed round

Big money is being poured into open-source AI.
Researchers have unveiled 'Alter3', a humanoid robot capable of spontaneous motion using GPT-4.

Basically, text-to-motion.

Alter3 can adopt various poses, such as a 'selfie' stance or 'pretending to be a ghost,' without explicit programming for each body part.
Meta just launched a demo of Audiobox, its foundation model for generating highly realistic, voices, sound effects, and music.

The model can also generate speech in YOUR voice.

Demo includes Zero shot TTS, Text to sound effects, Infilling and more.
In the wake of Google’s backlash around the ‘faked’ Gemini demo, Greg Technology just replicated the video using GPT-4V.

In the video, Greg prompts GPT-4V without edits to highlight how far ahead the model is when compared to the staged demo.
Demo drama aside, Google has reportedly floated "Project Ellmann".

The project will leverage Gemini and generate your life story from just phone data.

Features could include chatbots detailing your bio and travel history from photos. Image
Stanford researchers just introduced W.A.L.T., an AI system that generates photorealistic and consistent video from text prompts or still images.

Notably, W.A.L.T. is also capable of outputs with consistent 3D camera motion.
9 new AI tools.

All the direct links will be included in my newsletter tomorrow.

Subscribe, and you'll never miss a thing in AI ever again: therundown.ai/subscribe
Image
That's it for today's news in the world of AI.

I do these rundowns daily, so follow me @rowancheung for more.

If you found this helpful, spare me a like/retweet to support my content 👇

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Rowan Cheung

Rowan Cheung Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @rowancheung

Dec 13
Completely AI-generated news anchors are going VIRAL.

Plus, huge developments in AI from Tesla Optimus, Microsoft, ChatGPT, Anthropic, and 8 new AI tools.

Here's the rundown of everything you need to know:
'Channel 1' just revealed their new AI news anchors.

All avatars and voice in this video is AI-generated.

With AI video tech improving week over week- the line between what's real and AI-generated is only going to continue to blur.
Notably, the first ever AI TV commercial also featured in Japan...

And it looks even more realistic than the above.

Things are getting weird!
Read 10 tweets
Dec 11
Google's new AI note-taking app just got upgraded with Gemini!

It's completely free and a life hack for students.

Here's what you need to know and how to access for free:
Before we start, here's what you need to know:

- The AI note-taking app by Google is called 'NotebookLM'
- It uses AI on your existing Docs
- It can summarize notes, answer questions, and turn them into outlines or scripts.
- It's 100% free

I show you how to access below. Image
1. Get access:

- Go to:
- If you're in outside the U.S. (like me), just use a VPN or use Opera Browser (what I use) with a built-in VPN: notebooklm.google/signup
opera.com
Read 9 tweets
Dec 11
AI NEWS: Google just admitted the mind-blowing Gemini AI demo was staged.

Plus, significant developments in AI from, Grok, Berkeley AI Research, AI regulation in the EU, NotebookLM, Seattle/UW Medicine, and 9 new AI tools.

Here's everything you need to know:
Google is facing backlash after admitting its Gemini demo video utilized heavy editing.

The video below was not recorded live, and the prompts were not given by voice.

In reality, frames were extracted, and researchers entered prompts off-camera.
This part of the Gemini demo was particularly shocking.

On-screen, a hand made gestures, and Gemini responded, "I know what you're doing! You're playing Rock, Paper, Scissors!".

In reality, prompts and still images were used behind the scenes.

The video was just for show.
Read 11 tweets
Dec 10
My AI tool database has massively expanded in size this week.

10 of my favorite AI tools that came out recently:

1. Screenshot-to-code.

Upload a screenshot of any website, and watch AI build it in real-time.
2. tldraw

Sketch anything and AI will turn it into a working website in seconds.

Link: makereal.tldraw.com
3. Magic Animate

Turn any still image into a moving animation leveraging the new 'Animate Anyone' model.

Did a full tutorial in my newsletter on how to use this tool properly:

Link: therundown.ai/p/runway-takes…
huggingface.co/spaces/zcxu-er…
Read 12 tweets
Dec 7
Today's AI developments were WILD.

-Google announces Gemini
-Meta reveals 20 new AI features
-Meta reveals AI image generator
-McDonald's new AI chatbot 'Ask Pickles'
-Alibaba's video AI scrapes TikTok data
-AI helps decode a new whale language

Here's what you need to know:
Google DeepMind revealed ChatGPT's biggest competitor, Google Gemini.

With a score of 90%, Gemini Ultra is the FIRST AI model to outperform human experts on the MMLU benchmark.
Gemini comes in 3 models — Ultra, Pro, and Nano.

Gemini Pro will be integrated into Bard today, while Gemini Ultra (the best model that beats GPT-4 in 30/32 benchmarks) will be avail early next year.

The thread below has everything you need to know:
Read 11 tweets
Dec 6
Google just revealed Gemini and will directly integrate the AI into Google apps.

The GPT-4 competitor comes in 3 models — Ultra, Pro, and Nano.

Here's a thread of EVERYTHING you need to know: Image
Gemini is multimodal and can recognize images and speak in real-time.

With a score of 90%, Gemini Ultra is the FIRST AI model to outperform human experts on the MMLU benchmark.

This demo is incredible.
Gemini has next-generation capabilities such as sophisticated reasoning, multimodality, and advanced coding.

The model is also advanced in math and coding, as compared to ChatGPT (GPT-4), which cannot perform math.

Check out this demo of them solving physics.
Read 10 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(