Chitwan Saharia
Research Engineer @GoogleAI 🧠🍁 || B. Tech, CSE, @IITBombay

May 24, 2022, 5 tweets

We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understanding. Explore Imagen at imagen.research.google!

A large rusted ship stuck in a frozen lake. Snowy mountains and beautiful sunset in the background. #imagen

A plush toy koala bear relaxing on a lounge chair and working on a laptop. The chair is beside a rose flower pot. There is a window on the wall beside the flower pot with a view of snowy mountains. #imagen

Imagen uses a large pre-trained language model (T5-XXL) as a text encoder, and a cascade of diffusion models for 1024x1024 image generation. Imagen outperforms all existing techniques on the MS-COCO benchmark by a considerable margin.

We introduce DrawBench, a comprehensive and challenging benchmark dataset for evaluating text-to-image models. Imagen outperforms all recent techniques on DrawBench.

Work with incredible collaborators @wchan212, @srbhsxn, Lala Li, @jaywhang_, @cephaloponderer, @coolboi95, Burcu Ayan, Sara Mahdavi, @iraphas13, @TimSalimans, @hojonathanho, @fleet_dj, @mo_norouzi
