Jim Fan Profile picture
@NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

Apr 3, 2023, 12 tweets

Just got access to Adobe Firefly! How does the world's leading creative tool maker fare against MidJourney, a self-funded 11-person team?

Let's check it out. Left is Firefly and right is MidJourney V5. Prompt in "ALT" button on lower-left corner.

Deadpool posing on a car. 1/🧵

Super Mario in a dim lit street with a big reflection in a puddle. Firefly's interpretation of "Super Mario" is ... exotic (?) 😅

Prompt and image credits to @LinusEkenstam @vitomotiv.

2/

Same prompt as above but for Pikachu. Again, somehow Firefly does not fully get these famous characters. Maybe a training data copyright issue?

Prompt and MJ image credits to @LinusEkenstam @vitomotiv.

3/

Next, who is the better portrait photographer?

Photo of a large crowd of commuters in Tokyo, sharply focused faces, but it's the woman in red that commands your attention. Warm glow, elegance.

Prompt & MJ image credit: @nickfloats

4/

How about some sci-fi?

Abstract fractal circular mosaic city architecture.

Prompt & MJ image credit: @chetbff @BambuuArt

5/

Now let's do some mobile app icon design. Does Firefly even know what an app icon is?

iOS app icon, Sci-fi planet landscape with skeuomorphic style.

Prompt & MJ image credit: @followmarcos

6/

The "human finger" test is becoming the new visual Turing Test. It's the final moat that Diffusion needs to conquer to become truly sentient 🤣.

A stunning young Jamaican woman wearing white retrofuturistic sequin Gucci gown, standing in the desert.

Credit: @nickfloats

7/

Finally, a landscape photo. It turns out to be an easy task that both Firefly and MJ excel.

Red Ferrari F40 in Dandelions at the Lake Seealpsee.

Prompt & MJ image credit: @heyBarsee

8/

Note: these prompts are heavily optimized for MidJourney, so that may give it an unfair advantage. However, I did try a few variations but still couldn't get better results. I'm not a prompt ninja, so your mileage may vary.

Still, I'm grateful for Adobe's early beta access! /🧵

Note 2: Firefly is only trained on Adobe Stock and fully licensed images. The data curation is very conservative, which may cripple its performance.

I also included examples without copyrighted characters in the thread.

Note 3: Adobe research scientist @vdeschaintre has a good point: it may be a significant plus for companies who must ensure the IP copyright of the output image. They may be more than willing to sacrifice quality for legality, which makes MJ a less appealing option.

Thanks for all your feedback. I wrote a summary note to give Firefly's approach fair and proper credits:

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling