Bilawal Sidhu
Mar 21, 2023 • 14 tweets • 8 min read
Been hands-on with the beta of Adobe's cutting-edge Generative AI tool, and I'm impressed! 🤯

Here's a taste of the power of #AdobeFirefly 🎇 and what sets it apart in the increasingly crowded world of #AI art.

Thread 🧵🎨
For starters, Adobe Firefly isn't one thing -- it encompasses multiple AI models. It's a portal for testing new capabilities with creators, and eventually graduating them into products like Photoshop & Premiere that creators know and love. Meeting users where they are, if you will.
If you've used any text-to-image product (e.g. Stable Diffusion or DALL-E), Adobe Firefly will feel immediately familiar at first glance.
But there are a few unique takes in Adobe's product experience.
Let's dig in...
Adobe is using a diffusion-based model (not GigaGAN as many of us suspected!), so needless to say you can get some pretty photorealistic results.
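For context on what "diffusion-based" means here (a generic illustration, not Adobe's actual pipeline): the model starts from pure noise and repeatedly predicts and removes noise, guided by your prompt, until an image emerges. A minimal sketch of that denoising loop, with a stubbed-out noise predictor standing in for the trained network:

```python
import numpy as np

def predict_noise(image, prompt_embedding, t):
    """Stand-in for the trained denoising network (e.g. a U-Net).
    A real model estimates the noise present in `image` at step `t`,
    conditioned on the prompt embedding."""
    return np.zeros_like(image)  # placeholder: predicts "no noise"

def sample(prompt_embedding, steps=50, size=(512, 512, 3)):
    """Generic diffusion sampling loop: start from noise, denoise step by step."""
    image = np.random.randn(*size)               # pure Gaussian noise
    for t in reversed(range(steps)):
        noise_estimate = predict_noise(image, prompt_embedding, t)
        image = image - noise_estimate / steps   # toy update, not a real scheduler
    return image
```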
Adobe's trained this model using Adobe Stock, which means the provenance of the data is rock solid.

Adobe can't afford to alienate creators, so they have *not* trained models on Behance imagery yet, despite it being a treasure trove 💎

Will these moves woo AI art naysayers? 🤔
With Firefly you can also generate text effects!
Pick a font, type in some text, describe your style and voila -- a new logo for my creator brand.
I can totally see how this will be super useful inside Photoshop or Illustrator. No more complex layer effects to wrangle :)
Adobe's Firefly UX is unique in that you can provide a prompt (which describes the contents of your scene), and then augment it with a bunch of parameters like style, color and tone, lighting, and composition. This makes it super easy to iterate:
So let's say I like the overall result, but I'm looking for a different camera angle and a slightly different aesthetic (e.g. low lighting, shot from below, cool tone). You can really dial in a look without futzing around with prompts. Pretty nice!
Stylized not your jam, and want to go back to a photorealistic result? As easy as clicking a button, and bam:
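Under the hood, you can think of that parameter panel as appending well-chosen descriptors to your base prompt. A hypothetical sketch of the idea (the `compose_prompt` helper and parameter names are my own illustration, not Adobe's API):

```python
def compose_prompt(subject, style=None, lighting=None, composition=None, tone=None):
    """Hypothetical helper: merge a base subject prompt with Firefly-style
    parameters (style, lighting, composition, color & tone) into one prompt."""
    parts = [subject] + [p for p in (style, lighting, composition, tone) if p]
    return ", ".join(parts)

base = "a lighthouse on a rocky coast at dusk"

# First pass: photorealistic default
print(compose_prompt(base, style="photo"))

# Iterate: same subject, different look -- no prompt surgery required
print(compose_prompt(base, style="digital art", lighting="low lighting",
                     composition="shot from below", tone="cool tone"))
```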
"Robot that toasts your bread and applies butter to it, in the style of rick and morty" produced some impressive results in Firefly: ImageImageImageImage
You're probably wondering how hands look? Pretty coherent!

Even with a prompt like this:

Punjabi man in flannel shirt using AI voice dictation to create the client pitch deck while drinking espresso in a cozy cabin, while wearing an Oculus VR headset, with a laptop on the table
@ericsnowden made an awesome analogy about ingredients: taking decades of Adobe tech and combining it with these newer models to cook up amazing recipes. And I have to say, the dishes do look good! Case in point:
Adobe will be expanding access gradually -- so it won't exactly be a free-for-all. During the beta period, there are some noteworthy limitations to be aware of -- critically, commercial use is not allowed.

So what do you think of Adobe's entry? Share your thoughts below.
That's a wrap! If you enjoyed this deep dive on Adobe Firefly (adobe.com/firefly):
- RT the thread below to share with your audience
- Follow @bilawalsidhu to stay tuned for more creative tech magic
- Subscribe to get these right in your inbox: creativetechnologydigest.substack.com

More from @bilawalsidhu

Dec 16, 2024
BREAKING: Google just dropped Veo 2 and Imagen 3 -- their next gen video and image generation models.

Turns out Google's been closing the gap quietly -- not just on LLMs, but on visual creation too.

Here's everything you need to know w/o the hype 🧵
1/ First, let's get the Veo 2 updates out of the way:

• Up to 4K resolution (woot!)
• Increased detail & realism
• Improved human movement & expressions
• Better physics modeling & temporal coherence

On Meta's Movie Gen Bench, Veo holds it down against top video models:
2/ Veo 2 now speaks cinematographer. Instead of wrestling w/ technical params or guessing how Gemini captioned stuff, you can just say what you want using terms you're used to. Legit useful for production workflows.

E.g. here's a prompt to generate a classic car chase scene:
Dec 11, 2024
BREAKING: Here are the coolest things Google announced today; I got the press briefing yesterday, and here are my favorites w/o the hype.

TL;DR Gemini 2.0 brings multimodal creation, research agents, browser control, and massive compute upgrades. Plus dope research.

🧵 Let's dive in
1/ Let's talk Gemini 2.0 Flash:
• 2x faster than 1.5 Pro while outperforming it on key benchmarks
• Native tool use (Search + custom functions)
• New Multimodal Live API for realtime audio/video streaming w/ smart interrupt detection
• Available today; more model sizes in Jan
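If you want to poke at 2.0 Flash yourself, here's a minimal sketch assuming the google-genai Python SDK (pip install google-genai); the exact model identifier and API surface may differ from what's shown:

```python
from google import genai

client = genai.Client(api_key="YOUR_API_KEY")  # placeholder key

response = client.models.generate_content(
    model="gemini-2.0-flash-exp",  # experimental 2.0 Flash model at launch
    contents="Summarize the key Gemini 2.0 announcements in three bullets.",
)
print(response.text)
```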
2/ Gemini 2.0 *finally* gets native multimodal output
• Can generate images + text combined naturally
• Steerable text-to-speech in multiple languages/accents
• Alas, early access only for now; wider rollout in Jan
Viggle's doing some cool stuff with it: ai.google.dev/showcase/viggle
Oct 17, 2024
Heads up! Mosaic dropped a pretty wild dataset of 1.26 million 360° images of Prague 🤯

If you're a researcher, creator or developer into 3D/AI/Geo, I think you're gonna wanna play with this

Here's the scoop on this 15 TERAPIXEL dataset & the crazy things you can do with it 🧵
The specs are nuts:

• 210,469 panos in 13K
• 1,262,814 source images (6 x 12MP)
• 1 image every meter
• 2cm pose accuracy

Not quite Google level, but the pano density is WAY higher. An image every meter means it's perfect for all sorts of spatial 3D stuff.
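Quick sanity check using only the figures quoted above: exactly 6 source images per pano, and at roughly 12 MP per source image you land right around the advertised 15 terapixels.

```python
panos = 210_469
source_images = 1_262_814
pixels_per_image = 12_000_000  # ~12 MP per source image

print(source_images / panos)                    # 6.0 images per pano
print(source_images * pixels_per_image / 1e12)  # ~15.2 terapixels
```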
The dataset is global shutter too, captured with the 6-lens Mosaic X camera -- a rig built for serious real-world data collection.

It captures Prague's architecture, streetscapes, and urban environment in incredible detail.
Feb 16, 2024
OpenAI just dropped their Sora research paper.

As expected, the video-to-video results are flipping spectacular 🪄

A few other gems:
Another superpower unlocked is the ability to seamlessly blend individual videos together.

Note how the drone transforms into a butterfly as we gradually find ourselves underwater
Connecting videos is a surprisingly powerful primitive.

Example: Drone POV shots of Jeeps are cool, but how about blending one with another clip of a cheetah?

End result: your Jeep is now being chased by the cheetah, giving me Harold & Kumar vibes 😂
Dec 30, 2023
Top Gun Maverick. For a movie with no CGI, it sure has a lot of it.

A whopping 2,400 (!!) visual effects shots in fact.

But wait, wasn't everything filmed practically? 😉

Sure was. Yet almost every jet you see on-screen is CGI.

Let's dive into this "invisible" movie magic 👇
For starters, the level of practical filming in Top Gun is cool.

Much of the principal photography was filmed "for real" - ensuring the action always felt anchored in reality.

But make no mistake - there's a ton of invisible CGI involved that you probably didn't notice. 👇
The team used L-39 planes with tracking markers to serve as "stand-ins" for the CG planes.

The VFX team meticulously "re-skinned" the stand-in planes with CG ones like the F-14 & Su-57.

This helped match the look, lighting, and atmosphere, since they always had an IRL reference.
Oct 8, 2023
With Gaussian Splatting you get 3D editing support! So you can select, move, and delete stuff, and apply shader fx. This type of editing has been tedious to do with NeRFs and their implicit black-box representations.

Case in point (1/3) by @hybridherbst:
Case in point (2/3): repurpose your point cloud shaders to make something unreal, like @Ruben_Fro
Case in point (3/3): and you can still get amazing lighting effects -- translucent vegetation, bloom and more!
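That editability falls out of the representation: a splat scene is an explicit list of points with per-point attributes, so "select and delete" is essentially array filtering. A toy sketch of the idea (made-up attribute layout, not any particular splatting library):

```python
import numpy as np

# Toy splat scene: explicit per-point attributes (positions, colors, opacities).
n = 100_000
positions = np.random.uniform(-10, 10, size=(n, 3))
colors = np.random.rand(n, 3)
opacities = np.random.rand(n)

# "Select" the splats inside a box around the object you want to remove...
box_min, box_max = np.array([-1.0, -1.0, -1.0]), np.array([1.0, 1.0, 1.0])
selected = np.all((positions >= box_min) & (positions <= box_max), axis=1)

# ...and deleting them is just masking the arrays. With an implicit NeRF,
# the scene lives inside network weights, so there's nothing to index into.
positions, colors, opacities = positions[~selected], colors[~selected], opacities[~selected]
```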
