Javi Lopez ⛩️ Profile picture
Aug 5, 2023 15 tweets 5 min read Read on X
🔴 PERFUSION: a generative AI model from NVIDIA that fits on a floppy disk 💾

It takes up just 100KB. Yes, you heard it right, much less than any picture you take with your mobile phone! Why is this revolutionary and can change everything?

I'll tell you 🧵👇 Image
Perfusion is a really lightweight "text-to-image" model (100KB) that also trains in just 4 minutes.

🔗 Link: https://t.co/502nEzWL2eresearch.nvidia.com/labs/par/Perfu…
Image
It allows creatively portraying objects and characters, maintaining their identity, using a novel mechanism they have called "Key-Locking." Image
Perfusion can also combine individually learned concepts into a single generated image. Image
Moreover, it allows controlling the balance between visual alignment and the text prompt at the time of inference, covering the entire Pareto front with just a single trained model. Image
And why is this revolutionary?

For several reasons. Image
1️⃣ Such great optimization means that we will soon have truly powerful AI models integrated into our mobile phones, computers, etc. Much lighter, faster to train, and consuming less computing power. Image
2️⃣ The costs of training models will be drastically reduced in the future with optimizations like this and new techniques that allow everything to be streamlined.
3️⃣ If, in just 100KB, a new technique (key-locking) has achieved such a large increase in the coherence of objects/characters between generations, as in this example, it means that we have only SCRATCHED THE SURFACE of what the future Generative AI will be able to do. Image
In short, a massive piece of news that I don't understand why it's going so unnoticed. Don't be fooled by "the low quality" of the images. The potential it has is truly MASSIVE. Image
If you liked this and would like me to continue writing similar threads, an RT on the first tweet of the thread will encourage me to keep doing so. Thanks! 😉👇

Watch out for this important detail 👇

Mentally, I'm already calling it: 'the miniLoras'

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Javi Lopez ⛩️

Javi Lopez ⛩️ Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @javilopen

Nov 27
IT'S FINALY HERE

🔥 Magnific Skin Enhancer 🔥

• No more AI plastic skins!
• Enhance EVERYTHING in your image, not only the skin!
• 3 different flavours + easy presets: improve light, level or reality, color grading, etc.

Let's dive in + tutorials + tips 🧵👇 Image
Image
First of all, if you can't wait, here you have the link! AVAILABLE NOW on Magnific & rolling out to Freepik users today!

I’ll also randomly grant access to some of you who reply with a interesting message 😘

👇👇👇

magnific.ai
You’ve been asking for a Skin Enhancer in Magnific for ages!

So I want to apologize for taking so long. But hey, better late than never!

Skin Enhancer is built for pros who need CONTROL, so it comes in 3 flavors to cover every level:

- Flexible + presets
- Creative
- Faithful Image
Read 14 tweets
Oct 26
Professional photographers don’t know they can improve their work with advanced AI upscaling.

I tested it on my old Nikon photos from Tokyo (2014) and the results blew my mind 🤯

Super quick tutorial 🧵👇
1. Upscale in Magnific AI:

- Precision
- v2 (Sublime)
- 6x (usually 4x is ok, but this one looked better)
- Sharpen: 15%
- Smart Grain: 2% (the photo was already quite grainy)

2,000 x 1,328 => 12,000 x 7,968 🤯 Image
2. That's all, enjoy! 😂

I was never so easy to improve your old (or new) professional photograpies, vfx, illustrations, etc!

Enjoy!

Read 11 tweets
Oct 24
AI upscaling in 2025 is absolutely wild 🤯

This shouldn't be possible... But here we are!

Super quick tutorial 👇
This is a combo of two upscalers inside Magnific:

1. One pass of Magnific Creative (2x)

"Vivid" preset with Illusio engine (that is perfect for architecture, 3d, etc): Image
2. A second pass of Precision v2 Sublime (4x)

With Sharpen 30% and just a small bit of Smart Grain (4%) Image
Read 6 tweets
May 25
There's no way Hollywood won't be affected by this.

I created this whole scene in less than 2h using Veo 3 (AI video), Magnific (upscaling), Suno (music, except the first 3s 😉) and CapCut (editing).

The Cambric Explosion of content has already started!

Full tutorial 👇
1. Idea

I've had this idea (a mood) of mixing a 7-eleven at night and a 🐲 for over 2y now.

The concept came to me then, but it wasn't until now that I've been able to bring it to life visually.

Veo 3 feels like being back in Apr 2022, when DALL·E 2 hit my brain like a truck.
2. Video generation using Veo 3 inside Freepik (not yet available but soon)

I used ChatGPT to craft all the prompts and then did all the video generation inside Freepik using Veo 3.

Something I've learned is that Veo 3 can handle really long and complex prompts, so don't hesitate to use very detailed descriptions to express the vision you want to create.

Example:

"Close-up shot of a pair of hands reaching toward a dusty black tome resting on a low shelf inside a dimly lit 7-Eleven. The book has a worn leather cover with a flaming dragon etched in glowing, fiery lines across the front. Above the image, an unreadable title is inscribed in ancient golden runes. The hands pick up the book slowly and carefully, as if sensing its weight and age. At the edges of the frame, part of a red puffy vest is visible over a faded denim jacket and a plaid shirt sleeve, revealing just enough of the young man’s layered clothing to hint at his presence."Image
Read 9 tweets
May 22
Just got access to Veo 3 and the first thing I did was try the Will Smith spaghetti test. SOUND ON
Spaguettis are so cooked. But flamenco is so back!

"A dog dressed as a female flamenco dancer dancing flamenco on a tablao in a bar in Seville."

😅😂
One of my usual favourite tests. The soldier riding a dog during WW2.
Read 24 tweets
Apr 29
⚡ IT'S FINALLY HERE!

F-Lite: our first foundational model for image generation. A collaboration between Freepik ♥️ Fal.

• Open Source
• Fully commercially usable
• 10B parameter DiT trained on 80M images
• Trained with 100% licensed data

Link + info 🧵👇 Image
We’ve been secretly working on this for months! It feels good to finally share it!

LINKS:

• Regular version: more predictable and prompt-faithful, but less artistic: fal.ai/models/fal-ai/…

• Texture version: is more chaotic and error-prone, but delivers better textures and creative compositions: fal.ai/models/fal-ai/…

• Paper: github.com/fal-ai/f-lite/…

Enjoy!Image
Image
Image
Image
Congrats to @cloneofsimo, @ivanprado, @kuer5ord, @info_libertas and the rest of the team that built this model from scratch!

I did nothing except to test it 😍 Image
Image
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(