TLDR: 1. Replaces the CNN encoder and decoder with a vision transformer ("ViT-VQGAN"), leading to significantly better speed-quality tradeoffs compared to the CNN-based VQGAN.
2. Vanilla VQVAE often learns rarely used / "dead" codebook vectors, leading to wasted capacity. Here, they add a linear projection of the code vectors into a lower-dimensional "lookup" space (sketched in the code below). This factorization of embedding / lookup consistently improves reconstruction quality.
3. Encoded latents and codebook vectors are L2-normalized, placing all of them on the unit sphere, where the Euclidean distance between a latent and a codebook vector corresponds to their cosine similarity. This further improves training stability and reconstruction quality.
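Points 2 and 3 combine naturally into a single quantization step. Here is a minimal PyTorch sketch of a factorized, L2-normalized codebook lookup; the class name, dimensions, and straight-through detail are illustrative assumptions on my part, not the paper's exact implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FactorizedNormalizedCodebook(nn.Module):
    """Sketch of a factorized codebook with L2-normalized lookup (assumed dims)."""

    def __init__(self, num_codes=8192, embed_dim=768, lookup_dim=32):
        super().__init__()
        # Codes live directly in the low-dimensional "lookup" space.
        self.codebook = nn.Embedding(num_codes, lookup_dim)
        # Factorization: project encoder outputs down for the lookup...
        self.down = nn.Linear(embed_dim, lookup_dim)
        # ...and project the selected codes back up for the decoder.
        self.up = nn.Linear(lookup_dim, embed_dim)

    def forward(self, z):  # z: (batch, tokens, embed_dim)
        z_low = F.normalize(self.down(z), dim=-1)           # unit sphere
        codes = F.normalize(self.codebook.weight, dim=-1)   # unit sphere
        # On the unit sphere, minimizing Euclidean distance is equivalent to
        # maximizing cosine similarity, so a dot product picks the nearest code.
        idx = (z_low @ codes.t()).argmax(dim=-1)
        z_q = codes[idx]
        # Straight-through estimator so gradients still reach the encoder.
        z_q = z_low + (z_q - z_low).detach()
        return self.up(z_q), idx
```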
Inspired by the amazing work of @HvnsLstAngel I've been experimenting with a "color-quantized VQGAN"
Essentially, I introduced a codebook of possible colors and applied quantization in RGB space.
It's always fascinating how removing entropy can make samples more interesting...
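For concreteness, a rough sketch of what "quantization in RGB space" could look like: snap each pixel to its nearest entry in a fixed color codebook. The palette size and helper name here are my assumptions, not the author's actual code.

```python
import torch

def quantize_colors(images, palette):
    """images: (B, 3, H, W) in [0, 1]; palette: (K, 3) RGB codebook."""
    b, c, h, w = images.shape
    pixels = images.permute(0, 2, 3, 1).reshape(-1, 3)   # (B*H*W, 3)
    dists = torch.cdist(pixels, palette)                  # (B*H*W, K)
    nearest = palette[dists.argmin(dim=-1)]               # (B*H*W, 3)
    return nearest.reshape(b, h, w, 3).permute(0, 3, 1, 2)

# e.g. an 8-color palette drawn at random
palette = torch.rand(8, 3)
out = quantize_colors(torch.rand(2, 3, 64, 64), palette)
```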