Jonathan Fischoff Profile picture
Computer person.
Sep 19, 2024 9 tweets 4 min read
This is the most exciting paper I've read in a while.

Alternate title could have been: "One weird trick to increase your depth map inference 200x."

arxiv:
github:

Let's go through the details 🧵 1/9 arxiv.org/abs/2409.11355
github.com/VisualComputin…
Image Some background. Marigold is a fine-tuned 8-channel version of SD2.1 for depth maps. It concatenates a clean guide image with noisy latent along the channel direction.

It works great but is slow. The authors looked at what happened if they used only a single sampling step. 2/9 Image
Nov 30, 2023 11 tweets 4 min read
“Animate Anyone” was released last night for making pose guide videos. Lets dive in.

Paper:
Project:
🧵1/ arxiv.org/abs/2311.17117
humanaigc.github.io/animate-anyone/
Image First some examples because they are very good 2/