More from @CoffeeVectors

CoffeeVectors

@CoffeeVectors

Aug 12

More Wan 2.2 tests on wavespeed.ai. Here I wanted to compare it directly to Veo 3 and also see how tight into the eye I could get with complex zooms. Again I’m really surprised by what Wan 2.2 can do. The prompts are the same across models, so Wan seems to be able to do more with the text, at least with camera moves and optics. 🧵

This is an example of a prompt I used for the multi-axis zooms. Admittedly it’s overkill and Wan isn’t following all of it, so it’s not an exact science and I’m still figuring out the mechanics. You definitely gotta play around.

Photoreal; soft-contrast, midtone-rich grade; log-gamma tone mapping; gentle highlight roll-off; lifted blacks with preserved shadow detail; heavy wet-lens optics (dense micro-beading, streaking droplets, veiling glare, edge softness, intermittent smear/ghosting), anamorphic-style streak flares and comet tails from torchlight; lowered micro-contrast; shallow depth of field; cool moonlit ambient + warm torch practical mix; night rain with streaking lines; bokeh firelight; dynamic dutch angles; exposure continuity; camera path: multi-axis orbit-dolly with roll and tilt, continuous zoom-out; deliberately cross the 180° eyeline axis. Scene: orc/ogre war-leader with tusks and braided hair in a torch-lit, rain-soaked war camp; weathered metal/leather armor; avoid proximity/body-position cues. Generate 4 frames, one per second; single continuous camera move; interpolate between keyframes; no cuts. 1) Start on the provided extreme eye close-up and pull back slowly, low angle looking up, heavy roll (−20°), iris nearly full frame, tear-film ripples and micro-speculars, bead clusters at edges; 2) tight close-up, begin lateral orbit with pitch up, roll easing toward neutral, pronounced torch flare/halo and smear trails; 3) axis-cross moment during zoom-out—medium oblique as the camera wraps over the brow/bridge with a pitch down then up, roll swings to +15°, brief occlusion and flare streaks; 4) end on the flipped wide three-quarter profile from the opposite side, dutch +20°, rain sheets, warm torch bokeh, wet-lens haze and edge falloff persistent.

I’m basically asking for different shots, but Wan tends to interpolate rather than jump cut, so it works like a kind of pseudo-keyframe feature. These prompts are generated by GPT 5, which is why they’re so verbose. I’m mostly describing the shots and dynamics and letting the AI enrich that. I’m sure you can get away with much less, but I haven’t had the time to really research it more.

CoffeeVectors

@CoffeeVectors

Oct 12, 2024

Made this video with iPhone photos I took of my friend Stephanie that I used as keyframes in @LumaLabsAI! With the camera controls I can gen transitions between shots. I also built a custom web app in Next.js to help me speedramp and edit all the clips! Breakdown 🧵(1/18)

Basically, if you have some photos, throw in a start and end frame, start your prompt with the camera move, and then add stuff like “smooth camera, steadicam”. I find minimal prompts work best. And don’t enhance the prompt (that tends to add handheld motion). (2/18)

Sometimes I’ll add ‘motion blur’, ‘drone racing’ or ‘music video’ to see how it changes the results. “Perfect face” can help reduce cross-eyes, etc. You’ll still need to experiment, but long prompts or prompts describing the scene don’t usually help with this effect. (3/18)
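
As a rough illustration of the prompt shape described in (2/18) and (3/18), here is a minimal Python sketch that puts the camera move first, then the stabilizing keywords, then any optional extras. The helper name `build_prompt` and the example camera move "orbit left" are hypothetical; this only assembles a string and does not call any Luma API.

```python
def build_prompt(camera_move, stabilizers=("smooth camera", "steadicam"), extras=()):
    """Assemble a minimal prompt: camera move first, then short keywords.

    Hypothetical helper for illustration only; it just joins strings.
    """
    parts = [camera_move, *stabilizers, *extras]
    # Drop empty entries and stray whitespace, keep the order intact.
    return ", ".join(p.strip() for p in parts if p.strip())


# "orbit left" is a made-up camera move used as a placeholder.
print(build_prompt("orbit left"))
# orbit left, smooth camera, steadicam

# Optional extras mentioned in the thread go at the end.
print(build_prompt("orbit left", extras=("motion blur",)))
# orbit left, smooth camera, steadicam, motion blur
```

Keeping the extras as a separate argument makes it easy to A/B one keyword at a time, which matches the "experiment and compare" approach in the thread.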

CoffeeVectors

@CoffeeVectors

Jul 20, 2024

Testing how LivePortrait works lip syncing 24fps lyrics on top of slow motion footage. Was curious to see if it might help with music videos. Quick explanation below! 🧵

Started with a clip from an Eminem song and passed it through Adobe Podcast to get the acapella. Passed that through @hedra_labs with a Midjourney portrait for the face animation. Used that as input into LivePortrait using ComfyUI and a slowmo clip from Die Hard.

I find it helps for the LivePortrait input to have a plain background. Otherwise you might get extra warping in the background behind the head.

CoffeeVectors

@CoffeeVectors

Dec 24, 2023

Made this video (🎶) with a Midjourney v6 image! Started by upscaling/refining with @Magnific_AI, pulled a Marigold Depth Map from that in ComfyUI, then used that as a displacement map in Blender, where I animated this camera pass with some relighting and narrow depth of field.🧵1/12

Here's the base image and the before/after in @Magnific_AI. Even though MJv6 has an upscaler, Magnific gave me better eyelid and skin details for this case. (Fun fact, this image was from a v4 prompt from summer last year, when MJ had just released a new beta upscaler.) 2/12

Next step was using the new Marigold Depth Estimation node in ComfyUI to get an extremely detailed depth map. Note that I'm saving the result as an EXR file (important for adjusting levels later), and that the remap and colorizing nodes are just for visualization. 3/12
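
One plausible reason a float format like EXR matters for adjusting levels later (this is an assumption, the thread does not spell it out) is that 8-bit quantization throws away the small depth differences you later want to stretch. The sketch below uses plain Python with made-up depth samples; it does not read EXR files or touch ComfyUI, and `stretch` is a hypothetical helper.

```python
def stretch(values, lo, hi):
    """Levels adjustment: remap [lo, hi] to [0, 1], clamping outside values."""
    return [min(1.0, max(0.0, (v - lo) / (hi - lo))) for v in values]


# Hypothetical depth samples squeezed into a narrow range (about 0.40 to 0.41).
depth = [0.40 + i * 0.0001 for i in range(101)]

# Float path (what a float EXR preserves): stretch the raw values directly.
float_levels = len(set(stretch(depth, 0.40, 0.41)))

# 8-bit path: quantize to 0..255 first, then apply the same stretch.
quantized = [round(v * 255) / 255 for v in depth]
eight_bit_levels = len(set(stretch(quantized, 0.40, 0.41)))

# The float path keeps far more distinct depth steps than the 8-bit path,
# which would show up as banding or terracing in a displacement map.
print(float_levels, eight_bit_levels)
```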

CoffeeVectors

@CoffeeVectors

Nov 15, 2023

Testing LCM LoRAs in an AnimateDiff & multi-ControlNet workflow in ComfyUI. I was able to process this entire Black Pink music video as a single .mp4 input. The LCM lets me render at 6 steps (vs 20+) on my 4090 and uses up only 10.5 GB of VRAM. Here's a breakdown 🧵[1/11]

Entire thing took 81 minutes to render 2,467 frames, so about 2 seconds per frame. This isn't including the time to extract the img sequence from video and gen the ControlNet maps. Used Zoe Depth and Canny ControlNets in SD 1.5 at 910 x 512. [2/11]
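
The per-frame figure above can be checked with a couple of lines of Python. The function name `seconds_per_frame` is just an illustrative helper; the inputs are the 81 minutes and 2,467 frames quoted in the tweet.

```python
def seconds_per_frame(total_minutes, frames):
    """Average render time per frame, in seconds."""
    return total_minutes * 60 / frames


spf = seconds_per_frame(81, 2467)
print(f"{spf:.2f} s/frame")
# 1.97 s/frame
```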

Improving the output (a stronger style, more detail, a less rotoscope-ish feel) will require adjusting individual shots. But doing the entire video in one go lays down a rough draft for you to iterate on: build on fun surprises, troubleshoot problem areas. [3/11]

CoffeeVectors

@CoffeeVectors

May 25, 2023

Timelapse of using #photoshop’s new generative fill feature to connect two images and build a scene around them using blank prompts. Was inspired by @MatthieuGB’s post doing something similar! Notice how I’m not adding any descriptions, but letting gen fill present options for… twitter.com/i/web/status/1…