Bilawal Sidhu Profile picture
๐Ÿช„ Blending Realities l ๐ŸŽ™๏ธ Host, TED AI Show | ๐Ÿš€ Scout, A16z | ๐ŸŽฌ 1.4M+ Subs & 450M+ Views | ๐ŸŒŽ Ex-Google PM, 3D Maps & AR/VR ๐Ÿฅฝ | ๐ŸŽ™๏ธ https://t.co/QmJQ0BL7Ik
6 subscribers
Oct 17 โ€ข 14 tweets โ€ข 6 min read
Heads up! Mosaic dropped a pretty wild dataset of 1.26 million 360ยฐ images of Prague ๐Ÿคฏ

If you're a researcher, creator or developer into 3D/AI/Geo, I think you're gonna wanna play with this

Here's the scoop on this 15 TERAPIXEL dataset & the crazy things you can do with it ๐Ÿงต The specs are nuts:

โ€ข 210,469 panos in 13K
โ€ข 1,262,814 source images (6 x 12MP)
โ€ข 1 image every meter
โ€ข 2cm pose accuracy

Not quite Google level, but the pano density is WAY higher. An image every meter means it's perfect for all sorts of spatial 3D stuff.
Feb 16 โ€ข 9 tweets โ€ข 3 min read
OpenAI just dropped their Sora research paper.

As expected, the video-to-video results are flipping spectacular ๐Ÿช„

A few other gems: Another superpower unlocked is the ability to seamlessly blend individual videos together.

Note how the drone transforms into a butterfly as gradually find ourselves underwater
Dec 30, 2023 โ€ข 9 tweets โ€ข 4 min read
Top Gun Maverick. For a movie with no CGI, it sure has a lot of it.

A whopping 2,400 (!!) visual effects shots in fact.

But wait, wasn't everything filmed practically? ๐Ÿ˜‰

Sure was. Yet almost every jet you see on-screen is CGI.

Let's dive into this "invisible" movie magic ๐Ÿ‘‡ For starters, the level of practical filming in Top Gun is cool.

Much of the principal photography was filmed "for real" - ensuring the action always felt anchored in reality.

But make no mistake - there's a ton of invisible CGI involved that you probably didn't notice. ๐Ÿ‘‡
Oct 8, 2023 โ€ข 5 tweets โ€ข 2 min read
With Gaussian Splatting you get 3D editing support! So you can select, move, and delete stuff; apply shader fx. This type of editing has been tedious to do with NeRFs and their implicit black box representations.

Case in point (1/3) by @hybridherbst:
Case in point (2/3): repurpose your point cloud shaders to make something unreal like @Ruben_Fro
Jun 1, 2023 โ€ข 8 tweets โ€ข 4 min read
AI just took 3D modeling to a whole new level ๐Ÿคฏ

Introducing Neuralangelo, a new AI model by NVIDIA that reconstructs mind-blowingly detailed 3D surfaces directly from 2D videos โ€” like photogrammetry on steroids. ๐Ÿง™๐Ÿปโ€โ™‚๏ธ

Keep reading to see this crazy magic for yourself ๐Ÿงต So, what the heck is is this "photogrammetry" thing NVIDIA is supercharging with AI?

TL;DR photogrammetry is the art & science of measuring stuff in the real world using images and other sensors (e.g. LiDAR).

Here's a 60 second primer:
May 30, 2023 โ€ข 5 tweets โ€ข 2 min read
๐ŸŒ Minecraft2Reality ๐ŸŒ

Ever look at the blocky world of Minecraft and think, "Yeah, but what if it was real?" No? Just me then. ๐Ÿ˜Œ

Here's what happens when you feed Minecraft screen captures to an AI with an appetite for reality. ๐Ÿ‘‡ ๐ŸŒ ๐ŸŽฎ Welcome to reality, Minecraft-style ๐ŸŽฎ ๐ŸŒ

I crammed a Minecraft screen capture into a fancy AI blender โ€“ namely ControlNet, EbSynth, and Stable Diffusion.

The result? Pure visual umami.

Imagine giving all your favorite video games an instant upgrade.
May 29, 2023 โ€ข 19 tweets โ€ข 8 min read
Video-to-video AI models are like Snapchat filters on steroids ๐Ÿ”ฅ

Capture a video once and transform it infinitely in post.

See below: Original vs. photoreal vs. cartoon-style.

Tons of stylistic range, yet plenty of room for improvement

Here's how to level up your AI videos๐Ÿงต Watch this classic Office Space clip.

Two main areas of improvement:

1. Stylistic Consistency: characters & environment transform abruptly between keyframes

2. Temporal Consistency: facial & body performance is often lost

Let's unpack each problem and discuss solutions ๐Ÿ‘‡
May 29, 2023 โ€ข 4 tweets โ€ข 2 min read
3D games + AI agents = win ๐Ÿ”ฅ

Such a wild demo by NVIDIA.

Really makes me want to upgrade this ChatGPT-powered Tech CEO Debate Simulator to work in Omniverse ๐Ÿ˜

Topic: "Can we regulate AI successfully?" ๐Ÿ‘‡ Here's Varun, who already made such a 3D simulator inside Unreal Engine with multi-GPT agents that have personality, memory and have topic-based convos.

This is AI Seinfeld on steroids:
May 27, 2023 โ€ข 12 tweets โ€ข 5 min read
I guess Trump decided to take a trip to India, and it was pretty lit ๐Ÿ˜

Midjourney (AI) rendition of celebs continues to impress ๐Ÿงต ImageImageImage "Better than the Chicken Dance at Mar-a-Lago, folks!" Image
May 10, 2023 โ€ข 11 tweets โ€ข 7 min read
๐Ÿš€ Big news today with Google + Adobe joining forces!

We're talking about 3D content anchored to the real world at insane scale๐ŸŒ And of course, AI had a role to play.

I've got early access, and let's just say the physical & digital worlds are blurring ๐Ÿ˜Ž Let's get into it!๐Ÿงต ๐Ÿ“ฝ๏ธ Remember when Times Square and Piccadilly Circus was transformed into a live @gorillaz concert with AR?

Imagine that kind of immersive experience, but created by ANYONE ๐Ÿคฏ

That's the level of game-change we're talking about! โฌ‡
May 10, 2023 โ€ข 7 tweets โ€ข 3 min read
๐ŸŒณ๐ŸŽฎ The physical and digital worlds are converging. I used AI to transform the historic Lodhi Garden in India into a Minecraft landscape ๐Ÿ•Œ๐ŸŒณ

๐Ÿงฉ๐Ÿƒ I created a 3D NeRF of this serene garden using GoPro video, then transformed it into the blocky Minecraft aesthetic usingโ€ฆ twitter.com/i/web/status/1โ€ฆ It's crazy how fast things move.

Here's results from 6 months ago -- a jittery mess.

Just imagine where we'll be in 6 more months.
May 9, 2023 โ€ข 8 tweets โ€ข 8 min read
Speaking at TED was incredible. Grateful for the opportunity and the experience.

Look out for the full talk and panel on @TEDTalks in the coming weeks. Or check it out on TED Live today.

In the meantime, enjoy some photos and takeaways from an unforgettable week in Vancouver: ImageImageImage But first -- a 3D scan at the TED venue reskinned by AI. Because why the heck not!

Unsurprisingly, my TED Talk was titled "Blending Reality & Imagination with Artificial Intelligence" ๐Ÿ˜
May 7, 2023 โ€ข 4 tweets โ€ข 2 min read
๐Ÿš˜๐ŸŒŒ AI-Powered Joyride: Cyberpunk San Francisco ๐ŸŒ‰โœจ

๐Ÿ™๏ธ The world is changing quickly. Brace yourself as reality and fantasy intertwine, with AI turning into lenses through which we'll see the world. ๐ŸŒ๐ŸŒ†

โš™ Brought to life by Kaiber Video2Video (featuring ControlNet, Stableโ€ฆ twitter.com/i/web/status/1โ€ฆ Actually, on second thought -- it's more like Solarpunk San Francisco
๐Ÿ˜Ž๐ŸŒณ๐Ÿก๐ŸŒ† ๐ŸŒ‰ ImageImage
Apr 30, 2023 โ€ข 5 tweets โ€ข 2 min read
๐Ÿž๏ธ An Otherworldly Waterfall ๐Ÿ˜
๐Ÿก Solar-punk inspired AI video
๐Ÿ”ฎ NeRFs + ControlNet + EbSynth = Reality Bending Magic! ๐Ÿช„ 2/ Statue of Liberty ๐Ÿ—ฝ materializing and dematerializing โœจ
Apr 5, 2023 โ€ข 10 tweets โ€ข 6 min read
๐Ÿคฏ Wondering why creators like @SirWrender are losing their minds over @WonderDynamics?

Short answer: itโ€™s a middle ground between 3D, VFX and editorial tools โš”๏ธ

So what took 3 days across many tools โ€” takes 3 minutes in just one tool!

๐Ÿงต Thread (0/8): 1/8 Historically, digital creation tools have lived in specialized silos โ€” all chained together in the classical waterfall fashion.

This works quite well for producing long-form content in multi-year productions โ€” with teams of specialized artists who do one thing very well. Image
Apr 3, 2023 โ€ข 5 tweets โ€ข 3 min read
Creators rejoice โ€” because NeRFs are finally coming to Unreal! ๐ŸŽ‰

Easily digitize a space or place with 2D images alone, and conjure it up later with photorealistic rendition.

Combined with the real-time nature of Unreal โ€” sky is the limit for VFX ๐Ÿ”ฅ
These are full blown volumetric NeRFs. So I no longer need to composite VFX elements into source imagery *before NeRFing* to pull off effects like these glorious muzzle flashes:
Apr 1, 2023 โ€ข 22 tweets โ€ข 10 min read
What a week for AI! Not yet scary, but a feeling is in the air. Things are heating up and people are conflicted.

Why are the brightest minds in AI asking for a 6 month pause, while others say it doesn't go far enough? ๐Ÿคฏ

Here's why this debate deserves our attention.

๐Ÿงต Thread A modicum of relief was bestowed upon us this past week, after a two-week period riddled with launch-after-launch of the most advanced AI capabilities the world has ever seen.

The outcome? Unprecedented AI power to the people ๐Ÿ‘‡
Mar 25, 2023 โ€ข 14 tweets โ€ข 9 min read
Midjourney v5 has pushed into photorealism, a goal which has eluded the computer graphics industry for decades (!) ๐Ÿคฏ

Insane progression, and all that by 11 people with a shared dream.

๐Ÿงต Let's explore what these breakthrough in Generative AI mean for 3D & VFX as we know it... First off, Midjourney v5 is far more photorealistic out-of-the-box. Where as it's predecessor has a more painterly, stylized bent.

Here's a thorough comparison of v5 vs v4 incase you want to go deeper. But let's keep going...
Mar 23, 2023 โ€ข 7 tweets โ€ข 4 min read
If you though reskinning 2D videos was fun, how about reskinning 3D captures of the world?

That's exactly what you get when you combine NeRFs with InstructPix2Pix in this new paper by @ayaanzhaque et al.

Mini-thread๐Ÿงต InstructPix2Pix is applied to the input 2D views used for training to NeRF in an iterative fashion.

Notice below that the edits are gradually becoming more consistent over time.

I'm impressed with how well it works!
Mar 21, 2023 โ€ข 14 tweets โ€ข 8 min read
Been hands-on with the beta of Adobe's cutting-edge Generative AI tool, and I'm impressed! ๐Ÿคฏ

Here's a taste of the power of #AdobeFirefly ๐ŸŽ‡ and what sets it apart in the increasingly crowded world of #AI art.

Thread ๐Ÿงต๐ŸŽจ ImageImage For starters, Adobe Firefly isn't one thing. It encompasses multiple AI models. It's a portal for testing new capabilities with creators, and eventually graduating them into products like Photoshop & Premiere that creators know and love. Meeting users where they are, if you will: In my beta access I was abl...
Mar 19, 2023 โ€ข 15 tweets โ€ข 6 min read
3D capture is moving so fast - I scanned & animated this completely on an iPhone.

Last summer you'd need to wrangle COLMAP, Instant NGP, and FFmpeg to make NeRFs.

Now you can do it all inside Luma AI's mobile app. Capture anything and reframe infinitely in post!

Thread ๐Ÿงต Last summer when NVIDIA's Instant NGP dropped, I went through my entire photogrammetry catalog and re-processed everything. This should give you a teaser for the possibilities of ML-based reality capture: