Latest Twitter Threads by @aifunhouse on Thread Reader App

Dec 3, 2022 • 14 tweets • 6 min read

🧵Autocomplete your way to a business like it's 2025

1. Yesterday, @Noahpinion + @tszzl posted a thoughtful take on where AI is headed – autocomplete for everything.

Inspired by their optimism, we used ChatGPT to prompt our way to a new side hustle.

https://twitter.com/Noahpinion/status/1598198817270403073

2. On Wednesday @openai released ChatGPT, an eerily competent chatbot that sent Twitter abuzz with countless screenshots and demo threads.

Since the OpenAI team seems to know a thing or two about running an online business, we figured their chatbot would too.

Lo and behold👇

Nov 30, 2022 • 11 tweets • 7 min read

🧵AI-assisted image prompts

1. In the last week, both @stabilityai and @openai have released major updates to their flagship offerings.

In this thread, we use OpenAI's new #ChatGPT model to help talk us through generating prompts for StabilityAI's new #stablediffusion v2 model.

2. ChatGPT is a new GPT3-powered model focused on conversing with users.

As with InstructGPT3 (the current default GPT3 model), @openai leveraged Reinforcement Learning from Human Feedback (RLHF) to boost the reliability and desirability of responses.

openai.com/blog/chatgpt/

Nov 11, 2022 • 11 tweets • 5 min read

🧵Categorizing complaints to Elon with AI

1. It's a tumultuous time here in tweet town.

For the good of the community, we used language embeddings to help @elonmusk and his team get a bird's-eye view of the 45k+ complaints he recently solicited👇

https://twitter.com/elonmusk/status/1590383937284870145

2. How do you categorize 45k+ replies to an @elonmusk tweet?

STEP 1: Use Twitter's API to pull all 45k replies, sorted by # of likes.

STEP 2: Show GPT-3 a sampling of top replies and ask it to suggest some categories.

STEP 3: Use text embeddings to categorize all the replies.
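
A minimal sketch of STEP 3, assuming each category from STEP 2 has a short text description and that a reply goes to whichever category is closest by cosine similarity. The `toy_embed` function is a hypothetical word-count stand-in so the example runs offline; a real pipeline would swap in an actual text embedding model. The category names and sample replies are invented for illustration.

```python
import math
import re
from collections import Counter

def toy_embed(text):
    """Hypothetical stand-in for a real embedding model: word counts."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def categorize(replies, categories, embed=toy_embed):
    """Label each reply with the category whose description is most similar."""
    cat_vecs = {name: embed(desc) for name, desc in categories.items()}
    labels = []
    for reply in replies:
        vec = embed(reply)
        best = max(cat_vecs, key=lambda name: cosine(vec, cat_vecs[name]))
        labels.append(best)
    return labels

# Invented example categories, as GPT-3 might suggest them in STEP 2.
categories = {
    "bots": "spam bots fake accounts scams",
    "moderation": "bans suspended accounts moderation appeals",
    "features": "edit button feature request longer tweets",
}
replies = [
    "Too many spam bots in my replies",
    "Please add an edit button",
    "My account was suspended with no appeals process",
]
labels = categorize(replies, categories)
for reply, label in zip(replies, labels):
    print(f"{label}: {reply}")
```

A reply that matches no category at all falls back to the first category listed, so a real run would probably want a similarity threshold and an "other" bucket.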

Nov 6, 2022 • 11 tweets • 10 min read

🧵Automated image generation with DALL-E

1. A few days ago @OpenAI announced that #DALLE users could create images programmatically via API.

To celebrate, we took a vintage shot of @eerac, circa 2012, and threw in a healthy assortment of fully automated AI whimsy.

2. Until two days ago, DALL-E users had only ever been able to generate images through @OpenAI's website, labs.openai.com

DALL-E delivers great image quality, but without being able to use it programmatically, it's cumbersome to incorporate into automated workflows.
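
As a rough sketch of what "programmatically" could look like, the snippet below builds an HTTP request for image generation. It assumes OpenAI's image generation endpoint at `/v1/images/generations` with `prompt`, `n`, and `size` fields; the helper name `build_image_request`, the example prompt, and the placeholder key are hypothetical. The request is only constructed here, not sent.

```python
import json
import urllib.request

# Assumed endpoint for OpenAI's image generation API.
IMAGES_URL = "https://api.openai.com/v1/images/generations"

def build_image_request(prompt, api_key, n=1, size="1024x1024"):
    """Build (but do not send) a POST request asking for n images of the prompt."""
    payload = {"prompt": prompt, "n": n, "size": size}
    return urllib.request.Request(
        IMAGES_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

# Placeholder key and invented prompt, for illustration only.
request = build_image_request("a vintage photo with whimsical additions", "YOUR_API_KEY")
print(request.get_method(), request.full_url)
# Sending it would look like: urllib.request.urlopen(request)
```

Keeping request construction separate from sending makes it easy to loop over many prompts in an automated workflow and to inspect each payload before spending API credits.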

Nov 2, 2022 • 11 tweets • 6 min read

🧵Chained Prompts + Web Lookup

1. If you've used GPT-3, you likely asked it to perform a task using a single "prompt" or instruction.

GPT-3's ability to perform tasks in one shot is impressive, but tools like @dust4ai will make it far more powerful.

https://twitter.com/dust4ai/status/1587104029712203778

2. When you prompt GPT-3 to perform a task (e.g. "write a series of tweets about elephants"), it may be more reliable to perform the task in multiple steps:

1: Write a short paragraph about elephants
2: Divide the following paragraph into a series of ~250 character tweets
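
The two steps above can be sketched as a small chain in which the output of the first prompt is pasted into the second. This is a sketch under assumptions, not how @dust4ai implements it: `complete` is a hypothetical callable standing in for any GPT-3 call, and `fake_complete` is an offline stub so the example runs without an API key.

```python
from typing import Callable, List

def run_chain(topic: str, complete: Callable[[str], str]) -> List[str]:
    """Run the two-step chain: write a paragraph, then split it into tweets."""
    # Step 1: ask for a short paragraph on the topic.
    paragraph = complete(f"Write a short paragraph about {topic}.")
    # Step 2: feed the paragraph into a second prompt.
    tweets_text = complete(
        "Divide the following paragraph into a series of ~250 character "
        "tweets, one per line:\n\n" + paragraph
    )
    return [line.strip() for line in tweets_text.splitlines() if line.strip()]

def fake_complete(prompt: str) -> str:
    """Hypothetical offline stub standing in for a real model call."""
    if prompt.startswith("Write a short paragraph"):
        return "Elephants are the largest land animals. They live in herds."
    # Second step: return each sentence of the paragraph on its own line.
    paragraph = prompt.split("\n\n", 1)[1]
    sentences = [s.strip() for s in paragraph.split(". ") if s.strip()]
    return "\n".join(s if s.endswith(".") else s + "." for s in sentences)

tweets = run_chain("elephants", fake_complete)
for tweet in tweets:
    print(tweet)
```

Passing the model in as a callable keeps the chain logic testable on its own; swapping in a real completion function should not require changing `run_chain`.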

Oct 19, 2022 • 13 tweets • 9 min read

1. Text-based image editing is coming in hot!

Between #DALLE, #midjourney and #stablediffusion text-to-image generation is all the rage, but what if your images are off target?

Usually you try new prompts, then use inpainting to make edits... until now.

https://twitter.com/krea_ai/status/1582483390392983552

2. This month multiple papers have come out demonstrating how language can be used to edit an image.

Equally exciting, these papers are being implemented on top of #stablediffusion since it's now openly available (Thanks @EMostaque!)

Let's review some of the ⚡️fast progress...

Oct 17, 2022 • 12 tweets • 5 min read

1. In 2022, text-to-image tech has improved dramatically.

Heading into 2023, text-to-mesh, text-to-video, and text-to-audio models have all been demonstrated.

Today we play fortuneteller and explain how in 2023 you'll likely be able to create full 3D characters from text.

🧵

2. To create a 3D character from text, you'll need to combine a collection of building blocks.

1) You need to create a rigged 3D mesh.
2) You need to define the appearance of the character.
3) You need to define movements/animations.
4) You'll likely want some kind of voice.

Oct 2, 2022 • 18 tweets • 11 min read

1. Hey there young Spielbergs!

Curious about how AI can be used for filmmaking?

It's still early days, but between text-to-image, text-to-audio and AI-driven animation, building blocks are starting to appear.

Today's thread provides an overview.
🧵👇

2. First off, some highlights!

Way back in July @mrjonfinger used @midjourney to produce a coherent short film.

Very solid visuals, but the voices and animation are a bit stilted. We had to rewatch to grok the plot, but it's 100% there once you get it.

https://twitter.com/mrjonfinger/status/1552401566291832832

Oct 1, 2022 • 15 tweets • 8 min read

1. Last week @openai finally removed the waitlist for DALL-E!

Of the AI image creation tools out there, DALL-E is arguably the most accessible.

Today's thread provides a hands-on tutorial for new DALL-E users looking to generate visually consistent assets.

Sticker Time!

2. If you've never used DALL-E, head on over and sign up now!

First announced in April, DALL-E is the OG text-to-image ML model. For months, access was extremely limited, but now everyone, including you, can log in and take this dream machine for a spin.

labs.openai.com

Sep 30, 2022 • 19 tweets • 6 min read

It's Friday and that means it's time for the @aifunhouse Week in Review!

As always, it's been a wild week in #AI!

DreamBooth, Instant NeRF, Make a Video, and more ... let's get in!

🤖🧵👇🏽

1. First up: DreamBooth, a technique from Google Research. Originally applied to their Imagen model but generalizable to others, it fine-tunes text-to-image networks to generate consistent characters across contexts and styles. dreambooth.github.io

Sep 27, 2022 • 14 tweets • 7 min read

1. Last week @OpenAI released Whisper, an open source model for transcribing audio.

Let’s see how you can use Whisper + GPT-3 to quickly summarize text-heavy YouTube vids.

If you're new to ML this is a great tutorial to get hands on and play along.

openai.com/blog/whisper/

🧵

2. First we'll use Whisper to transcribe a video we want to summarize.

Whisper is open source and there are already multiple UIs on the web that allow you to run it on video and audio.

For YouTube videos, @jeffistyping created a super simple UI.

huggingface.co/spaces/jeffist…

Sep 26, 2022 • 13 tweets • 8 min read

Text-to-image networks generate amazing imagery, but out of the box, it's hard to define characters and styles that remain consistent across outputs.

The crew at @CorridorDigital has a great video showcasing how tools like DreamBooth can solve this.
🧵👇

DALL-E and Stable Diffusion (SD) have taken Twitter by storm because of the high quality visuals they generate from text prompts.

Within a prompt, you can refer to well-known people/objects (e.g. "Eiffel Tower") and place them in specific situations, or stylize them in new ways.
