aifunhouse
Tutorials, templates, and tricks for making #GenerativeAI do your bidding and not the other way around! 🤖🦾🤩 Living in the future since 2028
Dec 3, 2022 14 tweets 6 min read
🧵Autocomplete your way to a business like it's 2025

1. Yesterday, @Noapinion + @tszzi posted a thoughtful take on where AI is headed – autocomplete for everything.

Inspired by their optimism, we used ChatGPT to prompt our way to a new side hustle.

2. On Wednesday @openai released ChatGPT, an eerily competent chatbot that sent Twitter abuzz with countless screenshots and demo threads.

Since the OpenAI team seems to know a thing or two about running an online business, we figured their chatbot would too.

Lo and behold👇
Nov 30, 2022 11 tweets 7 min read
🧵AI-assisted image prompts

1. In the last week, both @stabilityai and @openai have released major updates to their flagship offerings.

In this thread, we use OpenAI's new #ChatGPT model to help talk us through generating prompts for StabilityAI's new #stablediffusion v2 model.

2. ChatGPT is a new GPT-3.5-powered model focused on conversing with users.

As with InstructGPT (the current default GPT-3 model), @openai leveraged Reinforcement Learning from Human Feedback (RLHF) to boost the reliability and desirability of responses.

openai.com/blog/chatgpt/
Nov 11, 2022 11 tweets 5 min read
🧵Categorizing complaints to Elon with AI

1. It's a tumultuous time here in tweet town.

For the good of the community, we used language embeddings to help @elonmusk and his team get a bird's-eye view of the 45k+ complaints he recently solicited👇

2. How do you categorize 45k+ replies to an @elonmusk tweet?

STEP 1: Use Twitter's API to pull all 45k replies, sorted by # of likes.

STEP 2: Show GPT-3 a sampling of top replies and ask it to suggest some categories.

STEP 3: Use text embeddings to categorize all the replies.
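
Step 3 can be sketched roughly as follows. This is a minimal illustration, not the thread's actual code: `toy_embed` is a hypothetical bag-of-words stand-in for a real text-embedding model, and the category names and replies are made up for the example. The idea is to embed each category description and each reply, then assign every reply to the category whose embedding has the highest cosine similarity.

```python
import math
import re
from collections import Counter


def toy_embed(text):
    """Hypothetical stand-in for a real embedding model: a word-count vector."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorize(replies, categories, embed=toy_embed):
    """Assign each reply to the category with the most similar embedding."""
    cat_vecs = {name: embed(desc) for name, desc in categories.items()}
    return {
        reply: max(cat_vecs, key=lambda name: cosine(embed(reply), cat_vecs[name]))
        for reply in replies
    }


# Made-up categories (in the thread, GPT-3 suggests these in Step 2).
categories = {
    "bots": "spam bots fake accounts scams",
    "moderation": "bans suspensions moderation censorship",
}
replies = [
    "Too many spam bots in my replies",
    "My account got hit with unfair suspensions",
]
labels = categorize(replies, categories)
```

With a real embedding model, only `embed` would change; the nearest-category logic stays the same.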
Nov 6, 2022 11 tweets 10 min read
🧵Automated image generation with DALL-E

1. A few days ago @OpenAI announced that #DALLE users could create images programmatically via API.

To celebrate, we took a vintage shot of @eerac, circa 2012, and threw in a healthy assortment of fully automated AI whimsy.

2. Until two days ago, DALL-E users had only ever been able to generate images through @OpenAI's website, labs.openai.com

DALL-E delivers great image quality, but without being able to use it programmatically, it's cumbersome to incorporate into automated workflows.
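
As a rough sketch of what programmatic access looks like, the snippet below builds a request for OpenAI's image generation endpoint using only Python's standard library. The endpoint URL and the `prompt`, `n`, and `size` fields reflect our understanding of the API and should be checked against the current API reference; the helper names, the prompt, and the placeholder key are made up for illustration.

```python
import json
import urllib.request

# Assumed endpoint; verify against OpenAI's API reference before relying on it.
IMAGES_URL = "https://api.openai.com/v1/images/generations"


def build_image_request(prompt, api_key, n=1, size="1024x1024"):
    """Build (but do not send) an HTTP POST asking for `n` images of `prompt`."""
    payload = json.dumps({"prompt": prompt, "n": n, "size": size}).encode("utf-8")
    return urllib.request.Request(
        IMAGES_URL,
        data=payload,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def fetch_image_urls(request):
    """Send the request and return image URLs (needs network access and a valid key)."""
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return [item["url"] for item in body["data"]]


# Placeholder key: nothing is sent until fetch_image_urls is called.
request = build_image_request("a vintage portrait as a comic book cover", "YOUR_API_KEY", n=4)
```

Keeping request construction separate from sending makes it easy to loop over many prompts in an automated workflow and to inspect each request before spending credits.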
Nov 2, 2022 11 tweets 6 min read
🧵Chained Prompts + Web Lookup

1. If you've used GPT-3, you likely asked it to perform a task using a single "prompt" or instruction.

GPT-3's ability to perform tasks in one shot is impressive, but tools like @dust4ai will make it far more powerful.

2. When you prompt GPT-3 to perform a task (e.g. "write a series of tweets about elephants"), it may be more reliable to perform the task in multiple steps:

1: Write a short paragraph about elephants
2: Divide the following paragraph into a series of ~250 character tweets
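
The two-step approach above can be sketched as a simple prompt chain. This is only an illustration: `complete` stands for any function that sends a prompt to a language model and returns its text, and `fake_model` is a hypothetical stub used so the example runs without an API key.

```python
def chain_prompts(complete, topic):
    """Run two prompts in sequence, feeding the first output into the second."""
    # Step 1: ask for a short paragraph on the topic.
    paragraph = complete(f"Write a short paragraph about {topic}.")
    # Step 2: ask the model to split that paragraph into tweets.
    tweets = complete(
        "Divide the following paragraph into a series of ~250 character tweets:\n\n"
        + paragraph
    )
    return paragraph, tweets


def fake_model(prompt):
    """Hypothetical stub: returns a canned reply instead of calling a real model."""
    if prompt.startswith("Write"):
        return "Elephants are the largest land animals."
    # Pretend to "divide" the paragraph by echoing it back as a single tweet.
    return "1/ " + prompt.split("\n\n", 1)[1]


paragraph, tweets = chain_prompts(fake_model, "elephants")
```

Swapping `fake_model` for a function that calls a real model leaves the chain itself unchanged, which is the main appeal of splitting a task into steps.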
Oct 19, 2022 13 tweets 9 min read
1. Text-based image editing is coming in hot!

Between #DALLE, #midjourney and #stablediffusion, text-to-image generation is all the rage, but what if your images are off target?

Usually you try new prompts, then use inpainting to make edits... until now.

2. This month multiple papers have come out demonstrating how language can be used to edit an image.

Equally exciting, these papers are being implemented on top of #stablediffusion since it's now openly available (Thanks @EMostaque!)

Let's review some of the ⚡️fast progress...
Oct 17, 2022 12 tweets 5 min read
1. In 2022, text-to-image tech has improved dramatically.

Heading into 2023, text-to-mesh, text-to-video, and text-to-audio models have all been demonstrated.

Today we play fortuneteller and explain how in 2023 you'll likely be able to create full 3D characters from text.

🧵

2. To create a 3D character from text, you'll need to combine a collection of building blocks.

1) You need to create a rigged 3D mesh.
2) You need to define the appearance of the character.
3) You need to define movements/animations.
4) You'll likely want some kind of voice.
Oct 2, 2022 18 tweets 11 min read
1. Hey there young Spielbergs!

Curious about how AI can be used for filmmaking?

It's still early days, but between text-to-image, text-to-audio and AI-driven animation, building blocks are starting to appear.

Today's thread provides an overview.
🧵👇

2. First off, some highlights!

Way back in July @mrjonfinger used @midjourney to produce a coherent short film.

Very solid visuals, but the voices and animation are a bit stilted. We had to rewatch to grok the plot, but it's 100% there once you get it.

Oct 1, 2022 15 tweets 8 min read
1. Last week @openai finally removed the waitlist for DALL-E!

Of the AI image creation tools out there, DALL-E is arguably the most accessible.

Today's thread provides a hands-on tutorial for new DALL-E users looking to generate visually consistent assets.

Sticker Time!

2. If you've never used DALL-E, head on over and sign up now!

First announced in April, DALL-E is the OG text-to-image ML model. For months, access was extremely limited, but now everyone, including you, can log in and take this dream machine for a spin.

labs.openai.com
Sep 30, 2022 19 tweets 6 min read
It's Friday and that means it's time for the @aifunhouse Week in Review!

As always, it's been a wild week in #AI!

DreamBooth, Instant NeRF, Make a Video, and more ... let's get in!

🤖🧵👇🏽

1. First up: DreamBooth, a technique from Google Research for fine-tuning text-to-image networks so they can generate consistent characters across contexts and styles. It was originally applied to Google's Imagen model but generalizes to other models. dreambooth.github.io
Sep 27, 2022 14 tweets 7 min read
1. Last week @OpenAI released Whisper, an open source model for transcribing audio.

Let’s see how you can use Whisper + GPT-3 to quickly summarize text-heavy YouTube vids.

If you're new to ML this is a great tutorial to get hands on and play along.

openai.com/blog/whisper/

🧵

2. First we'll use Whisper to transcribe a video we want to summarize.

Whisper is open source and there are already multiple UIs on the web that allow you to run it on video and audio.

For YouTube videos, @jeffistyping created a super simple UI.

huggingface.co/spaces/jeffist…
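
For readers who would rather run Whisper locally, here is a minimal sketch of the same transcribe-then-summarize pipeline. It assumes the open source `openai-whisper` Python package is installed and that the video's audio is already saved as a file; the file name, the `base` model size, the character cut-off, and the prompt wording are assumptions for illustration, not taken from the thread.

```python
def transcribe(audio_path, model_name="base"):
    """Transcribe an audio file with Whisper (requires `pip install openai-whisper`)."""
    import whisper  # imported lazily so the prompt helper below works without it

    model = whisper.load_model(model_name)
    return model.transcribe(audio_path)["text"]


def build_summary_prompt(transcript, max_chars=6000):
    """Wrap a (possibly truncated) transcript in a summarization prompt for GPT-3."""
    # Crude character cut-off to keep the prompt short; an assumption, not a real token limit.
    excerpt = transcript[:max_chars].strip()
    return "Summarize the following transcript in a few bullet points:\n\n" + excerpt


# Example with a stand-in transcript; swap in transcribe("talk.mp3") for real audio.
prompt = build_summary_prompt("Welcome to the show. Today we discuss elephants.")
```

The resulting `prompt` string is what you would paste into (or send to) GPT-3 to get the summary.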
Sep 26, 2022 13 tweets 8 min read
Text-to-image networks generate amazing imagery, but out of the box, it's hard to define characters and styles that remain consistent across outputs.

The crew at @CorridorDigital has a great video showcasing how tools like DreamBooth can solve this.
🧵👇
DALL-E and Stable Diffusion (SD) have taken Twitter by storm because of the high-quality visuals they generate from text prompts.

Within a prompt, you can refer to well-known people/objects (e.g. "Eiffel Tower") and place them in specific situations, or stylize them in new ways.