@SchmidhuberAI On open-source AI, “I signed this open letter by @laion_ai because I strongly favor the open-source movement. And I think it's also something that is going to challenge whatever big tech dominance there might be at the moment.”
A #StableDiffusion model trained on images of Japanese Kanji characters came up with “Fake Kanji” for novel concepts like Skyscraper, Pikachu, Elon Musk, Deep Learning, YouTube, Gundam, Singularity, etc.
This is similar to the “Fake Kanji” recurrent neural network experiments I did many years ago, when computers were 1000x less powerful :) Kind of fun to see updated results with modern diffusion models.
Excited to announce the release of Stable Diffusion 2.0!
Many new features in v2:
• Base 512x512 and 768x768 models trained from scratch with new OpenCLIP text encoder
• 4x upscaling text-guided diffusion model
• New “Depth2Image” functionality
The new SD2 base model is trained from scratch using the OpenCLIP-ViT/H text encoder (github.com/mlfoundations/…), with quality improvements over V1. It is then fine-tuned with v-prediction (arxiv.org/abs/2202.00512) to produce 768x768 images:
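If you want to try the 768x768 model from Python, here is a minimal sketch using the Hugging Face diffusers library — the model id, pipeline class, and scheduler choice are my assumptions for illustration, not something specified in the release notes:

```python
# Sketch only: assumed diffusers API and assumed repo id for the SD2 768x768 model.
import torch
from diffusers import StableDiffusionPipeline, EulerDiscreteScheduler

model_id = "stabilityai/stable-diffusion-2"  # assumption: repo id of the 768x768 v-prediction model

# Load the pipeline in half precision and swap in an Euler scheduler.
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe.scheduler = EulerDiscreteScheduler.from_config(pipe.scheduler.config)
pipe = pipe.to("cuda")

# Generate a single 768x768 image from a text prompt.
image = pipe(
    "a professional photograph of an astronaut riding a horse",
    height=768,
    width=768,
).images[0]
image.save("sd2_sample.png")
```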
A new 4x upscaling text-guided diffusion model enables resolutions of 2048x2048 (or even higher!) when combined with the new text-to-image models in this release.
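A rough sketch of chaining the upscaler onto a base-model output, again via diffusers — the pipeline class, repo id, and the 512x512 input file are assumptions on my part:

```python
# Sketch only: assumed diffusers upscaling pipeline and assumed repo id.
import torch
from diffusers import StableDiffusionUpscalePipeline
from PIL import Image

upscaler = StableDiffusionUpscalePipeline.from_pretrained(
    "stabilityai/stable-diffusion-x4-upscaler",  # assumption: repo id of the 4x upscaler
    torch_dtype=torch.float16,
).to("cuda")
upscaler.enable_attention_slicing()  # a 512x512 input at 4x is memory-hungry

# Assumed input: a 512x512 image from the base model; 4x upscaling gives 2048x2048.
low_res = Image.open("sd2_sample.png").resize((512, 512))
upscaled = upscaler(
    prompt="a professional photograph of an astronaut riding a horse",
    image=low_res,
).images[0]
upscaled.save("sd2_sample_4x.png")
```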
Tried some interesting prompts to test OpenAI’s new reduced-bias #dalle2 #dalle model that will generate images of people that more accurately reflect the diversity of the world’s population.
“Professional DSLR color photograph of British soldiers during the American Revolution”
The most interesting and viral images you see produced by text-to-image models are not merely the results of the deep learning models themselves, but rather the result of a complex feedback loop between a human neural net🧠 and an artificial neural net🤖.
🧵Thread👇
You can see this clearly, because the prompts behind images that go viral for one model often don’t “work” for another model.
The best images are chosen through evolutionary selection at the community level, and each image is the result of iterative human/model feedback:
From the #dallemini phenomenon, it’s also clear that the most viral content isn’t about particular art styles, or whether the model can produce high-quality images (a reflection of the training data), but rather whether the model can portray the cultural items that people talk about.