Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Yishan

@yishan

Feb 23 • 30 tweets • 4 min read • Read on X

Google’s Gemini issue is not really about woke/DEI, and everyone who is obsessing over it has failed to notice the much, MUCH bigger problem that it represents.

(1/n)

First, to recap: Google injected special instructions into Gemini so that when it was asked to draw pictures, it would draw people with “diverse” (non-white) racial backgrounds.

This resulted in lots of weird results where people would ask it to draw pictures of people who were historically white (e.g. Vikings, 1940s Germans) and it would output black people or Asians.

Google originally did this because they didn’t want pictures of people doing universal activities (e.g. walking a dog) to always be white, reflecting whatever bias existed in their training set.

This is not an unreasonable thing to do, given that they have a global audience. Maybe you don’t agree with it, but it’s not unreasonable.

Google most likely did not anticipate or intend the historical-figures-who-should-reasonably-be-white result.

We can argue about whether they were ok with that unexpected result, but the fact that they decided to say something about it and “do additional tuning” means they didn’t anticipate it and probably didn’t intend for that to happen.

If you have a woke/anti-woke axe to grind, kindly set it aside now for a few minutes so that you can hear the rest of what I’m about to say, because it’s going to hit you from out of left field.

Everyone is obsessed with woke-whatever because it is the culture war of the moment. So everyone thinks this is significant because Google is “captured by woke” or whatever.

No.

That’s not why this is important. This event is not significant for culture war reasons. You just think so because that’s all anyone is thinking about these days, and everyone has missed the real danger.

It doesn’t matter that it made a bunch of woke mistakes and showed you a bunch of black Nazis. It’s not important that free speech or diversity or inclusion or whatever is under attack or taking over or whatever.

This event is significant because it is major demonstration of someone giving a LLM a set of instructions and the results being totally not at all what they predicted.

It is demonstrating very clearly, that one of the major AI players tried to ask a LLM to do something, and the LLM went ahead and did that, and the results were BONKERS.

Do you remember those old Asimov robot stories where the robots would do something really quite bizarre and sometimes scary, and the user would be like WTF, the robot is trying to kill me, I knew they were evil!

And then Susan Calvin would come in, and she’d ask a couple questions, and explain, “No, the robot is doing exactly what you told it, only you didn’t realize that asking it to X would also mean it would do X2 and X3, these seemingly bizarre things.”

And the lesson was that even if we had the Three Laws of Robotics, supposedly very comprehensive, that robots were still going to do crazy things, sometimes harmful things, because we couldn’t anticipate how they’d follow our instructions?

In fact, in the later novels, we even see how (spoiler) the robots develop a “Zeroth Law” where they conclude that it’s a good idea to irradiate the entire planet so that people are driven off of it to colonize the galaxy.

And that’s the scenario where it plays out WELL…. in the end.

There’s a few short stories in between where people are realizing the planet is radioactive and it’s not very pleasant.

Are you getting it?

Woke drawings of black Nazis is just today’s culture-war-fad.

The important thing is how one of the largest and most capable AI organizations in the world tried to instruct its LLM to do something, and got a totally bonkers result they couldn’t anticipate.

What this means is that @ESYudkowsky has a very very strong point.

It represents a very strong existence proof for the “instrumental convergence” argument and the “paperclip maximizer” argument in practice.

@ESYudkowsky If this had been a truly existential situation where “we only get one chance to get it right,” we’d be dead.

Because I’m sure Google tested it internally before releasing it and it was fine per their original intentions. They probably didn’t think to ask for Vikings or Nazis.

@ESYudkowsky It demonstrates quite conclusively that with all our current alignment work, that even at the level of our current LLMs, we are absolutely terrible at predicting how it’s going to execute an intended set of instructions.

@ESYudkowsky When you see these kinds of things happen, you should not laugh.

@ESYudkowsky Every single comedic large-scale error by AI is evidence that when it is even more powerful and complex, the things it’ll do wrong will be utterly unpredictable and some of them will be very consequential.

@ESYudkowsky I work in climate change, I’m very pro-tech, and even I think the biggest danger would be someone saying to AI, “solve climate change.”

@ESYudkowsky Because there are already people who say “humans are the problem; we should have fewer humans” so it will be VERY plausible for an AI to simply conclude that it should proceed with the most expedient way to delete ~95% of humans.

@ESYudkowsky That requires no malice, only logic.

@ESYudkowsky Again, I will say this: any time you see a comedic large-scale error by AI, it is evidence that we do not know how to align and control it, that we are not even close.

@ESYudkowsky Because alignment is not just about “moral alignment” or “human values,” it is just about whether a regular user can give an AI an instruction and have it do exactly that, with no unintended results. You shouldn’t need to be Susan Calvin.

@ESYudkowsky I like robots, I like AI, but let’s not kid ourselves that we’re playing with fire here.

All right, would you like to help solve climate change? Read this:

terraformation.com/blog/why-you-s…

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @yishan

Yishan

@yishan

Nov 20, 2023

I am probably one of a small number of people who have had the chance to work directly with both @AdamDAngelo and @Sama and get to know them.

Here’s what you need to know about these two guys:

First, I worked with Adam as an engineer and then director of engineering while he was CTO at Facebook, and then later I did consulting work for Quora.

I worked with Sam when he helped me raised Reddit’s Series B round and served together with me on Reddit’s board. His firm (him and his brother Max) is also the lead investor in my company @TF_Global.

Read 17 tweets

Yishan

@yishan

Sep 11, 2023

Our society today is basically just about yelling incoherently about things without regard for facts instead of doing anything real…

… which is exactly what you’d expect from a gerontocracy, right?

What if the problem is not political polarization or lack of education or wokeness or fascism or any of those things but merely that our society is a reflection of the fact that our senior leaders are really old people?

Really old people don’t DO things, they just complain.

Trump is 77, Biden is 80, Mitch McConell is 81, and Pelosi is 83.

In your own family, does anyone you know who is that age lead the way with bold and concrete vision and clear solutions? Or do they just… sit around and talk?

Read 4 tweets

Yishan

@yishan

Jun 19, 2023

Every science and tech person who is currently on the bandwagon calling for the vaccine doctor to go on Joe Rogan to debate RFK should be ashamed of themselves.

If you care about the truth or science, that is the WORST possible thing you could be advocating for.

(thread)

The argument goes something like this:

“If you[the vax doctor]’re so sure that your position is right, you ought to be willing to go and defend it [on any podcast, like this one], otherwise your claims have no credibility.”

That SOUNDS like a good statement, but ironically it is perfect example of why it is wrong - because it is (as Plato would call it) “rhetoric’s oral spell” - or in simpler terms, it’s a nice turn of phrase, but it’s an idea with very weak merits.

Read 57 tweets

Yishan

@yishan

Jun 18, 2023

Yo, it's not the vaccines causing your autism, it's the glyphosate (RoundUp) used on all our crops, especially wheat:

Full PDF can be downloaded here:
mdpi.com/1099-4300/15/4…

Read 18 tweets

Yishan

@yishan

Jun 16, 2023

If you want to know the next big thing in "real atoms" investment macro-trends, I'll tell you right now.

(1/x)

It is WATER.

Specifically, solar-powered reverse-osmosis desalination.

Here is why...

Water is the source of ALL wealth. That fact is so basic that we have all but forgotten it.

Water is so essential that even in ancient times, we built megaprojects to ensure that entire populations would have low-cost access to clean running water.

Read 32 tweets

Yishan

@yishan

Jun 15, 2023

(US specific) You probably think YOUR news source is telling you the whole story, unlike the other side, right?

You're wrong.

France is looking to join BRICS.

Look it up. You will find this in ZERO US-based news sources.

All the news sources that reported on this are Indian, French, Chinese, Russian, Greek, etc, nothing from the US.

This is just one example.

There are things that neither (US) party wants you to know, because ultimately they both serve the same Americentric interests.

Taken 7 April 2023

Read 7 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Yishan

Try unrolling a thread yourself!

More from @yishan

Yishan

Yishan

Yishan

Yishan

Yishan

Yishan

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!