Tweet

Dan Hendrycks

@DanHendrycks

Mar 29 • 7 tweets • 3 min read Twitter logo

As AI systems become more useful, people will delegate greater authority to them across more tasks.
AIs are evolving in an increasingly frenzied and uncontrolled manner. This carries risks as natural selection favors AIs over humans.

Paper: arxiv.org/abs/2303.16200 (🧵 below)

@geoffreyhinton

Other AI scientists have implicitly recognized that this could be an evolutionary struggle and that humans may become the new gorillas.
@geoffreyhinton “There is not a good track record of less intelligent things controlling things of greater intelligence.”

Jürgen Schmidhuber: “In the long run, humans are not going to remain the crown of creation... But that’s okay... you are a tiny part of a much grander scheme which is leading the universe from lower complexity towards higher complexity”

@elonmusk

Others like Google's co-founder Larry Page think that “that digital life is the natural and desirable next step in the cosmic evolution”
Page called @elonmusk a “speciesist” for being on the side of humans (which partially caused him to start OpenAI)

@ylecun

@ylecun argues oppositely: “because AI systems did not pass through the crucible of natural selection...[their] intelligence and survival are decoupled, and so intelligence can serve whatever goals we set for it.”
I argue that AIs will in fact be distorted by that crucible.

In the long run, I think AIs can be thought of an invasive species. I discuss ways to mitigate this existential risk in the paper.

Full argument and countermeasures:
arxiv.org/abs/2303.16200
Video explainer:

Paper abstract:

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @DanHendrycks

Dan Hendrycks

@DanHendrycks

Mar 31

More and more researchers think that building AIs smarter than us could pose existential risks. But what might these risks look like, and how can we manage them? We provide a guide to help analyze how research can reduce these risks.

Paper: arxiv.org/abs/2206.05862 (🧵below)

We review time-tested concepts from safety engineering and discuss how to apply these to advanced AI systems. We need to think of safety not just as a technical problem but also a societal problem, so we need to think about the broader sociotechnical system.

Let’s turn to possible failure modes.
Weaponization: AI can be repurposed to be highly destructive. As with nuclear and biological weapons, only one irrational or malevolent actor is sufficient to unilaterally cause harm on a massive scale.

Read 13 tweets

Dan Hendrycks

@DanHendrycks

Mar 14

Some impressions from using GPT-4 🧵

It knows many esoteric facts (e.g., the meaning of obscure songs, knows what area a researcher works in, can contrast ML optimizers like Adam vs AdamW like in a PhD oral exam, and so on).

My rule-of-thumb is that
"if it's on the internet 5 or more times, GPT-4 remembers it."

Since it gets 86.4% on our MMLU benchmark, that suggests GPT-4.5 should be able to reach expert-level performance.

GPT-2: Language Models are Unsupervised Multitask Learners
GPT-3: Language Models are Few-Shot Learners
GPT-4: Language Models are... Almost Omniscient

Read 6 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Dan Hendrycks

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @DanHendrycks

Dan Hendrycks

Dan Hendrycks

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!