Dan Hendrycks Profile picture
Mar 29 7 tweets 3 min read Twitter logo Read on Twitter
As AI systems become more useful, people will delegate greater authority to them across more tasks.
AIs are evolving in an increasingly frenzied and uncontrolled manner. This carries risks as natural selection favors AIs over humans.

Paper: arxiv.org/abs/2303.16200 (🧵 below)  Forces that fuel selfishne...Darwinism and evolution app...
Other AI scientists have implicitly recognized that this could be an evolutionary struggle and that humans may become the new gorillas.
@geoffreyhinton “There is not a good track record of less intelligent things controlling things of greater intelligence.”
Jürgen Schmidhuber: “In the long run, humans are not going to remain the crown of creation... But that’s okay... you are a tiny part of a much grander scheme which is leading the universe from lower complexity towards higher complexity”
Others like Google's co-founder Larry Page think that “that digital life is the natural and desirable next step in the cosmic evolution”
Page called @elonmusk a “speciesist” for being on the side of humans (which partially caused him to start OpenAI)
@ylecun argues oppositely: “because AI systems did not pass through the crucible of natural selection...[their] intelligence and survival are decoupled, and so intelligence can serve whatever goals we set for it.”
I argue that AIs will in fact be distorted by that crucible.
In the long run, I think AIs can be thought of an invasive species. I discuss ways to mitigate this existential risk in the paper.

Full argument and countermeasures:
arxiv.org/abs/2303.16200
Video explainer:
Paper abstract: Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Dan Hendrycks

Dan Hendrycks Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @DanHendrycks

Mar 31
More and more researchers think that building AIs smarter than us could pose existential risks. But what might these risks look like, and how can we manage them? We provide a guide to help analyze how research can reduce these risks.

Paper: arxiv.org/abs/2206.05862 (🧵below) Image
We review time-tested concepts from safety engineering and discuss how to apply these to advanced AI systems. We need to think of safety not just as a technical problem but also a societal problem, so we need to think about the broader sociotechnical system. Image
Let’s turn to possible failure modes.
Weaponization: AI can be repurposed to be highly destructive. As with nuclear and biological weapons, only one irrational or malevolent actor is sufficient to unilaterally cause harm on a massive scale. Image
Read 13 tweets
Mar 14
Some impressions from using GPT-4 🧵
It knows many esoteric facts (e.g., the meaning of obscure songs, knows what area a researcher works in, can contrast ML optimizers like Adam vs AdamW like in a PhD oral exam, and so on).

My rule-of-thumb is that
"if it's on the internet 5 or more times, GPT-4 remembers it."
Since it gets 86.4% on our MMLU benchmark, that suggests GPT-4.5 should be able to reach expert-level performance.

GPT-2: Language Models are Unsupervised Multitask Learners
GPT-3: Language Models are Few-Shot Learners
GPT-4: Language Models are... Almost Omniscient
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(