tory.io 🦾
Dec 14, 2022 · 42 tweets · 15 min read
Will #ChatGPT replace humans?

Everyone’s got an opinion, but 99% of people don’t understand how it works

I wanted to learn so I took a 400-hour AI bootcamp

Here’s an explanation of how GPT-3 works that a 5-year-old can understand

Read it and make up your own mind

🧵

👇
2/

The following thread will answer five questions using little to no technical jargon:

• What is #AI?

• What are the benefits of AI?

• How does AI work? (in particular, GPT-3 and #ChatGPT)

• What are its Limitations?

• What’s the Long-Term Potential of AI?
3/

🔶 What is AI?

Virtually all “artificial intelligence” today is machine and/or deep learning

Machine and deep learning are advanced forms of pattern recognition

So when you hear the term “AI”, you should mentally substitute “advanced pattern recognition”
4/

🔶 What are the benefits of AI?

This isn’t to say that advanced pattern recognition isn’t highly useful

Indeed, computers can use it to:

• Identify pictures

• Recommend movies on Netflix

• Translate languages

• Drive cars

• Converse with humans

(and much more!)
5/

🔶 How does AI work?

To understand how AI works, let’s examine one of the most popular models today – GPT-3

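#GPT3 is a highly advanced “language model” created by #OpenAI. Models in the GPT-3 family are the engine behind products like ChatGPT (built on GPT-3.5) and the original Dall-E (a version of GPT-3 trained on text-image pairs)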

So what is a language model?
6/

Language models read billions of pages of text and find patterns between words and sentences

For example, the words “I warmed my bagel in the ____” are generally followed by words like “oven” or “microwave”

(you would almost never see “I warmed my bagel in the baseball”)
7/

Language models use their knowledge of these patterns to make predictions

So when you ask a language model to predict the next word in the phrase: “I warmed my bagel in the ____”, it will use probability to guess “oven” or “microwave”
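
The guessing step can be sketched in a few lines of Python. This is a toy illustration only: the five-sentence corpus and the function name `next_word_probabilities` are made up for this example, and a real language model learns from billions of pages using a neural network rather than simple counting.

```python
from collections import Counter

# Tiny made-up corpus; a real model would read billions of pages.
corpus = [
    "i warmed my bagel in the oven",
    "i warmed my bagel in the microwave",
    "i warmed my bagel in the oven",
    "i warmed my soup in the microwave",
    "i warmed my bagel in the oven",
]

def next_word_probabilities(prompt, sentences):
    """Count which words follow `prompt` and turn the counts into probabilities."""
    prompt_words = prompt.lower().split()
    n = len(prompt_words)
    counts = Counter()
    for sentence in sentences:
        words = sentence.split()
        # Slide over the sentence looking for the prompt, then record the next word.
        for i in range(len(words) - n):
            if words[i:i + n] == prompt_words:
                counts[words[i + n]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {word: count / total for word, count in counts.items()}

probs = next_word_probabilities("I warmed my bagel in the", corpus)
best_guess = max(probs, key=probs.get)  # the most probable next word
print(probs)
print(best_guess)
```

In this toy corpus “oven” follows the prompt three times out of four, so it wins. “Baseball” never follows it, so it gets no probability at all.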
8/

The cool thing about #GPT3 is that it can use the patterns it identified to perform related tasks:

• Differentiate word ordering: “I used the __ to warm my bagel”

• Answer questions: “Where should I warm my bagel?”

• Understand Synonyms: “I heated my bread in the __”
9/

GPT-3 does this using three innovations:

• Positional Encoding

• Attention

• Self-Attention

Let’s discuss each of these:
10/

🔹 Positional Encoding

The sequence of words in a sentence matters

For example, “I used the oven to warm my bagel” is roughly the same as “I warmed my bagel in the oven”

“I warmed my OVEN in the BAGEL”, however, has a very different (and nonsensical) meaning
11/

As such, when performing its analysis, GPT-3 “encodes” each word in a sentence to give it an understanding of its relative position

For example, “I am a Robot” is encoded as:

I = 0

Am = 1

A = 2

Robot = 3
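
The numbering above can be sketched in Python. This is a simplified illustration with a made-up helper name (`encode_positions`). It assumes a position is just a counter attached to each word, whereas a real model turns each position into a list of numbers that is combined with the word's own numbers.

```python
def encode_positions(sentence):
    """Pair every word with its position in the sentence, starting from 0."""
    return [(position, word) for position, word in enumerate(sentence.split())]

robot = encode_positions("I am a Robot")
print(robot)

# Two sentences built from exactly the same words, in a different order.
sensible = encode_positions("I warmed my bagel in the oven")
nonsense = encode_positions("I warmed my oven in the bagel")

same_words = sorted(w for _, w in sensible) == sorted(w for _, w in nonsense)
same_encoding = sensible == nonsense
print(same_words, same_encoding)
```

Without positions the two bagel sentences look identical (same bag of words). With positions attached they are different inputs, which is what lets the model tell them apart.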
12/

🔹 Attention

Once the order of a sentence is encoded, GPT-3 uses a process known as “attention”

Attention tells the AI what words it should focus on when determining patterns and relationships
13/

For example, take the sentence “Bark is very cute and he is a dog”

What does “he” refer to?

This is an easy question for a human to answer, but more difficult for an AI

Even though “and” and “is” are the closest words to “he”, they don’t give any context
14/

As such, AI needs to learn to “weight” different words in the sentence to learn what’s important

By analyzing billions of similar sentences, it can learn that “is”, “very”, “cute”, “and”, “is”, and “a” are NOT very important

But “Bark” and “dog” are

(Source: Arjun Sarkar in Towards Data Science)
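
The weighting idea can be sketched as follows. The relevance scores below are hypothetical numbers picked by hand for this example. In a real model they are computed from what the network has learned, and then squashed into weights that add up to 1 with a function called softmax, as done here.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    exps = [math.exp(s) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

words = ["Bark", "is", "very", "cute", "and", "he", "is", "a", "dog"]

# Hypothetical scores for how relevant each word is to "he" (made up for illustration).
scores = [3.0, 0.1, 0.2, 0.5, 0.1, 0.0, 0.1, 0.1, 2.5]

weights = softmax(scores)
ranked = sorted(zip(words, weights), key=lambda pair: pair[1], reverse=True)
top_two = [word for word, _ in ranked[:2]]

for word, weight in ranked:
    print(f"{word:>5}: {weight:.2f}")
```

With these scores “Bark” and “dog” take most of the weight, while filler words like “is” and “a” get only a sliver each.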
15/

This allows AI to find relationships even when the word order is different

For instance, through reading the sentence “I warmed my bagel in the oven” a million times, it has related the combination of the words “warm” and “bagel” to the word “oven”
16/

So rephrasing the statement “I warmed my bagel in the ___” into a question, “where should I warm my bagel”, is largely irrelevant from the AI’s point of view

Because it’s focusing on the words “warm” and “bagel”, it knows the next logical word is probably “oven”
17/

🔹Self-Attention

One of the coolest things about #GPT3 is that it can give what humans would describe as “context” to words

For instance, it can determine that “warmed” and “heated” are synonyms and that “bagel” and “bread” are highly related
18/

It does this through a process known as “self-attention”, which is the analysis of the relationship of words within a sentence

For instance, take the sentence:

“The King ordered his troops to transport his gold”

What does that tell us?
19/

#AI can glean a lot from this sentence

It can determine that Kings have authority (“ordered”), are rich (“gold”) and are male (“his”)

Because Emperors also have authority, are rich and are male, AI can determine that “Emperor” is a synonym for “King”

(Source: Codebasics)
20/

Similarly, a Queen has authority, is rich and is female

So AI can tell that King and Queen are related

In fact, from a mathematical perspective you could say that King – Man + Woman = Queen

(Source: Codebasics)
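
Here is a minimal sketch of that arithmetic, assuming every word is stored as a short list of numbers. The four traits (authority, wealth, male, female) and all of the values are hand-picked for illustration. A real model learns hundreds of such numbers per word on its own, and they are not neatly labeled like this.

```python
import math

# Hypothetical word vectors: [authority, wealth, male, female]
vectors = {
    "king":    [1.0, 1.0, 1.0, 0.0],
    "emperor": [1.0, 0.9, 1.0, 0.0],
    "queen":   [1.0, 1.0, 0.0, 1.0],
    "man":     [0.0, 0.0, 1.0, 0.0],
    "woman":   [0.0, 0.0, 0.0, 1.0],
}

def cosine(a, b):
    """Similarity of two vectors: close to 1 means they point the same way."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# King - Man + Woman, computed trait by trait.
target = [k - m + w for k, m, w in zip(vectors["king"], vectors["man"], vectors["woman"])]

# Look for the closest word, leaving out the three words used in the sum.
candidates = {w: v for w, v in vectors.items() if w not in ("king", "man", "woman")}
closest = max(candidates, key=lambda w: cosine(target, candidates[w]))

king_emperor = cosine(vectors["king"], vectors["emperor"])
print(closest)
print(round(king_emperor, 3))
```

Subtracting “man” removes the male trait, adding “woman” adds the female one, and what is left (authority, wealth, female) lands on “queen”. The same similarity measure also shows why “king” and “emperor” look like near-synonyms.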
21/

This allows #AI to understand that the sentence “I warmed my bagel in the oven” is similar to “I heated my bread in the oven”
22/

When you combine the concepts of positional encoding, attention and self-attention, language models can do some extremely impressive things

GPT-3 can:

• Create art works

• Write business plans

• Code apps

• Write novels

• Answer complicated questions
23/

In fact, one Twitter user - @tqbf - asked #ChatGPT to “write a biblical verse in the style of the king james bible explaining how to remove a peanut butter sandwich from a VCR” and got the following response
24/

🔶 Limitations of AI

While extremely impressive, it’s important to remember that GPT-3 is still just an advanced form of pattern recognition & probability

As such, asking it to write a 10K word novel is technically no different than asking it “where can I warm my bagel”
25/

In fact, ChatGPT isn’t even answering your questions, because it has no concept of what a question is

It’s simply using probability to find the next word in the sequence:

“What should I use to warm my bagel?”

It sees “warm” & “bagel” and knows that it’s probably “oven”
26/

If you really want to get technical, #GPT3 doesn’t even understand the concept of a “word”

Computers can only recognize binary so everything is translated into 0s and 1s
27/

So if you ask #AI to complete the sentence “How are ___”, it would see:

01001000 01101111 01110111 00100000 01100001 01110010 01100101
28/

From analyzing patterns, it knows that “How are” - 01001000 01101111 01110111 00100000 01100001 01110010 01100101 - is often followed by the following series of numbers:

00100000 01111001 01101111 01110101 00001010
29/

As you might have guessed, 00100000 01111001 01101111 01110101 00001010 translates to “ you” (a space, the word “you” and a line break)
30/

So the resulting “answer”:

01001000 01101111 01110111 00100000 01100001 01110010 01100101 00100000 01111001 01101111 01110101 00001010

Translates to “How are you”
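
The translation between text and 0s and 1s can be sketched as below, assuming plain 8-bit ASCII, where every character becomes one group of eight digits. The helper names `to_bits` and `from_bits` are made up for this example. Note that real models like GPT-3 first chop text into numbered chunks called tokens rather than reading raw ASCII, so this only illustrates the binary view used in this thread.

```python
def to_bits(text):
    """Write each character of `text` as 8 binary digits (ASCII assumed)."""
    return " ".join(format(byte, "08b") for byte in text.encode("ascii"))

def from_bits(bits):
    """Turn space-separated groups of 8 binary digits back into text."""
    return bytes(int(chunk, 2) for chunk in bits.split()).decode("ascii")

how_are_bits = to_bits("How are")
print(how_are_bits)

# The last group, 00001010, is a line break rather than a visible letter.
decoded = from_bits("01111001 01101111 01110101 00001010")
print(repr(decoded))

# A space is a character too, so it gets its own eight digits.
space_bits = to_bits(" ")
print(space_bits)
```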
31/

🔶 Long-Term Vision

Perhaps the biggest outstanding question is whether technologies like GPT-3 will lead to “artificial general intelligence” (#AGI)

That is, AI that can think & reason like a human (and even be conscious)

Many data scientists and AI researchers say no
32/

This isn’t because they don’t think AGI is possible, just that machine & deep learning aren’t the way to get there

They point to the fact that systems like #GPT3 aren’t really “thinking” per se, they’re just predicting the most statistically likely association of words
33/

In fact, many criticize deep learning systems for being:

• Greedy

• Brittle

• Opaque

Let’s dig into each
34/

🔹 Greedy

Deep learning networks require a LOT of data to learn

For instance, you might need to show AI tens to hundreds of thousands of pictures of cats before it can accurately identify a cat

In contrast, a human infant can identify a cat after seeing one or two pics
35/

One of the main problems with this is that the computing resources needed to train #AI are increasing at a much faster rate than the supply

MIT estimates that requirements double every 3-4 months (vs. Moore's Law, which states that supply doubles every 2 years)
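
To get a feel for that gap, here is a quick back-of-the-envelope sketch. It simply takes the two doubling rates quoted above at face value and compares them over a two-year window. The function name `growth_factor` and the 24-month window are choices made for this example.

```python
def growth_factor(months, doubling_period_months):
    """How many times larger something gets if it doubles every `doubling_period_months`."""
    return 2 ** (months / doubling_period_months)

months = 24  # compare both trends over two years

demand_fast = growth_factor(months, 3)   # requirements doubling every 3 months
demand_slow = growth_factor(months, 4)   # requirements doubling every 4 months
supply = growth_factor(months, 24)       # supply doubling every 2 years

print(f"Requirements grow {demand_slow:.0f}x to {demand_fast:.0f}x")
print(f"Supply grows {supply:.0f}x")
```

Under these assumptions, requirements grow 64x to 256x in the time it takes supply to merely double.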
36/

🔹Brittle

Deep learning networks often fail to do things that humans consider relatively simple

When prompted for images of a “horse riding an astronaut”, early versions of Dall-E 2 kept producing images of an astronaut riding a horse
37/

In fact, that’s why CAPTCHA tests exist

Even though #GPT3 can write a novel in the style of Hemingway or Faulkner, other deep learning networks can’t distinguish between a Chihuahua and a Blueberry Muffin
38/

While some of these mistakes are funny, they can also be dangerous

For instance, a Tesla almost ran over a roadside worker carrying a stop sign

While it knew what a person was and what a stop sign was, it didn’t know what to make of the combination of the two
39/

🔹 Opaque

Human beings can explain their decision-making process

Deep learning systems can’t

We don't know how they identify the patterns they do, and there's a good argument to be made that neither do they
40/

Yes, deep learning will get much better over time, but when asking yourself if it will become conscious, creative, etc… you need to ask the following question:

Can pattern recognition and statistical analysis lead to advanced intelligence, creativity, reasoning, etc…?
41/

Ultimately, one of the reasons this is so tough to answer is that we’re not entirely sure how humans think and reason, or even what makes us conscious

So I believe the debate over the future of AGI is ultimately philosophical, rather than technical

I hope you've found this thread helpful

I usually write on crypto - in particular deep fundamental analysis of Web3 protocols

If you're into that kind of thing follow me at @MTorygreen

If you like the AI stuff, give this a like / RT and let me know if you'd like to see more
