Arun Krishnan 🇮🇳 Profile picture
Apr 24 24 tweets 7 min read Twitter logo Read on Twitter
A recent paper titled "Sparks of Artificial General Intelligence: Early experiments with GPT-4", published by Microsoft Research makes for very interesting reading. I have gone through the entire report and it IS TL;DR. Here is my summary of the same #GPT4 #AGI #LLMs 1/n Image
But before that, let's take a brief detour into what intelligence actually means. One of the definitions is that intelligence is a multifaceted and complex cognitive ability that involves the capacity to understand, learn, reason, solve problems, adapt to new situations, 2/n
think critically, and apply knowledge to different domains. But what does it mean to say that an artificial intelligence system is intelligent?
From the paper:
3/n Image
There are multiple other definitions of AGI:
Goal-oriented - Measures an agent's ability to achieve goals in a wide range of environments. However, it does not take into account systems that can carry out complex tasks without being motivated by a goal.
4/n
Skill-acquisition efficiency: Emphasises learning from experience.
Do anything a human can: However, there is no single, standard definition of human intelligence given the wide diversity.

5/n
In the paper, the authors propose an approach to study whether GPT4 is making progress towards AGI by sticking closer to traditional psychology. With that aim, they generated tasks and questions that were meant to push GPT4 beyond mere memorization and looked at testing GPT-4 6/n
across the following areas:
1. Its mastery of natural language by asking it to translate not just between languages but across tone, content, style and domain and to observe if it can manipulate complex concepts. Spoiler alert: It can!

7/n
2. Coding and Mathematics and its ability across tests
3. Testing its ability to plan as well as to learn from experience by having it play games and interact with tools.

8/n
4. Testing whether it can understand humans and making itself understandable by humans, i.e addressing the problem of explainability.

Having done this, as the authors say

9/n Image
Some interesting findings:

A key measure of intelligence is the ability to synthesize information from different domains or modalities and the capacity to apply knowledge and skills across different contexts or disciplines.

10/n
GPT-4 did some remarkable things like:

Producing javascript code which generates random images in the style of the painter Kandinsky
11/n Image
Find a proof that there are infinitely many prime numbers in the literary style of Shakespeare
Combining knowledge of history and physics by asking it to write a supporting letter fr Electron as a US presidential candidate, written by Mahatma Gandhi and addressed to his wife 12/n Image
Produce python code for a program that takes as an input a patient’s age, sex, weight, height and blood test results vector and indicates if the person is at increased risk for diabetes
There are numerous other examples of the model being able to combine diverse disciplines
13/n
So where there areas where it faltered?
Yes, most certainly. For example, with music. Read for yourself from the paper:
The model failed at any non-trivial form of harmony. As a musician, I find this VERY interesting. Does harmony actually require a higher order intelligence?14/n ImageImage
GPT-4 was able to carry out coding tasks fairly well. Not just at coming up with code, but also in understanding existing pieces of code and reverse-engineering assembly code.

However the model failed at more complicated mathematical problems, since, being a language model, 15/n
it is context-dependent. The authors look at three aspects of mathematical understanding, viz.,
Creative Reasoning - The ability to identify which arguments, intermediate steps, calculations or algebraic manipulations are likely to be relevant at each stage, 16/n
in order to chart a path towards the solution. - In this, the model demonstrates a high level of ability in choosing the right argument or path towards the solution.

17/n
Technical Proficiency - The ability to perform routine calculations or manipulations that follow a prescribed set of steps. The model falters quite often when doing this making simple arithmetic mistakes.

18/n
Critical Reasoning - Ability to critically examine each step of the argument, break it down into its sub-components, explain what it entails, how it is related to the rest of the argument and why it is correct. The model demonstrates a significant deficiency in this aspect. 19/n
What explains the limitations of GPT-4?
The authors attribute it to two main issues:
"Naive" attention mistakes (remember that attention, that wonderful element of the GPT models that reinforces the learning on a particular aspect in its context) but which act as limitations and
Its "linear thinking" as a next-token prediction machine
Since LLMs are based on the next-character or next-word prediction paradigms, their inherent limitations also impact on their generalizability to AGI.
From the paper:

21/n ImageImageImage
Interestingly, the authors borrow from Nobel Laureate Daniel Kahneman's seminal work on the fast and slow modes of thinking. Read for yourself as to what they say:
22/n Image
The paper has many more examples but suffice to say that while we are still far from AGI, GPT-4 does show that there is a concerted effort to start moving rapidly towards the same.
23/n
The authors having pointed out reasons for the model's less than optimal performance in dealing with specific classes of problems, propose directions for future research. In my opinion it is only a matter of time, not if, we move towards Artificial General Intelligence.

THE END

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Arun Krishnan 🇮🇳

Arun Krishnan 🇮🇳 Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ArunKrishnan_

Apr 24
Since today is the day to reminisce on one's Sachin encounters, let me share a story I have shared a few times earlier.
The year was 1989. Somewhere around February. I was in my 12th. A hostel student at MCC school where the MRF Pace foundation is located. It was my habit to 1/n
walk along the road that led past the pace foundation grounds to our school's back gate as I studied from my book. One day, as I was walking past the MRF nets, I noticed two people I recognized. One was @sachin_rt . the other was @jats72 . They were at the nets, getting ready 2/n
to practice against the MRF pace bowlers. Vivek Razdan was one of them. How did I recognize Sachin? Well, he was in the news those days, being one half of the duo that had shattered that partnership record in Mumbai. So yeah, it was easy for me to recognize him.

3/n
Read 5 tweets
Mar 4
12:30 AM: The phone dings silently as a new chat group is created. We sleep on.

4:30: OH wakes up and looks at her phone. A new group has been created by a childhood friend and includes her and just one other friend. Group Name: "My little baby is no more".

1/n
One single link to a news article.

5:00 AM. OH wakes me up and tells me that the elder daughter of her friend has passed away. I wake up, all confused. What? How?
The young lady, all of 18 had committed suicide.
We are both heartbroken. We know the young girl.
2/n
She and her sister have spent nights at our place playing with our girls.
We decide to go over. But hey, it is also the day that the board exams start for YO. So we don't tell her. Tell EO though, who comes and gives me a cuddle as I am waking up, knowing that I need it.
3/n
Read 8 tweets
Mar 3
For those who don't want to listen to this, here is my summary (with occasional notes. Might be a slightly longish thread so fair warning).

My comments are in parantheses.

1/n
First, the presentation template he uses is something I used like 8 years back. At least update karo Pitroda ji.
Starts off with "Indian democracy is under pressure"

"Idea of mobilisation - Idea of moving around is being constrained"

"Attack on basic structure of constitution"
"India is a Union of states - requires negotiation and conversation"

"Negotiation coming under attack"

Attack on minorities, Dalits and tribals

Attack on the press

Shutdown of Dissent

3/n
Read 35 tweets
Feb 19
There is something very poignant about the end of Bhagwan Krishna on this earth. The complete destruction of his clan and the submergence of Dwaraka. And his fatalistic acceptance of the fact that it was time for him to move on. The man, or God who was singularly
Responsible for charting the course of events in Bharatavarsha for over 80 years. His death at the hands of a hunter. The true Karmayogi, passing away into the ages, alone. Brings to mind those immortal words of Kannadasan...
"who is there at the end?"

If even the greats have to go it alone, what can us mere mortals look forward to?

It always brings a lump to my throat. The thought of an aged Krishna Vasudeva, reclining alone in the forest ...
Read 8 tweets
Feb 2
Thread 👇

I was listening to @smitaprakash and @ANI 's podcast with Lt. Gen Kuldeep Singh Brar, the commanding officer of Opn Bluestar, ony way in to work today. Caution: if you are a patriot and a nationalist and are moved to years by bravery, please don't listen & drive 1/n
It was a good reminder of what those days were like on Punjab in the early 80s. The lawlwssness. A limp police force. Khalistani terrorists calling the shots. Bhindranwale the big Kahuna. A timely reminder. 2/n
At one point just before the op, Gen Brat went around talking to his batallions, giving any of his men who didn't feel like going into the Golden Temple, th option of standing down.
The first 2 batallions no one stood up. In the 3rd one, a young sardar did. 3/n
Read 19 tweets
Oct 21, 2022
Thead:Britain using a Western Lens:

The situation in the tiny island nation in the north of continental Europe has taken a turn for the worse with the new Prime Minister, Liz Truss, having resigned.

Political instability is not new to this tiny nation with a bloody history 1/n
of conquest, pillage, plunder and exploitation.

The island which is a constitutional monarchy, recently lost its longest reigning monarch, Queen Elizabeth and that seems to have triggered a bout of infighting among the countries far right, Christian Nationalist politicians 2/n
also known as the Tories.

Sources say that with its currency depreciating against the dollar, its non-existent manufacturing base shrinking further and race riots in its cities, the tiny island nation, somewhat optimistically calling itself "Great" Britain has lost its sheen.3/n
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(