Why is AI bad at math? πŸ“

Machine learning models today are good at generating realistic-looking text (see GPT-3), images (VQGAN+CLIP), or even code (GitHub Copilot/Codex).

However, these models only learn to imitate, so the results often contain logical errors.

Thread πŸ‘‡
Simple math problems, like the ones 10-year-old kids solve, usually require several logical steps involving simple arithmetic.

The problem is that if the ML model makes a logical mistake anywhere along the way, it cannot recover and arrive at the correct answer.

πŸ‘‡
@OpenAI is now working on tackling this issue.

In their latest paper, they introduce so-called verifiers. The generative model generates 100 candidate solutions, and a verifier selects the one with the highest chance of being correct.

openai.com/blog/grade-sch…

πŸ‘‡
This strategy helps them get much better at solving simple math problems - almost on par with kids aged 9-12. However, the model still answers only about 55% of the problems correctly, so there is still some way to go.

It is an interesting research field though, so great to see progress there.

πŸ‘‡
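The sample-and-rank idea behind verifiers is simple enough to sketch in a few lines. Note that `generate_solution` and `verifier_score` below are hypothetical stand-ins for the paper's GPT-based generator and verifier models:

```python
import random

# Hypothetical stand-ins for the two models in the paper: a generator
# that samples one candidate solution, and a verifier that estimates
# the probability that a given solution is correct.
def generate_solution(problem: str) -> str:
    return f"candidate solution #{random.randint(0, 999)} for: {problem}"

def verifier_score(problem: str, solution: str) -> float:
    return random.random()

def solve_with_verifier(problem: str, num_samples: int = 100) -> str:
    # Sample many solutions, then keep the one the verifier trusts most.
    candidates = [generate_solution(problem) for _ in range(num_samples)]
    return max(candidates, key=lambda s: verifier_score(problem, s))
```

The point is that the generator only needs to be right *sometimes* - the verifier's job is to find the good sample among the 100.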
Thanks to @lacker for running and documenting a series of interesting experiments with GPT-3. The example in the first tweet is taken from there. Check all of them in this blog post:

lacker.io/ai/2020/07/06/…
Yes, this is a good point! AI is not bad at math; language models are still bad at math.

If you converted these problems to mathematical notation, they would be trivial for a computer to solve without any AI. The hard part is getting there...
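To make that concrete (a toy illustration of my own, not from the original thread): once the words are turned into notation, the arithmetic itself needs no AI at all.

```python
# "Ann has 5 apples, gives away 2, then buys 3 more. How many does she have?"
# Translating the words into notation is the hard part; evaluating it is not:
answer = 5 - 2 + 3
print(answer)  # 6
```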

I mostly agree with this and I really like your Scrabble example. It perfectly illustrates the point you are trying to make!

And I agree that AI is still far away from human intelligence.

That being said, I think you underestimate its abilities! πŸ‘‡

It's true that language models are trained to imitate human written text and that's why you see these stupid mistakes.

However, the fact that the same model can be used to assess if a statement is true or not shows that there is more to it than just imitation! πŸ‘‡
The English Scrabble player is able to imitate French Scrabble by remembering the words, but he isn't able to assess whether a particular sentence makes sense, right?

πŸ‘‡
And the human thought process for complicated tasks is somewhat similar. You play around with different possible solutions in your head and assess whether they are real solutions.

Or like brainstorming - people throw ideas around and discuss and assess them. πŸ‘‡
And maybe AI learns in a different way than humans, but humans also learn in different ways from each other. Imagine learning a scientific formula.

One person may learn it by heart, while another may learn how it is derived without memorizing the formula itself.
A third person may remember it by analogy with a formula in another field.

So, AI may find different ways to "learn" things that are unlike the human ways, but no less effective. I agree we are not there yet, though...
And last tweet - if you are interested in this topic, I recommend this podcast on what intelligence means!

More from @haltakov

9 Nov
How I made $3000 in 3 weeks selling AI-generated art πŸ’°

Last week I showed you how you can use VQGAN+CLIP to generate interesting images based on text prompts.

Now, I'll tell you how I sold some of these as NFTs for more than $3000 in less than 3 weeks.

Let's go πŸ‘‡
Background

I've been interested in NFTs for 2 months now and one collection I find interesting is @cryptoadzNFT. What's special about it is that the creator @supergremplin published all of the art in the public domain. This spurred the creation of many derivative projects.

πŸ‘‡
The Idea πŸ’‘

My idea was to use VQGAN+CLIP to create interesting versions of the CrypToadz. So, I started experimenting with my own toad #6741.

I took the original NFT image as a start and experimented a lot with different text prompts. The results were very promising!

πŸ‘‡
3 Nov
How to create art with Machine Learning? 🎨

You've probably seen these strangely beautiful AI-generated images on Twitter. Have you wondered how they are created?

In this thread, I'll tell you about a method for generating art with ML known as VQGAN+CLIP.

Let's jump in πŸ‘‡
Short History πŸ“œ

In January @OpenAI publicly released CLIP, which is a model that allows matching text to images.

Just days after that, some people like @advadnoun, @RiversHaveWings, and @quasimondo started experimenting using CLIP to guide the output of a GAN using text.

πŸ‘‡
OpenAI published an image generation model together with CLIP, called DALL-E, but without the full code and the pre-trained models.

The results from guiding StyleGAN2 or BigGAN with CLIP aren't as accurate as DALL-E, but they are weirdly artistic.



πŸ‘‡
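The guiding principle can be illustrated with a toy sketch (my own illustration - `embed` below is a dummy stand-in for CLIP's encoders, and the real VQGAN+CLIP method uses gradient ascent on the GAN's latent code rather than picking from a fixed candidate set):

```python
import math
import random

def embed(x: str) -> list:
    # Dummy stand-in for a CLIP encoder: deterministically maps text to a
    # fixed-size vector. Real CLIP embeds images and text into a shared space.
    rng = random.Random(x)
    return [rng.gauss(0, 1) for _ in range(8)]

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def clip_guided_pick(prompt: str, candidates: list) -> str:
    # VQGAN+CLIP maximizes the CLIP similarity between the generated image
    # and the text prompt. This toy version just picks the closest candidate,
    # which is enough to show the "text guides the image" principle.
    target = embed(prompt)
    return max(candidates, key=lambda c: cosine_similarity(embed(c), target))
```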
2 Nov
Creators only get badges πŸ…

There is a problem with how value is distributed in online communities today. It seems we take the status quo for granted and don't discuss it much.

The people who create most of the value get none of the money! Only badges...

Thread πŸ‘‡
Online communities

I'm talking about platforms like Twitter, Reddit, Stack Overflow etc. They're wonderful places, where you can discuss interesting topics, get help with a problem, or read the latest news.

However, the people that make them truly valuable receive nothing πŸ‘‡
It usually looks like this:

β–ͺ️ Company creates a web 2.0 platform
β–ͺ️ Users create content and increase the value
β–ͺ️ Company aggregates the demand
β–ͺ️ Company monetizes with ads and subscriptions
β–ͺ️ Company gets lots of money
β–ͺ️ Creators get badges, karma and virtual gold

πŸ‘‡
13 Oct
Machine Learning Formulas Explained! πŸ‘¨β€πŸ«

This is the formula for the Binary Cross Entropy Loss. This loss function is commonly used for binary classification problems.

It may look super confusing, but I promise you that it is actually quite simple!

Let's go step by step πŸ‘‡
The Cross-Entropy Loss function is one of the most used losses for classification problems. It tells us how well a machine learning model classifies a dataset compared to the ground truth labels.

The Binary Cross-Entropy Loss is a special case when we have only 2 classes.

πŸ‘‡
The most important part to understand is this one - this is the core of the whole formula!

Here, Y denotes the ground-truth label, while Ŷ is the predicted probability of the classifier.

Let's look at a simple example before we talk about the logarithm... πŸ‘‡
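To make the formula concrete, here is a minimal plain-Python sketch of the Binary Cross-Entropy Loss (my own illustration, not code from the thread):

```python
import math

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    # BCE = -(1/N) * sum( Y*log(Ŷ) + (1-Y)*log(1-Ŷ) )
    # y_true: ground-truth labels (0 or 1), y_pred: predicted probabilities.
    total = 0.0
    for y, p in zip(y_true, y_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += y * math.log(p) + (1 - y) * math.log(1 - p)
    return -total / len(y_true)

# A confident correct prediction gives a small loss,
# a confident wrong prediction gives a large one:
loss_good = binary_cross_entropy([1, 0], [0.9, 0.1])
loss_bad = binary_cross_entropy([1, 0], [0.1, 0.9])
```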
21 Sep
There are two problems with ROC curves

❌ They don't work for imbalanced datasets
❌ They don't work for object detection problems

So what do we do to evaluate our machine learning models properly in these cases?

We use a Precision-Recall curve.

Another one of my threads πŸ‘‡
Last week I wrote another detailed thread on ROC curves. I recommend that you read it first if you don't know what they are.



Then go on πŸ‘‡
❌ Problem 1 - Imbalanced Data

ROC curves measure the True Positive Rate (also known as Recall or Sensitivity). So, if you have an imbalanced dataset, the ROC curve will not tell you if your classifier completely ignores the underrepresented class.

More details:

πŸ‘‡
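A toy sketch of my own shows why precision matters here: a useless classifier that labels everything positive gets a perfect True Positive Rate, but its precision exposes it.

```python
def precision_recall(y_true, y_pred):
    # Count true positives, false positives and false negatives.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0  # recall == TPR
    return precision, recall

# Imbalanced data: 2 positives, 98 negatives.
y_true = [1] * 2 + [0] * 98
y_pred = [1] * 100  # a classifier that just predicts everything positive

p, r = precision_recall(y_true, y_pred)
# recall (TPR) is 1.0, but precision is only 0.02
```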
20 Sep
How to spot fake images of faces generated by a GAN? Look at the eyes! πŸ‘οΈ

This is an interesting paper that shows how fake images of faces can be easily detected by looking at the shape of the pupil.

The pupils in GAN-generated images are usually not round - see the image!

πŸ‘‡
Here is the actual paper. The authors propose a way to automatically identify fake images by analyzing the pupil's shape.

arxiv.org/abs/2109.00162
The bad thing is, GANs will probably quickly catch up and include an additional constraint for pupils to be round...
