Generally speaking, NNs are trained on examples to produce predictions based on some input values.
The example data (input + desired output) draws a curve that the NN is trying to fit.
In a nutshell, NNs are, like most ML tools, a fancy way to fit the curves inherently drawn by the examples used to train them.
The more inputs it needs, and the more outputs it produces, the higher the dimension of the curve.
The simplest curve we can fit is... a line!
Fitting a line is known to Mathematicians & Statisticians as LINEAR REGRESSION.
The equation of a line is:
𝐲 = 𝐱𝐰 + 𝐛
where:
🔸 𝐰: Slope
🔸 𝐛: Y-intercept
Fitting a line means finding the 𝐰 & 𝐛 of the line that best fits the input data!
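If you are curious what finding 𝐰 & 𝐛 looks like in practice, here is a minimal NumPy sketch using the textbook least-squares formulas (the name `fit_line` and the data are my own, purely illustrative):

```python
import numpy as np

def fit_line(x, y):
    # Least-squares estimates for y ≈ x*w + b:
    # slope w = cov(x, y) / var(x), intercept b = mean(y) - w * mean(x).
    w = np.cov(x, y, bias=True)[0, 1] / np.var(x)
    b = y.mean() - w * x.mean()
    return w, b

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.1, 2.9, 5.2, 6.8])  # noisy samples of roughly y = 2x + 1
w, b = fit_line(x, y)
print(w, b)  # ~1.94, ~1.09
```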
To find which line better fits the training data, we need to define what "better" means first.
There are many ways to measure "linear fitness", and they all take into account how close each point is to the line.
The RMSE (Root Mean Square Error) is a very popular metric.
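As a quick sketch (reusing the made-up data from above), RMSE is just three operations: square the residuals, average them, take the root:

```python
import numpy as np

def rmse(y_true, y_pred):
    # Square each point's distance from the line, average, take the square root.
    return np.sqrt(np.mean((y_true - y_pred) ** 2))

x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([1.1, 2.9, 5.2, 6.8])
print(rmse(y, 2.0 * x + 1.0))  # error of the candidate line y = 2x + 1
```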
On top of the "traditional" algebraic form (𝐲 = 𝐱𝐰 + 𝐛), let's introduce a more "visual" way to represent equations.
💡 NETWORKS allow us to better see the relationships between the parts.
It will be important later, trust me!
LINEAR regression, however, only works well with LINEAR data.
❓ What if our data is "binary" instead? 🤔
This is common with many decision-making problems.
For instance:
🔸 𝐱: the room temperature 🌡️
🔸 𝐲: either 0 or 1, to turn the fan ON/OFF
If we try to naively use LINEAR REGRESSION to fit binary data, we will likely get a line that passes through both sets of points.
The example below shows the "best" fitting line, according to RMSE.
It's a bad fit. ❌
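You can reproduce the problem with a hypothetical version of the fan dataset (the temperatures here are invented for illustration):

```python
import numpy as np

# Hypothetical training data: room temperature (°C) -> fan OFF (0) / ON (1).
temp = np.array([15.0, 17.0, 19.0, 25.0, 27.0, 29.0])
fan = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])

w, b = np.polyfit(temp, fan, 1)   # degree-1 least-squares fit
print(np.round(w * temp + b, 2))  # gradual values that even overshoot 0 and 1
```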
❓ Can we "fix" LINEAR REGRESSION? 🤔
In *this* special case, we can!
Let's find a DIFFERENT line. Not the one that BEST FITS the data, but the one that BEST SEPARATES the data.
So that:
🔹 When 𝐲 ≤ 0, we return 0 (turn fan OFF 🔴)
🔹 When 𝐲 > 0, we return 1 (turn fan ON 🔵)
To do that, we need to update our MODEL:
𝐲 = 𝐬(𝐱𝐰 + 𝐛)
where 𝐬() is the HEAVISIDE STEP function.
That will be the ACTIVATION FUNCTION of our network.
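Here is what that model can look like in code, a minimal sketch where the weights (switch ON above 22 °C) are hand-picked by me, not trained:

```python
import numpy as np

def heaviside(z):
    # Step activation: 0 when z <= 0, 1 when z > 0.
    return np.where(z > 0, 1.0, 0.0)

def perceptron(x, w, b):
    # y = step(x*w + b): the line is squashed into a hard 0/1 decision.
    return heaviside(x * w + b)

temp = np.array([15.0, 19.0, 25.0, 29.0])
print(perceptron(temp, 1.0, -22.0))  # [0. 0. 1. 1.] -> fan ON above 22 °C
```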
Other commonly used AFs are:
🔹 Sigmoid
🔹 Tanh
🔹 Rectified Linear Unit (ReLU)
🔹 Leaky ReLU
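For reference, here is a sketch of those four AFs in NumPy (these are the standard textbook definitions, nothing specific to this thread):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))       # smooth squash into (0, 1)

def tanh(z):
    return np.tanh(z)                     # smooth squash into (-1, 1)

def relu(z):
    return np.maximum(0.0, z)             # 0 below zero, identity above

def leaky_relu(z, alpha=0.01):
    return np.where(z > 0, z, alpha * z)  # small slope instead of a flat 0
```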
Ultimately, the ACTIVATION FUNCTION is where the magic happens, because it adds NON-LINEARITY to our MODEL. ✨
This gives us the power to fit virtually any type of data! 🔮
This is (more or less!) what a PERCEPTRON is: the grandfather of modern Neural Networks. 🧠
Now that we have PERCEPTRONs, let's see how we can use them as the building blocks of more complex networks.
For instance, let's imagine more complex training data ("ternary" data? 🤔).
A perceptron can only fit 2/3 of the data.
So, why not use... THREE of them?
The FIRST perceptron fits the first 2/3 of the data:
1️⃣ 🔴🔵 ⚫️
The SECOND perceptron fits the last 2/3 of the data:
2️⃣ ⚫️ 🔵🔴
What's left to do now is to use a THIRD perceptron to merge the first two (whose outputs are 𝐲₁ & 𝐲₂):
3️⃣ 𝐲 = (𝐲₁ + 𝐲₂)/2 - 0.5
This is a better view of the resulting network, with each colour indicating a different perceptron.
Pretty neat, right?
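Here is the whole three-perceptron trick as a runnable sketch; the thresholds (20 and 26) are hand-picked by me for illustration, not trained:

```python
import numpy as np

def step(z):
    return np.where(z > 0, 1.0, 0.0)

def bump_network(x):
    y1 = step(x * 1.0 - 20.0)         # 1️⃣ ON for x > 20
    y2 = step(x * -1.0 + 26.0)        # 2️⃣ ON for x < 26
    return step((y1 + y2) / 2 - 0.5)  # 3️⃣ merges the two (an AND)

x = np.array([15.0, 22.0, 24.0, 30.0])
print(bump_network(x))  # [0. 1. 1. 0.]: ON only in the middle band
```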
Training a network like this requires finding the 7 PARAMETERS so that our model fits the training data best.
Modern NNs can have MILLIONS of parameters. 🤯
If we translate that network back into its equation, you can immediately see how messy that looks.
You probably would have never come up with this yourself. But when you think in terms of curve fitting, it becomes much easier to understand.
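To make "finding some numbers" concrete, here is a deliberately naive random search over the 7 parameters (real training uses gradient-based optimisers, but the goal, minimising the error over the examples, is exactly the same):

```python
import numpy as np

def step(z):
    return np.where(z > 0, 1.0, 0.0)

def network(x, p):
    w1, b1, w2, b2, v1, v2, c = p  # the 7 parameters
    y1 = step(x * w1 + b1)
    y2 = step(x * w2 + b2)
    return step(y1 * v1 + y2 * v2 + c)

# Toy "ternary" data: ON only in the middle band.
x = np.array([15.0, 18.0, 22.0, 24.0, 28.0, 31.0])
y = np.array([0.0, 0.0, 1.0, 1.0, 0.0, 0.0])

rng = np.random.default_rng(0)
best_p, best_err = None, np.inf
for _ in range(20000):
    p = rng.uniform(-40.0, 40.0, size=7)              # guess all 7 numbers
    err = np.sqrt(np.mean((network(x, p) - y) ** 2))  # RMSE again
    if err < best_err:
        best_p, best_err = p, err
print(best_err)  # shrinks as the search runs; hopeless with millions of params
```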
At this point, you might wonder...
❓ What does all of this have to do with WHY Neural Networks are so effective? 🤔
Because we have just built an AND gate!
Likewise, we can also build OR and NOT gates, de facto proving that NNs can emulate ANY logic circuit; add a way to store state (i.e. recurrence) and they become TURING COMPLETE! 🖥️
This means they can perform ANY computation that a more "traditional" computer can. 🖥️
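Here is a sketch of those gates as single perceptrons, with the usual hand-picked textbook weights:

```python
import numpy as np

def step(z):
    return np.where(z > 0, 1.0, 0.0)

def AND(a, b): return step(a + b - 1.5)  # 1 only when both inputs are 1
def OR(a, b):  return step(a + b - 0.5)  # 1 when at least one input is 1
def NOT(a):    return step(0.5 - a)      # inverts its input

for a in (0, 1):
    for b in (0, 1):
        print(a, b, "->", int(AND(a, b)), int(OR(a, b)), int(NOT(a)))
```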
To continue our analogy with CURVE FITTING, it means that Neural Networks have the potential to fit ANY curve in ANY number of dimensions, with as much precision as you want. 🤯
Any arbitrary 2D curve can potentially be recreated by a NN, in just three steps:
1️⃣ Slice the original shape into thin sections 🔪
2️⃣ Fit each section with a perceptron (AND)
3️⃣ Use a perceptron to merge all sections (OR)
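Below is a toy version of that recipe (my own construction, with sin(2πx) as an arbitrary example curve): each slice becomes a rectangular "bump" built from two steps and an AND, and summing the bumps plays the role of the OR:

```python
import numpy as np

def step(z):
    return np.where(z > 0, 1.0, 0.0)

def bump(x, lo, hi, height):
    # 2️⃣ AND of two step perceptrons: a rectangle of the given height on (lo, hi).
    return height * step(step(x - lo) + step(hi - x) - 1.5)

def approximate(x, f, n=50):
    # 1️⃣ slice [0, 1] into n thin sections; 3️⃣ merge all the bumps.
    edges = np.linspace(0.0, 1.0, n + 1)
    mids = (edges[:-1] + edges[1:]) / 2
    return sum(bump(x, lo, hi, f(m))
               for lo, hi, m in zip(edges[:-1], edges[1:], mids))

f = lambda t: np.sin(2 * np.pi * t)    # any curve you like
x = np.arange(0.07, 1.0, 0.14)         # a few sample points
print(np.round(approximate(x, f), 2))  # a staircase version of sin(2πx)
```

Increasing n makes the slices thinner and the staircase closer to the true curve: that's the "as much precision as you want" part.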
You can see here that very same principle applied to the design of a Neural Network.
This NN now has 33 parameters to fit, meaning that our search problem takes place in a 33-dimensional space.
That is nothing compared to the millions of parameters many modern NNs have.
This is, in a nutshell, what Machine Learning is really about.
Making decisions...
...by learning from examples...
...by fitting a curve...
...by finding some numbers...
...that minimise the error of our model over a set of examples.