Here's the lowdown: the output of a neural network is calculated using the weights of the edges that connect the nodes in the network.
So, you gotta find the optimal values of weights to minimize the final error on training data examples.
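To make that concrete, here's a tiny NumPy sketch of a single neuron's output (the numbers and names are mine, just for illustration):

```python
import numpy as np

# One toy neuron: output = activation(weighted sum of inputs).
x = np.array([0.5, -1.2])          # input sample
w = np.array([0.8, 0.3])           # edge weights: these are what training tunes
b = 0.1                            # bias term

z = np.dot(w, x) + b               # weighted sum over the incoming edges
output = 1.0 / (1.0 + np.exp(-z))  # sigmoid activation
print(output)
```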
1οΈβ£ We start by assigning random values to all weights in the network
2οΈβ£ Then, for every input sample, we perform a feedforward operation to calculate the final output and the prediction error.
3οΈβ£ This predicted error is propagated backwards through the network, where it's used to adjust the weights.
4οΈβ£ We repeat this process until the error falls below the desired threshold.
👇👇 This is where backpropagation comes in 👇👇
A simple four-step process is at the heart of it all: 1) Error calculation 2) Propagation 3) Weight adjustment 4) Repeat
Let's go into more detail:
1) Error calculation
The error of the network's output is calculated using a loss function.
This error is the signal that gets propagated backwards through the network in the next step.
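The thread doesn't pin down a specific loss, so as a minimal sketch, here's mean squared error, one common choice:

```python
import numpy as np

def mse_loss(y_pred, y_true):
    # Mean squared error: average squared gap between
    # predictions and targets.
    return np.mean((y_pred - y_true) ** 2)

print(mse_loss(np.array([0.9, 0.2]), np.array([1.0, 0.0])))  # -> 0.025
```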
2) Propagation
The error is propagated backwards through the network by applying the chain rule, which gives the gradient of the error with respect to each weight.
These gradients measure how much each weight contributed to the error, and they drive the weight adjustment in the next step.
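Here's a minimal sketch of the chain rule at work for a single sigmoid neuron (same toy setup as above; variable names are mine):

```python
import numpy as np

# Forward pass for one sigmoid neuron (same toy setup as before).
x = np.array([0.5, -1.2])
w = np.array([0.8, 0.3])
b = 0.1
y_true = 1.0

z = np.dot(w, x) + b
y_pred = 1.0 / (1.0 + np.exp(-z))

# Backward pass: chain rule, one factor per link in the chain.
d_error_d_pred = 2.0 * (y_pred - y_true)  # d(error)/d(prediction), for squared error
d_pred_d_z = y_pred * (1.0 - y_pred)      # derivative of the sigmoid
d_z_d_w = x                               # d(weighted sum)/d(weights)

# Gradient of the error with respect to each weight.
grad_w = d_error_d_pred * d_pred_d_z * d_z_d_w
print(grad_w)
```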
3) Weight adjustment
An optimization algorithm (e.g. gradient descent) uses those gradients to nudge each weight in the direction that shrinks the error, improving the network's performance.
The updated weights are then used in the next feedforward pass.
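The update itself is basically one line. A sketch of a plain gradient descent step, with placeholder gradient values standing in for the ones computed during propagation:

```python
import numpy as np

lr = 0.1                          # learning rate: step size, a hyperparameter you tune
w = np.array([0.8, 0.3])          # current weights
grad_w = np.array([0.05, -0.12])  # placeholder gradients from the propagation step

# Gradient descent: move each weight a small step *against* its gradient.
w = w - lr * grad_w
print(w)
```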
4) Repeat
Backpropagation is repeated many times to improve the network's performance.
Each pass involves calculating the output, the error, and the gradients, then adjusting the weights.
By repeating this process, the network learns from the data and its performance improves.
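Putting the four steps together, here's a minimal sketch of the full loop for a single sigmoid neuron on toy data (hyperparameters are arbitrary; a real network stacks many layers, but the loop looks the same):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 8 samples, 2 features; the label is 1 when the
# features sum to something positive (a learnable pattern).
X = rng.normal(size=(8, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(float)

w = rng.normal(size=2)  # random initial weights
b = 0.0
lr = 1.0                # arbitrary learning rate

for epoch in range(5000):
    # Feedforward: predictions for every sample.
    z = X @ w + b
    y_pred = 1.0 / (1.0 + np.exp(-z))

    # 1) Error calculation (mean squared error).
    error = np.mean((y_pred - y) ** 2)

    # 4) Repeat until the error falls below a threshold.
    if error < 0.01:
        break

    # 2) Propagation: chain rule gives the gradient per weight.
    d_z = 2.0 * (y_pred - y) * y_pred * (1.0 - y_pred) / len(y)
    grad_w = X.T @ d_z
    grad_b = d_z.sum()

    # 3) Weight adjustment: one gradient descent step.
    w -= lr * grad_w
    b -= lr * grad_b

print(f"stopped after {epoch} epochs, error = {error:.4f}")
```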
If you found this thread helpful, go back up and retweet it and don't forget to follow me @DataScienceHarp for more deep learning fundamentals in your feed.
Skip connections are a common feature in modern CNN architectures. They create an alternative path for the gradient to flow through, which can help the model learn faster.
In a neural network, the gradient measures how much a change in one part of the network affects the output. During training, we use the gradient to update the network so it gets better at recognizing patterns in the data.
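Tying those two ideas together, here's a minimal sketch of a residual (skip-connection) block; the shapes and names are illustrative, not from any particular architecture:

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def residual_block(x, W1, W2):
    # The ordinary path: two weight layers with an activation between.
    h = relu(W1 @ x)
    f = W2 @ h
    # The skip connection: add the input straight onto the output,
    # giving the gradient a shortcut path during backprop.
    return relu(f + x)

rng = np.random.default_rng(0)
x = rng.normal(size=4)
W1 = rng.normal(size=(4, 4))
W2 = rng.normal(size=(4, 4))
print(residual_block(x, W1, W2))
```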