Mini-batch gradient descent is a variation of the gradient descent optimization algorithm used in machine learning and deep learning.
It is designed to address the limitations of two other variants: batch gradient descent (BGD) and stochastic gradient descent (SGD).
In BGD, the entire training dataset is used to compute the gradient of the cost function at each iteration.
This approach gives an exact gradient and converges to the global minimum for convex cost functions, but it can be computationally expensive, especially for large datasets.
Stochastic gradient descent (SGD), on the other hand, randomly selects a single training example at each iteration and computes the gradient based on that example alone.
SGD is computationally efficient but exhibits high variance in its gradient estimate, which can lead to slow convergence and noisy updates.
Mini-batch gradient descent combines the best of both worlds by using a small subset, or mini-batch, of the training data at each iteration.
Instead of using the entire dataset (as in BGD) or a single example (as in SGD), mini-batch gradient descent computes the gradient from a mini-batch of training examples.
The mini-batch size is typically chosen to be a compromise between computational efficiency and variance reduction
Common choices for mini-batch sizes are in the range of 10 to 1,000, depending on the size of the dataset and the available computational resources.
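As a concrete illustration, here is a minimal NumPy sketch of mini-batch gradient descent for a simple linear-regression model with a mean-squared-error cost; the function name and hyperparameter defaults are illustrative, not a fixed API.

    import numpy as np

    def minibatch_gd(X, y, lr=0.01, batch_size=32, epochs=100):
        # Mini-batch gradient descent for linear regression (MSE cost).
        n_samples, n_features = X.shape
        w = np.zeros(n_features)   # weights
        b = 0.0                    # bias
        for _ in range(epochs):
            # Shuffle once per epoch so successive mini-batches differ
            idx = np.random.permutation(n_samples)
            for start in range(0, n_samples, batch_size):
                batch = idx[start:start + batch_size]
                Xb, yb = X[batch], y[batch]
                # Gradient of the MSE cost, estimated on this mini-batch only
                error = Xb @ w + b - yb
                grad_w = 2 * Xb.T @ error / len(batch)
                grad_b = 2 * error.mean()
                w -= lr * grad_w
                b -= lr * grad_b
        return w, b

Setting batch_size to the full dataset size recovers BGD, while batch_size=1 recovers SGD.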
The main advantages of mini-batch gradient descent are:
- Efficiency: By using mini-batches, it allows for parallelization of computations, which can significantly speed up the training process, especially on hardware accelerators like GPUs
- Variance reduction: Compared to stochastic gradient descent, mini-batch gradient descent provides a more stable and less noisy estimate of the gradient, resulting in smoother updates and faster convergence.
- Generalization: Mini-batch gradient descent strikes a balance between the smooth, full-dataset updates of batch gradient descent and the noisy updates of stochastic gradient descent, often leading to better generalization performance.
However, mini-batch gradient descent also introduces a new hyperparameter: the mini-batch size.
Selecting an appropriate mini-batch size is a trade-off between computational efficiency and convergence speed:
a larger mini-batch reduces noise in the gradient estimate but increases the computational cost of each update.
Mini-batch gradient descent is widely used as the optimization algorithm of choice for training deep neural networks and other large-scale ML models, offering a good balance between computational efficiency and convergence properties.
SGD is an optimization algorithm often used in machine learning applications to find the model parameters that correspond to the best fit between predicted and actual outputs. It’s an inexact but powerful technique.
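In terms of the mini-batch sketch above, plain SGD is simply the special case with a batch size of one, so each parameter update is computed from a single randomly selected example:

    w, b = minibatch_gd(X, y, batch_size=1)   # hypothetical helper from the sketch above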
A saddle point, or minimax point, is a point on the surface of the graph of a function where the slopes (derivatives) in orthogonal directions are all zero (a critical point) but which is not a local extremum of the function.
A saddle point (in red) on the graph of z = x² − y² (a hyperbolic paraboloid).
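The definition can be checked numerically; a minimal sketch for f(x, y) = x² − y²: the gradient vanishes at the origin, but the Hessian has eigenvalues of mixed sign, so the origin is a critical point that is not an extremum.

    import numpy as np

    # f(x, y) = x**2 - y**2 (the hyperbolic paraboloid above)
    def grad(x, y):
        return np.array([2 * x, -2 * y])     # (df/dx, df/dy)

    hessian = np.array([[2.0, 0.0],
                        [0.0, -2.0]])        # constant Hessian of f

    print(grad(0.0, 0.0))                    # both partial derivatives are zero at the origin
    print(np.linalg.eigvals(hessian))        # [ 2. -2.] -> mixed signs: a saddle, not an extremum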
Topic -- Principal Component Analysis (PCA) Part 1
In statistics, PCA is the science of analyzing all the dimensions of a dataset and reducing them as much as possible while preserving as much of the original information (variance) as possible.
Using the Principal Component Method of factor analysis, multi-dimensional data can then be monitored and visualized in 2D or 3D on any platform.
Step-by-step explanation of Principal Component Analysis (a minimal code sketch follows the steps):
STANDARDIZATION
COVARIANCE MATRIX COMPUTATION
EIGENVECTOR AND EIGENVALUE COMPUTATION OF THE COVARIANCE MATRIX (TO IDENTIFY THE PRINCIPAL COMPONENTS)
FEATURE VECTOR
RECAST THE DATA ALONG THE PRINCIPAL COMPONENTS AXES
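A minimal NumPy sketch of these steps, assuming the data is a numeric 2-D array X with one row per sample (the function and argument names are illustrative):

    import numpy as np

    def pca(X, n_components=2):
        # 1. Standardization: zero mean, unit variance per feature
        Xs = (X - X.mean(axis=0)) / X.std(axis=0)
        # 2. Covariance matrix computation
        cov = np.cov(Xs, rowvar=False)
        # 3. Eigenvectors and eigenvalues of the covariance matrix
        eigvals, eigvecs = np.linalg.eigh(cov)
        order = np.argsort(eigvals)[::-1]                  # sort by explained variance
        # 4. Feature vector: keep the top n_components eigenvectors
        feature_vector = eigvecs[:, order[:n_components]]
        # 5. Recast the data along the principal component axes
        return Xs @ feature_vector

Calling pca(X, n_components=2) returns a two-dimensional representation that can be plotted directly.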