Batch Gradient Descent (BGD) is an optimization algorithm commonly used in ML and optimization problems to minimize a cost function or maximize an objective function.
It is a type of gradient descent (GD) algorithm that updates the model parameters by taking the average gradient over the entire training dataset at each iteration.
Here's how the BGD algorithm works:
1) Initialize the model parameters: Start by initializing the model parameters, such as weights and biases, with random values.
2) Compute the cost function: Evaluate the cost function or the objective function on the entire training dataset. The cost function measures the error or the difference between the predicted values of the model and the actual values in the training dataset.
3) Compute the gradient: Calculate the gradient of the cost function with respect to each model parameter. The gradient represents the direction and magnitude of the steepest ascent or descent of the cost function.
4) Update the parameters: Adjust each model parameter by subtracting a fraction of the gradient from the current parameter value. The fraction is determined by the learning rate, which controls the step size taken in each iteration. The update equation for a parameter θ is: θ = θ - learning_rate * gradient
5) Repeat steps 2-4: Iterate steps 2 to 4 until a stopping criterion is met. This criterion can be a maximum number of iterations, reaching a certain level of convergence, or other conditions specific to the problem (see the code sketch after these steps).
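To make the steps concrete, here is a minimal Python/NumPy sketch of BGD applied to simple linear regression; the synthetic data, learning rate, and iteration count are illustrative assumptions, not values from the text above.

```python
import numpy as np

# Illustrative data: y = 3x + 2 plus noise (assumed for this example)
rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(100, 1))
y = 3 * X[:, 0] + 2 + rng.normal(0, 0.1, size=100)

# 1) Initialize the parameters (weight and bias) with random values
w, b = rng.normal(), rng.normal()
learning_rate = 0.1

for step in range(500):                      # 5) repeat until the stopping criterion (fixed iteration count here)
    y_pred = w * X[:, 0] + b                 # model prediction
    error = y_pred - y
    cost = np.mean(error ** 2)               # 2) cost (MSE) evaluated on the ENTIRE training dataset

    # 3) gradients of the cost, averaged over all training samples
    grad_w = 2 * np.mean(error * X[:, 0])
    grad_b = 2 * np.mean(error)

    # 4) parameter update: theta = theta - learning_rate * gradient
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b

print(f"w={w:.3f}, b={b:.3f}, final cost={cost:.5f}")  # w and b should approach 3 and 2
```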
The key point of BGD is that the gradient is calculated using the entire training dataset at each iteration. This makes it computationally expensive for large datasets but ensures a more accurate estimate of the gradient. When the cost function is convex, it also guarantees convergence to the global minimum (or maximum).
Despite its advantages, BGD has some limitations.
It requires the entire training dataset to be loaded into memory, which can be challenging for large datasets.
Additionally, BGD may converge slowly if the cost function is non-convex or has many local minima.
To overcome the memory and computational limitations of BGD, variations such as Stochastic Gradient Descent (SGD) and Mini-Batch Gradient Descent (MBGD) are often used.
SGD updates the parameters using only one random training sample at a time, while MBGD updates the parameters using a small subset (mini-batch) of the training dataset. These variations offer faster convergence and can handle larger datasets more efficiently.
Mini-batch gradient descent strikes a balance between the robustness of SGD and the efficiency of BGD.
SGD is an optimization algorithm often used in machine learning applications to find the model parameters that correspond to the best fit between predicted and actual outputs. It’s an inexact but powerful technique.
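As a rough sketch of how these variants differ from BGD, the loop below estimates the gradient from a random mini-batch instead of the full dataset (setting batch_size = 1 reduces it to plain SGD); the data and hyperparameters are assumed for illustration only.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.uniform(-1, 1, size=(1000, 1))
y = 3 * X[:, 0] + 2 + rng.normal(0, 0.1, size=1000)

w, b = 0.0, 0.0
learning_rate = 0.05
batch_size = 32                      # set batch_size = 1 to get plain SGD

for epoch in range(20):
    order = rng.permutation(len(X))  # shuffle the samples once per epoch
    for start in range(0, len(X), batch_size):
        idx = order[start:start + batch_size]
        xb, yb = X[idx, 0], y[idx]
        error = (w * xb + b) - yb
        # gradient estimated from the mini-batch only, not the whole dataset
        w -= learning_rate * 2 * np.mean(error * xb)
        b -= learning_rate * 2 * np.mean(error)

print(f"w={w:.3f}, b={b:.3f}")       # should approach w≈3, b≈2
```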
A saddle point (or minimax point) is a point on the surface of the graph of a function where the slopes (derivatives) in orthogonal directions are all zero (a critical point), but which is not a local extremum of the function.
Figure: A saddle point (in red) on the graph of z = x² − y² (a hyperbolic paraboloid).
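A quick numerical check of this example: for z = x² − y², both partial derivatives vanish at the origin, yet z increases along the x-axis and decreases along the y-axis, so (0, 0) is a critical point but not a local extremum. The snippet below simply illustrates that fact.

```python
def z(x, y):
    return x**2 - y**2

# Gradient is (2x, -2y), so it is (0, 0) at the origin: a critical point.
eps = 1e-3
print(z(eps, 0) > z(0, 0))   # True: z increases along the x direction
print(z(0, eps) < z(0, 0))   # True: z decreases along the y direction
# Neither a local minimum nor a local maximum -> a saddle point.
```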
Topic -- Principal Component Analysis (PCA), Part 1
PCA is a statistical technique for analyzing all the dimensions (features) of a dataset and reducing them as much as possible while preserving as much of the original information (variance) as possible.
Using the principal component method of factor analysis, you can visualize multi-dimensional data in 2 or 3 dimensions on any platform.
Step-by-step explanation of Principal Component Analysis (a code sketch follows the steps):
1) STANDARDIZATION
2) COVARIANCE MATRIX COMPUTATION
3) FEATURE VECTOR
4) RECAST THE DATA ALONG THE PRINCIPAL COMPONENT AXES
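As a minimal NumPy sketch of these four steps; the random data and the choice to keep two components are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(200, 5))                 # illustrative data: 200 samples, 5 features

# 1) Standardization: zero mean, unit variance for each feature
X_std = (X - X.mean(axis=0)) / X.std(axis=0)

# 2) Covariance matrix of the standardized features
cov = np.cov(X_std, rowvar=False)

# 3) Feature vector: eigenvectors of the covariance matrix,
#    sorted by decreasing eigenvalue (explained variance)
eigenvalues, eigenvectors = np.linalg.eigh(cov)
order = np.argsort(eigenvalues)[::-1]
feature_vector = eigenvectors[:, order[:2]]   # keep the top 2 principal components

# 4) Recast the data along the principal component axes
X_pca = X_std @ feature_vector
print(X_pca.shape)                            # (200, 2): reduced from 5 dimensions to 2
```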