• Everyone knows about ChatGPT but not everyone is aware of how it works.
Here is an attempt at explanation ↓
[ In 10 Steps ]
• It is a large language model based which uses a technique called "transformer" to understand and generate human-like responses to text-based input.
• Transformer is a neural network architecture that excels at processing sequential data, such as text.
Sep 22, 2022 • 23 tweets • 5 min read
"Learn SQL"
Great advice no doubt.
• But what topics to cover?
• Which SQL database to use?
• What resources to learn from?
Here's is a track you can follow ↓
1/22
Let's start with choosing the SQL database to learn.
• There are several of databases like Postgres, MS SQL server, MS Access, Oracle.
• But for learning purposes I'd suggest going with MySQL.
• For reasons that it is secure, free & open source and the support is great.
Sep 14, 2022 • 11 tweets • 3 min read
Interview Question:
• What is Covariance?
• What is Correlation?
• What are the differences between them?
Explain briefly ↓
0/9
COVARIANCE
• Covariance tells us the systematic relationship between two random variables, in which a change in one reflects the change in other.
• It measures the joint variability of two random variables.
• The formula for covariance is:
1/9
Sep 12, 2022 • 11 tweets • 2 min read
Interview Questions
• How does k-means work?
• What are its stopping criteria?
• What are its pros and cons?
• How do you choose its number of clusters?
Explain briefly ↓
0/4 1. Working
• k-means is an unsupervised algorithm.
• We want to create groups of similar data points using this algorithm.
Sep 2, 2022 • 17 tweets • 2 min read
Another common interview question
• What are the assumptions of Linear Regression?
• How do we check them?
• How can we fix them?
Here's the answer ↓
0/5 1. Linear Relationship
It is assumed that the relationship between the dependent and independent variables is linear.
Aug 31, 2022 • 15 tweets • 3 min read
Random Forests is a favorite for interviews!
By far the most common questions that I have been asked are one way or other related to Random Forests
It's important to know it inside out.
Here's are some of those questions:
0/8
Q: What ensemble principle is used in Random Forests?
A: Random Forest works on the principle of the bagging ensemble technique.
Bagging stands for Bootstrap Aggregation.
In Bagging, random data samples in a training set are used with replacement.
1/9
May 25, 2022 • 14 tweets • 4 min read
Machine Learning | Mathematical Resources
Here are some of the resources that I went through for understanding mathematics relevant to machine learning understanding ↓ 1. Linear Algebra
A lot of people asked how to learn TensorFlow and prepare for the certification exam,
Here is how you can do it ↓
• Tensorflow is a free and open-source framework by Google that allows creating and deploying machine learning/deep learning models.
• Tensorflow certification is an official exam that you can take and have a trophy, although it's not necessary for learning.
Jan 4, 2022 • 10 tweets • 2 min read
What format to save your model?
SavedModel or H5?
What are those? What's the difference?
Let's see ↓
• If you have been working on machine learning or deep learning with tensorflow, you must have saved your models.
• Often we see them saved as ↓
Jan 3, 2022 • 5 tweets • 2 min read
Pooling from Scratch
• We saw in the last thread a simple implementation of convolution operation on an image.
• Let's check out a tiny implementation of pooling too ↓
• First, we would read an image and plot it. Notice the original size of the image.
• We will do a (2x2) pool with a stride of 2.
• So if we do the calculation, the image size should get halved.
Jan 3, 2022 • 6 tweets • 2 min read
Convolution from Scratch!
Have you ever implemented a convolution operation on an image from scratch?
Let's do it on a sample image ↓
• First we should open an image and check it out in original format.
• We can get its shape that'll help us to slide the filter over it.
• Now we have to define the filter to be passed.
Dec 8, 2021 • 13 tweets • 4 min read
Randomness and Seeds
• Reproducibility of results is something that we often desire in machine learning
• Due to the random nature of weights getting the same results is quite difficult.
How do we tackle these? ↓
• Oftentimes in machine learning we have operations with randomness as a component.
• On each run of the program or application the results are different, which in turn becomes a problem if we want to get the same results later on.
Oct 20, 2021 • 14 tweets • 4 min read
MAPE is another metric used in performance evaluation in machine learning.
The formula looks a tad bit complex but it isn't.
Let's try to break it down. ↓
• To start with, Mean Absolute Error is a metric which shows how far a value is to the target value.
Oct 7, 2021 • 5 tweets • 2 min read
Ensemble in Machine Learning
Ensemble methods include fitting multiple models with varying features and aggregating their results for the final prediction.
• Choose a sample from the training data
• Train a model
• Save the model - repeat for n times
• Take the output of each of the models and aggregate (say using average).