π§΅
Ridge Regression (RR) is regularization technique used in statistical modeling & ML to handle the problem of multicollinearity (high correlation) among predictor variables
May 15 β’ 8 tweets β’ 3 min read
Are you completely new to SQL and do not know where to start?
Here is a simple concept roadmap for learning SQL as a complete beginner:
π§΅
Polynomial regression is type of regression analysis where relationship between independent variable(s) and dependent variable is modeled as an nth-degree polynomial function.
It is an extension of simple linear regression which assumes linear relationship between the variable
May 13 β’ 6 tweets β’ 2 min read
πΈCONCAT_WS() in SQL { Very Helpful }
A Thread π§΅
CONCAT_WS() function in SQL is used to concatenate multiple strings into single string with specified separator between each string
"WS" stands for "with separator." This function is commonly used to construct strings contain multiple values such create comma-separated list
A Thread π§΅
SGD is an optimization algorithm often used in machine learning applications to find the model parameters that correspond to the best fit between predicted and actual outputs. Itβs an inexact but powerful technique.
A Thread π§΅
(BGD) is optimization algorithm commonly used in ML & optimization problems to minimize the cost function or maximize the objective function
It is type of GD algorithm that update model parameters by taking the average gradient of entire training dataset at each iteration
Topic -- Principle Component Analysis
(PCA) Part 1
PCA statistics is science of analyzing all the dimension & reducing them as much as possible while preserving exact information
You can monitor multi-dimensional data (can visualize in 2D or 3D dimension) over any platform using the Principal Component Method of factor analysis.
Apr 29 β’ 6 tweets β’ 4 min read
Hello Folks π¨βπ»
If you are someone who is learning SQL, then this list can be helpful to you.
SQL - END-TO-END Learning Resources and Guide π ( Must Read ) 1. SQL for Data Science
π§΅
Refers to phenomenon where the performance of ML algorithms deteriorates as No. of dimension or feature of input data β¬οΈ
This is because the volume of space increases exponentially with No. of dimension which causes data to become sparse & distance btwn data point to increase
A Thread π§΅
Feature construction is a critical aspect of feature engineering, which involves the process of creating new features or transforming existing ones to improve the performance of machine learning models.
π§΅
π―Are NULL values same as that of zero or a blank spaceβ
πΊA NULL value is not at all same as that of zero or a blank space.
πΊNULL value represents a value which is unavailable, unknown, assigned or not applicable whereas a zero is a number and blank space is a character.
Topic -- Outlier Detection & Removal using Z-score Method
A Thread π§΅
The Z-score method is statistical approach used for detecting & removing outlier in dataset. An outlier is observation that lies far away from other observation in dataset. Such observations can significantly affect statistical properties of dataset & lead to erroneous conclusion
Topic - Handling Mixed Variable in Feature Engineering π¨βπ»
A Thread π§΅
Handling missing Variable is very important as many machine learning algorithms do not support data with missing values. If you have missing values in the dataset, it can cause errors and poor performance with some machine learning algorithms.
Apr 15 β’ 8 tweets β’ 3 min read
30 Most Important SQL Interview Question { Must Read }