Feature construction is a critical aspect of feature engineering, which involves the process of creating new features or transforming existing ones to improve the performance of machine learning models.
The goal of feature construction is to extract meaningful information from raw data and represent it in a way that can be effectively used by machine learning algorithms.
There are several ways to perform feature construction including:
1⃣Mathematical transformations: This involves applying mathematical functions such logarithmic, exponential or square root transformations to existing features to create new features.
2⃣ Aggregation: This involves summarizing data by grouping them based on some criteria such as time, category, or geographic location
3⃣ Interaction features: This involves creating new features by combining two or more existing features
4⃣ Textual feature engineering: This involves processing textual data by extracting features such as word counts, TF-IDF, n-grams, and sentiment analysis.
feature construction is a crucial step in feature engineering that can significantly impact the performance of machine learning models. It requires a deep understanding of the data, the problem domain, and the machine learning algorithms used
Feature creation is one of most common techniques in FE . It also known as feature construction or feature synthesis It involves creation of new features from existing data that performed by combining or transforming existing feature @scaler_official
Topic -- Principle Component Analysis
(PCA) Part 1
PCA statistics is science of analyzing all the dimension & reducing them as much as possible while preserving exact information
You can monitor multi-dimensional data (can visualize in 2D or 3D dimension) over any platform using the Principal Component Method of factor analysis.
Step by step explanation of Principal Component Analysis
STANDARDIZATION
COVARIANCE MATRIX COMPUTATION
FEATURE VECTOR
RECAST THE DATA ALONG THE PRINCIPAL COMPONENTS AXES
Refers to phenomenon where the performance of ML algorithms deteriorates as No. of dimension or feature of input data ⬆️
This is because the volume of space increases exponentially with No. of dimension which causes data to become sparse & distance btwn data point to increase
Many ML algorithms struggle to find meaningful patterns & relationships in high-dimensional data & may suffer from overfitting or poor generalization performance. This can lead to longer training time increased memory requirements & reduced accuracy & efficiency in predictions.
🎯Are NULL values same as that of zero or a blank space❓
🔺A NULL value is not at all same as that of zero or a blank space.
🔺NULL value represents a value which is unavailable, unknown, assigned or not applicable whereas a zero is a number and blank space is a character.
🎯What is the usage of the NVL() function❓
🔹Answer
🔺You may use NVL function to replace null values with a default value. 🔺The function returns the value of second parameter if first parameter is null.
🔺If the first parameter is anything other than null, it is left alone