This is one of the fundamental mathematical tools we need in Machine Learning. [Do not freak out, I will break it into small components]
Start with
- Basic understanding of what Functions are
- Graph of a Function
- Contour Plots
- Vector Field
Once you feel confident with Functions, move into Differential Calculus
- Derivative of a Function
- Partial Derivatives
- Rules of Differentiation
- Vector-Valued Functions
◦ Derivatives
◦ Partial Derivatives
- Gradient
- Chain Rule
- Jacobian & Hessian Matrix
Did you get overwhelmed? Not yet? Well this is the hardest part [maybe 😐]
Applications of Multi-variable Derivatives
- Critical Points
- Local Maxima, Minima & Saddle Point
- Second Derivative Test
- Lagrange Multipliers
- Least Squares Interpolation
- Gradient Descent
To end Vector Calculus, learn a bit of Integration [not much is required 🎉]
- Area Under the Curve
- Arc Length of Function Graphs and Parametric Curves
- Rules of Integration
- Double Integrals
That's it for this topic, phew 😮💨
Again there are few exceptional free resources online which you can follow & become great at it.
Start with some basics
- Probability Models and Axioms
◦ Addition and Multiplication Rule
- Conditional Probability
- Independence
- Total Probability Theorem
- Bayes Rule
- Counting
This is probably the most important part that everyone needs to get right.
- Discrete & Continous Random Variable
- PMF, PDF, CDF
- Expected Value, Variance, & Standard Deviation
- Know about the most popular distributions
- Co-variance and Correlation
Finish it off with
- The Weak Law of Large Numbers
- Central Limit Theorem
In case you love Probability then try Markov Chains
1) SL has a feedback mechanism.
UL has no feedback mechanism.
2) Supervised learning involves building a model for predicting, or estimating.
In unsupervised learning, we can learn relationships and structures from data
-regularization
-simpler model architecture
-more training data
-reduce noise in the data
-reduce the number of input attributes
-shorter training cycles
Few things to keep in mind before starting
- Learn By Doing, Practicing & Not Just Reading
- Code By Hand [very effective]
- Share, Teach, Discuss and Ask For Help
- Use Online Resources
- Be consistent
- Learn to Use Debugger
I have done all the below-mentioned concepts as part of the #100DaysOfCode challenge and the code can be found in my @github profile.
[Projects & exercise not done. let me know if you want the solutions]
Since we're currently in July, so start from this month.
Understanding Data Science and getting started with Python
- what is data science?
- what does a data scientist do?
- find out various resources
- Set up the system
- Learn Python basics
- Introduction to Pandas & Numpy
August -
Mathematics, Statistics & SQL
- Linear Algebra
- Introduction to Probability
- Statistics - inferential & descriptive
- Exploratory Data Analysis
- SQL for Data science
- Projects on EDA and SQL
Start engaging in the Data Science & Machine Learning community
TDS is a Medium publication having audience-oriented content about Data Science, along with blogs on related fields such as Machine Learning, Programming, Visualization, and Artificial Intelligence.
DSC is one of the leading repositories of Data Science content that is regularly updated with the latest trends across domains such as Artificial Intelligence, Machine Learning, Deep Learning, Analytics, Big Data, and much more.
It is a Linear Algebra Library for #Python, the reason it is so important for Data Science is that almost all of the libraries in the PyData Ecosystem rely on NumPy as one of their main building blocks👨🏫.
NumPy arrays are the main way we use Numpy. Numpy arrays essentially come in two flavors: vectors and matrices. Vectors are strictly 1-d arrays and matrices are 2-d (but you should note a matrix can still have only one row or one column).
2⃣Built-in Methods
There are lots of built-in ways to generate Arrays
- zeros
- ones
- eye
- arange
- linspace
1. Business Understanding: We should have clarity of what is the exact problem we are going to solve.
What is the problem that we are trying to solve? - Asking the right questions as a Data Scientist starts with understanding the goal of the business.
2. Analytical Approach: How can we use data to answer the question? We should decide the analytical approach to follow which can be of 4 types
- Descriptive
- Statistical
- Predictive
- Prescriptive
and it indicates the necessary data content, formats, and sources to be gathered