"People need to know Maths to become Data Scientists or Machine Learning Engineer"

- True! 😀

But, how much do we need to know? 🤔⁉️

This thread 🧵 is an outline of the concepts we should know
1. Let's start with Linear Algebra

You can start working on Data Science or ML without knowing them.

But at some time you may wish to dive deeper.

If you ask me, if there was 1 area of Maths that I would suggest you improve before the other, it would be Linear Algebra.
If I could convince you to learn a minimum of Linear Algebra for Machine Learning, it would be the following👇:

- Systems of Linear Equations & Solving them
- Matrices
- Vector Spaces
- Linear Independence
- Basis & Rank
- Linear Mappings / Projections
Still not convinced? Here's the usage👇

- Principal knowledge of Matrix Operations is essential for Vector Calculus

- We need to use projections for dimensionality reduction with PCA

- Linear Algebra plays a central role in solving least-squares problems in Linear Regression.
2. Second is Analytical Geometry:

This one is a sub-section of Linear Algebra.

Here, you add some geometric intuition to all of the concepts mentioned in the previous tweet.

- Norms
- Inner Products
- Lengths & Distances
- Angles & Orthogonality
Application👇

- Inner products and corresponding norms & metrics along with notions of similarity & distances are used to develop the Support Vector Machine.

- Orthogonal projections play a central role in regression via maximum likelihood estimation and PCA
3. Matrix Decompositions:

This completes the topic Linear Algebra😀

Here we need to know
- Determinants
- Eigenvalues & Eigenvectors
- Eigendecomposition and Diagonalization
Don't get demotivated, all these topics are easy to learn with the help of some free online resources👇

Youtube playlists that are goldmine:
- @3blue1brown youtube.com/playlist?list=…

- @khanacademy youtube.com/playlist?list=…

- Free MIT Course youtube.com/playlist?list=…
4. Vector Calculus

This is one of the fundamental mathematical tools we need in Machine Learning. [Do not freak out, I will break it into small components]

Start with
- Basic understanding of what Functions are
- Graph of a Function
- Contour Plots
- Vector Field
Once you feel confident with Functions, move into Differential Calculus

- Derivative of a Function
- Partial Derivatives
- Rules of Differentiation
- Vector-Valued Functions
◦ Derivatives
◦ Partial Derivatives
- Gradient
- Chain Rule
- Jacobian & Hessian Matrix
Did you get overwhelmed? Not yet? Well this is the hardest part [maybe 😐]

Applications of Multi-variable Derivatives
- Critical Points
- Local Maxima, Minima & Saddle Point
- Second Derivative Test
- Lagrange Multipliers
- Least Squares Interpolation
- Gradient Descent
To end Vector Calculus, learn a bit of Integration [not much is required 🎉]

- Area Under the Curve
- Arc Length of Function Graphs and Parametric Curves
- Rules of Integration
- Double Integrals

That's it for this topic, phew 😮‍💨
Again there are few exceptional free resources online which you can follow & become great at it.

- MIT youtube.com/playlist?list=…

- @khanacademy khanacademy.org/math/different…
5. Let's End it with some Probability

Start with some basics
- Probability Models and Axioms
◦ Addition and Multiplication Rule
- Conditional Probability
- Independence
- Total Probability Theorem
- Bayes Rule
- Counting
This is probably the most important part that everyone needs to get right.

- Discrete & Continous Random Variable
- PMF, PDF, CDF
- Expected Value, Variance, & Standard Deviation
- Know about the most popular distributions
- Co-variance and Correlation
Finish it off with

- The Weak Law of Large Numbers
- Central Limit Theorem

In case you love Probability then try Markov Chains

You can follow my favorite youtube channel
youtube.com/c/BrandonFoltz…

or this gold mine -
khanacademy.org/math/statistic…
I could have added Statistics in this thread but decided to keep it separate. Learning the above concepts will make stats much easier.

Credits - "MATHEMATICS FOR MACHINE LEARNING" book

If you're feeling lost then DM me and I will try to convince you how easy it can be for you.
That's it for this tread 👋

Please do add if you feel I have missed something!

A retweet for the first one would really mean a lot 🙏

If you liked my content and want to get more threads on Data Science, Machine Learning & Python, do follow me @PiyalBanik

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Piyal Banik

Piyal Banik Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @PiyalBanik

11 Jul
Here are this week's Data Science Interview Questions along with the correct answer

Thread 🧵👇

#MachineLearning #Python #100DaysOfCode
Answer by @josh_ko_naman

1) SL has a feedback mechanism.
UL has no feedback mechanism.

2) Supervised learning involves building a model for predicting, or estimating.
In unsupervised learning, we can learn relationships and structures from data

Answer by @ammaryh92 & @arunkumarai

-regularization
-simpler model architecture
-more training data
-reduce noise in the data
-reduce the number of input attributes
-shorter training cycles

Read 7 tweets
9 Jul
15 Days roadmap to master #Python basics for #DataScience & #MachineLearning without having any Prior Experience.

[ Join the #100DaysOfCode & #66daysofdata challenge to keep yourself motivated ]

Thread 🧵👇
Few things to keep in mind before starting
- Learn By Doing, Practicing & Not Just Reading
- Code By Hand [very effective]
- Share, Teach, Discuss and Ask For Help
- Use Online Resources
- Be consistent
- Learn to Use Debugger
I have done all the below-mentioned concepts as part of the #100DaysOfCode challenge and the code can be found in my @github profile.

[Projects & exercise not done. let me know if you want the solutions]

github.com/Piyal-Banik/10…
Read 21 tweets
3 Jul
Want to learn Data Science but confused about where to start and what to follow?

Here are the ultimate 12 months Learning path to becoming a Data Scientist 👨‍🎓

Note: I'm personally following this roadmap

🧵👇

#DataScience #MachineLearning #100DaysOfCode #66DaysOfCode #Python
Since we're currently in July, so start from this month.

Understanding Data Science and getting started with Python
- what is data science?
- what does a data scientist do?
- find out various resources
- Set up the system
- Learn Python basics
- Introduction to Pandas & Numpy
August -

Mathematics, Statistics & SQL
- Linear Algebra
- Introduction to Probability
- Statistics - inferential & descriptive
- Exploratory Data Analysis
- SQL for Data science
- Projects on EDA and SQL

Start engaging in the Data Science & Machine Learning community
Read 14 tweets
2 Jul
Best Data Science blogs to follow in 2021

🧵👇

#DataScience #66daysofdata #100DaysOfCode
1. Towards Data Science

TDS is a Medium publication having audience-oriented content about Data Science, along with blogs on related fields such as Machine Learning, Programming, Visualization, and Artificial Intelligence.

towardsdatascience.com
2. Data Science Central

DSC is one of the leading repositories of Data Science content that is regularly updated with the latest trends across domains such as Artificial Intelligence, Machine Learning, Deep Learning, Analytics, Big Data, and much more.

datasciencecentral.com
Read 9 tweets
1 Jul
NumPy 🔥

It is a Linear Algebra Library for #Python, the reason it is so important for Data Science is that almost all of the libraries in the PyData Ecosystem rely on NumPy as one of their main building blocks👨‍🏫.

Here's everything you need🧵👇

#DataScience #100DaysOfCode
1⃣Numpy Arrays

NumPy arrays are the main way we use Numpy. Numpy arrays essentially come in two flavors: vectors and matrices. Vectors are strictly 1-d arrays and matrices are 2-d (but you should note a matrix can still have only one row or one column). Image
2⃣Built-in Methods

There are lots of built-in ways to generate Arrays
- zeros
- ones
- eye
- arange
- linspace Image
Read 11 tweets
19 Jun
Ever wondered how a Data Scientist thinks about a problem? Here are the major steps involved in tackling a data science problem.

Thread 🧵👇

#DataScience #MachineLearning #100DaysOfCode
1. Business Understanding: We should have clarity of what is the exact problem we are going to solve.

What is the problem that we are trying to solve? - Asking the right questions as a Data Scientist starts with understanding the goal of the business.
2. Analytical Approach: How can we use data to answer the question? We should decide the analytical approach to follow which can be of 4 types
- Descriptive
- Statistical
- Predictive
- Prescriptive
and it indicates the necessary data content, formats, and sources to be gathered
Read 12 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(