Since we're currently in July, so start from this month.
Understanding Data Science and getting started with Python
- what is data science?
- what does a data scientist do?
- find out various resources
- Set up the system
- Learn Python basics
- Introduction to Pandas & Numpy
August -
Mathematics, Statistics & SQL
- Linear Algebra
- Introduction to Probability
- Statistics - inferential & descriptive
- Exploratory Data Analysis
- SQL for Data science
- Projects on EDA and SQL
Start engaging in the Data Science & Machine Learning community
Learn about Validation, Hyperparameter tuning & Time Series
- Validation Strategies
- Hyperparameter tuning
- Time Series
- Time Series Project
Build Resume and apply for Internships
December -
Getting started with Neural Networks & Deep Learning
- Setup the system for Deep Learning or learn using @GoogleColab
- Introduction to Deep Learning (ANN)
- Introduction to Keras
Start writing Articles
January 2022 -
Convolutional Neural Network
- Understand CNN
- Image classification using Keras
- Transfer Learning in Computer Vision
Computer Vision Projects
- Project 1: Color Detection
- Project 2: Perform Face Detection on Family Photos.
- Project 3: Human Emotion and Gesture Recognition
March -
Natural Language Processing
- Understand RNN, LSTM, GRU
- Text Preprocessing & Cleaning
- Text Classification
April -
Advanced Natural Language Processing
- Text Summarisation
- Word Embeddings
- Topic Modelling
- NLP Project
- Transfer Learning in NLP
TDS is a Medium publication having audience-oriented content about Data Science, along with blogs on related fields such as Machine Learning, Programming, Visualization, and Artificial Intelligence.
DSC is one of the leading repositories of Data Science content that is regularly updated with the latest trends across domains such as Artificial Intelligence, Machine Learning, Deep Learning, Analytics, Big Data, and much more.
It is a Linear Algebra Library for #Python, the reason it is so important for Data Science is that almost all of the libraries in the PyData Ecosystem rely on NumPy as one of their main building blocks๐จโ๐ซ.
NumPy arrays are the main way we use Numpy. Numpy arrays essentially come in two flavors: vectors and matrices. Vectors are strictly 1-d arrays and matrices are 2-d (but you should note a matrix can still have only one row or one column).
2โฃBuilt-in Methods
There are lots of built-in ways to generate Arrays
- zeros
- ones
- eye
- arange
- linspace
1. Business Understanding: We should have clarity of what is the exact problem we are going to solve.
What is the problem that we are trying to solve? - Asking the right questions as a Data Scientist starts with understanding the goal of the business.
2. Analytical Approach: How can we use data to answer the question? We should decide the analytical approach to follow which can be of 4 types
- Descriptive
- Statistical
- Predictive
- Prescriptive
and it indicates the necessary data content, formats, and sources to be gathered
Data scientist use their analytical and technical capabilities to extract meaningful insight from data.
2. Machine Learning Engineer
Machine Learning engineer's final output is the working software, and their audience for this output consists of other software components that run automatically with minimal human supervision. The decisions are made by machines.