As I started to learn Data Science I didn't know what skills should I learn and where. That was a ton of content and I didn't know which one should I take. I have read more than a hundred articles and talk with some of my data scientist friend and gathered experience
1
during my journey. I want to share a roadmap and skills that you need as a junior Data Scientist and resources to learn.
1- Start with a language programming and best of all Python. You can learn Python from 3 resources.
Taking one of these courses is enough.
2
A- 2022 Complete Python Bootcamp From Zero to Hero in Python by Jose Portilla in Udemy.
Jose Portilla is my favorite instructor. This Course has a GitHub repo where you can access Codes there.
Ps1: if you watch all of the courses and read all of the books in the world related to Data Science. You wouldn't learn Data Scienc e until you take your hands dirty.
Ps2:Best way to learn is learning by doing. As you learn for example Python do some projects and then go
16
to SQL. Repeat this method for all skills that you are learning.
Ps3. For getting feedback and keeping yourself Motivated share stuff you are learning and projects you are doing on social media.
Ps4: These courses give you skills that a junior Data Scientist needs
17
Ps4: Take Kaggle seriously. You can find their many datasets and related Notebooks. Learn from Notebooks and make on top of them.
Ps5- to be updated read the paper, blog, and hear the podcast.
If you find this Thread useful please feel free to retweet and like it.
18
I will update this thread.
So if you have any suggestions it would definitely make me happy to read that.
The success of all Machine Learning algorithms depends on how you present the data. Every model gets input data and gives us an output. When your goal is to get the best possible output from input,
1/
You need to present the best data to the model. This is a problem that Feature Engineering solves. Feature Engineering refers to the process of using the domain of Knowledge to extract features from raw data.
2/
In other words, Feature Engineering selects the most useful features from our raw data and presents them to our model, whereby we improve the performance of our model.
Let's assume you have three Features(age, height, salary) in your example.
The first feature varies from 1 to 90. The second one varies from 120 to 210 and the Third one varies from 1000 Euro to 4500 Euro. #Thread
1/
As you can see the value of your features are in a different range. In this case, if you want to use gradient descent to find optimum parameters for your model( for instance linear regression), that leads to a slow speed of your model to converge. In this case,
/2
you can utilize Feature Scaling to bring the value of features in a range from 0 to 1 depending on the Scaling technique, that you use. So you improve the speed of your model convergence.
3/