Mikaeil Profile picture
Sep 7 19 tweets 11 min read
As I started to learn Data Science I didn't know what skills should I learn and where. That was a ton of content and I didn't know which one should I take. I have read more than a hundred articles and talk with some of my data scientist friend and gathered experience
1
during my journey. I want to share a roadmap and skills that you need as a junior Data Scientist and resources to learn.

1- Start with a language programming and best of all Python. You can learn Python from 3 resources.
Taking one of these courses is enough.

2
A- 2022 Complete Python Bootcamp From Zero to Hero in Python by Jose Portilla in Udemy.

Jose Portilla is my favorite instructor. This Course has a GitHub repo where you can access Codes there.

udemy.com/course/complet…

3
B- programming for everybody by Coursera. Please note that by using audit you can access courses for free but without certification.

de.coursera.org/learn/python?s…

C- python in #w3school. It's free and user-friendly.

w3schools.com/python/

2- After learning #Python it's

4
A- SQL for data science in Coursera by university of Davis university.

de.coursera.org/learn/sql-for-…

B- The Complete SQL Bootcamp 2022: Go from Zero to Hero by Jose Portilla in Udemy.

udemy.com/course/the-com…

5
C- SQL by mode.com. Here you can learn and practice simultaneously. You don't need to install any database management system.

mode.com/sql-tutorial/

3- Now it's time to learn Machine Learning

A- Python for Data Science and Machine Learning

6
Bootcamp by Jose Portilla in Udemy.

udemy.com/course/python-…

B- Introduction to Machine learning Course by Udacity.

udacity.com/course/intro-t…

7
C- Building an Effective Machine Learning Workflow with scikit-learn.
@justmarkham is a great instructor. He makes everything easy to understand.

courses.dataschool.io/building-an-ef…

4- Once you learn Python, SQL, and MachineLearning and know how they work and implement them, you can

8
go in and learn theory and Math models. This is called the top-down learning approach.

A- Probability and Statistics:To p or not to p?
If you have no previous statistical knowledge.

de.coursera.org/learn/probabil…

B- Statistical Learning.

I highly recommend this course.

9
youtube.com/playlist?list=…

C- StatQuest.
It's really easy to understand Math here.
youtube.com/c/joshstarmer

D- 3blau1brown

It's another good free course.

10
to learn Math.

youtube.com/c/3blue1brown

5- Learn Git and Github

A- @justmarkham has a good tutorial about Github.

dataschool.io/how-to-contrib…

B- Here is a good short tutorial using Git and Jupyter Notebook.



11
6- Date Visualization.

There is a bunch of tools like Matplotlib and seaborn, Tableau, and Power BI. It seems that Power BI has more users than others.

A- Analyzing and Visualizing Data with Power BI.

This course is for beginners and free.

edx.org/course/data-an…

12
B- Introduction to Power BI by DataCamp.

It's another free course that every beginner can learn from.

datacamp.com/courses/introd…

C- Microsoft's Learn Power BI

It's from Microsoft's Power BI learning center and is good for all levels and free.

docs.microsoft.com/en-us/training…

13
7- Excel

A- Excel Skills for Business specialization on Coursera

coursera.org/specialization…

B- Excel course by freecodecamp on YT.



C- Master data analytics and become an insightful Excel professional.

14
this course is less focused on business applications and more on analyzing data in general.

edx.org/professional-c…

15
Ps1: if you watch all of the courses and read all of the books in the world related to Data Science. You wouldn't learn Data Scienc e until you take your hands dirty.

Ps2:Best way to learn is learning by doing. As you learn for example Python do some projects and then go

16
to SQL. Repeat this method for all skills that you are learning.

Ps3. For getting feedback and keeping yourself Motivated share stuff you are learning and projects you are doing on social media.

Ps4: These courses give you skills that a junior Data Scientist needs

17
Ps4: Take Kaggle seriously. You can find their many datasets and related Notebooks. Learn from Notebooks and make on top of them.

Ps5- to be updated read the paper, blog, and hear the podcast.

If you find this Thread useful please feel free to retweet and like it.

18
I will update this thread.
So if you have any suggestions it would definitely make me happy to read that.

#ArtificialIntelligence #AI #ML #DataScience #DataScientists #CodeNewbies #Tech #deeplearning #CyberSecurity #Python #Coding #javascript #rstats #100DaysOfCode

19

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Mikaeil

Mikaeil Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ItsMikaeil

Sep 10
------------------Feature Engineering----------------------

The success of all Machine Learning algorithms depends on how you present the data. Every model gets input data and gives us an output. When your goal is to get the best possible output from input,

1/
You need to present the best data to the model. This is a problem that Feature Engineering solves. Feature Engineering refers to the process of using the domain of Knowledge to extract features from raw data.

2/
In other words, Feature Engineering selects the most useful features from our raw data and presents them to our model, whereby we improve the performance of our model.

(hopefully, you get the point 😀).

3/
Read 10 tweets
Sep 9
Let's assume you have three Features(age, height, salary) in your example.
The first feature varies from 1 to 90. The second one varies from 120 to 210 and the Third one varies from 1000 Euro to 4500 Euro.
#Thread
1/
As you can see the value of your features are in a different range. In this case, if you want to use gradient descent to find optimum parameters for your model( for instance linear regression), that leads to a slow speed of your model to converge. In this case,
/2
you can utilize Feature Scaling to bring the value of features in a range from 0 to 1 depending on the Scaling technique, that you use. So you improve the speed of your model convergence.

3/
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(