1/ Book overview and review
Practical Data Science with Python (By Nathan George)
𧡠See thread below #datascience#python
2/ π€ Author
- Nathan George
- Data scientist at a fintech company
- Taught at Regis University, DataCamp and Manning LiveProject
- Mentor students at Udacity AI and Machine Learning NanoDegree
3/ π Book details
- About 600 pages
- 21 Chapters
- 6 Parts
4/ π€ Who is this book for?
- Those wanting to break into data science
- Have beginner to intermediate level of Python coding skill
5/ π 6 Parts of the Book 1. An Introduction and the Basics 2. Dealing with Data 3. Statistics for Data Science 4. Machine Learning 5. Text Analysis and Reporting 6. Wrapping Up
6/ Part 1. An Introduction and the Basics
- Provide a brief history of the field of data science, skill sets, specializations and best practices for data science projects
- Provide tips on getting started with Python and its practical setup for data science projects
7/ Part 2. Dealing with Data
- How to handle various data sources (SQLite, Pandas, Numpy, MS Word, PDF, MS Excel, etc.)
- How to web scrape
8/ Part 3. Statistics for Data Science
- Explains concisely and with practical examples on key statistical concepts (Probability, Distribution and Sampling)
- Covers hypothesis testing concepts and statistical tests using scipy.stats module
9/ Part 4. Machine Learning
- Preparing data prior to machine learning model building (feature engineering, feature selection, etc.)
- Model building (Classification, regression, performance evaluation, hyperparameter optimization, AutoML)
10/ Part 5. Text Analysis and Reporting
- Clustering methods (k-means, hierarchical, etc.)
- Working with text (preprocessing, text analysis, sentiment analysis, etc.)
11/ Part 6. Wrapping Up
- Making data dashboards with Streamlit
- Ethics and Privacy issues pertaining to data and machine learning
- How to stay up to date
12/ My impressions 1/2
- The author provided a thorough coverage on essential topics of using Python for practical implementation of data science projects
- Key libraries were covered: Python standard libraries, Scikit-learn, Pandas, Numpy, Scipy, Beautiful soup, etc.
13/ My impressions 2/2
- Data science concepts were explained alongside code examples. Thus, the book can easily serve as a handy desk reference for data practitioners
14/ Getting the book
- The ebook is available at a heavy discount to $5 packt.link/nathangeorge
β’ β’ β’
Missing some Tweet in this thread? You can try to
force a refresh
1. Data Science Starter Kit
This starter kit provides a framework that will help pinpoint you in the right direction and help you take your first steps towardsdatascience.com/data-science-sβ¦
2. How to Master Python for Data Science
This article will have navigate you through the landscape of the Python language at their intersection with data science, which will help you get started in no time. towardsdatascience.com/how-to-master-β¦
Getting started on the Open #Bioinformatics Research Project initiative
ππ§΅π See thread below
1. Watch the introductory video on the Open Bioinformatics Research Project initiative for:
- Intro to the initiative
- High-level overview of the dataset
- Ideas for which types of analysis to perform
2/ 1. Craft your own personal learning plan
Earlier this year I made a video that details the steps you can take to craft your own personal learning plan for your data journey. Everyone's plan is different, make your own! Here's how...
3/ 2. Work on data projects using datasets that is interesting to you
When starting out, I found that working on datasets that's interesting to you will help you engage in the process. Be persistent and work on the project to completion (end-to-end).
How? DataβModelβ Deployment
Hereβs a cartoon illustration Iβve drawn a while back:
The #machinelearning learning curve
ππ§΅π See thread below
2/ Starting the learning journey
The hardest part of learning data science is taking that first step to actually start the journey.
3/ Consistency and Accountability
After taking that first step, it may be challenging to maintain the consistency needed to push through with the learning process. And thatβs where accountability steps in.
Hi friends, hereβs my new hand-drawn cartoon illustration βοΈ
Quickly deploy #machinelearning models
ππ§΅π See thread below
2/ Deployment of machine learning models is often overlooked especially in academia
- We spend countless hours compiling the dataset, processing the data, fine tuning the model and perhaps interpreting and making sense of the model
- Many times we stop at that
- Why not deploy?
3/ Topics include:
- Overview of Data science
- Probability and Statistics
- Data cleaning
- Feature engineering
- Modeling
- Classical Machine learning
- Deep learning
- SQL
- Python data structures