Once the system works, start improving it by
- increasing training data size
- increasing the number of features
- trying a more complex ML model
- optimizing model hyper-parameters
Step 6. Push your code to a public GitHub repo and write a beautiful README
The README file is the first thing your future employer will se.
Explain the problem you wanted to solve, and the solution you built.
GitHub actions are *free* computing that makes your life easier.
Here are 3 use cases for ML projects ↓
➡️ Continuous Integration and Deployment (CI/CD)
Machine Learning is software engineering. As such, it is crucial you automate:
→ code updates (aka integration), and
→ code releases to your production environment (aka deployment)
➡️ Batch feature pipelines
This is a program that runs on a chron-like schedule, that fetches raw data from a data source (e.g. a data warehouse), computes ML features, and saves them to a storage service (e.g. a feature store).