How to solve Machine Learning problems in the real world
3 practical tips to make your ML life easier ๐งตโ
๐๐ผ๐ป๐๐ฒ๐ ๐
Online courses and Kaggle-style competitions are great resources to learn the fundamentals of ML.
However, the daily job of a machine learning engineer requires an ๐ฎ๐ฑ๐ฑ๐ถ๐๐ถ๐ผ๐ป๐ฎ๐น ๐น๐ฎ๐๐ฒ๐ฟ ๐ผ๐ณ ๐๐ธ๐ถ๐น๐น๐ that you wonโt master there.
Here are the ๐๐ผ๐ฝ ๐ฏ most recurring hidden problems I have faced in my ML life, and my tips for you to deal with them.
GitHub actions are *free* computing that makes your life easier.
Here are 3 use cases for ML projects โ
โก๏ธ Continuous Integration and Deployment (CI/CD)
Machine Learning is software engineering. As such, it is crucial you automate:
โ code updates (aka integration), and
โ code releases to your production environment (aka deployment)
โก๏ธ Batch feature pipelines
This is a program that runs on a chron-like schedule, that fetches raw data from a data source (e.g. a data warehouse), computes ML features, and saves them to a storage service (e.g. a feature store).
To build a Machine Learning product you need to spend money on 3 types of services:
โ Computing, like CPUs and GPUs so you can train and deploy your models.
โ Orchestration, to kick off the 3 pipelines of your system
โ Storage, to save features, models, and experiment runs
And the thing is, not all these services cost you the same.
โ Orchestration and storage are not expensive ๐ธ
โ Computing, on the other hand, can get very expensive ๐ธ๐ธ๐ธ๐ธ๐ธ