To train a Machine Learning model later, you need enough historical data (features, targets) in your Feature Store.
Run the feature script for a range of past dates, to get enough training data.
Step 4: Model training script ๐๏ธ
1 โ fetches historical (features, targets) from the Feature Store.
2 โ trains and evaluate the best ML model possible for this data, e.g. XGBoostRegressor.
3 โ stores the trained model in the Model Registry.
Step 5: Automate execution of the feature script ๐ฐ๏ธ
Create a GitHub action to automatically run the feature script (from step 1) every hour.
GitHub actions are serverless computing power to run your code on a schedule. For free.
Beautiful.
Step 6: Create a web app to show model predictions ๐จ๐ฝโ๐ป
Streamlit is a powerful Python library to develop and deploy web data apps.
Your app
1 โ loads the model and features from the *Feature Store*,
2 โ computes model predictions and shows them on a beautiful UI.
BOOM!
Bonus ๐
You can create another GitHub action to automate the model training script.
Why re-train the model? ๐ค
Because ML model performance decreases over time.
The best way to mitigate this is to regularly re-train the model, like once a week.
Wanna level up in ML/MLOps?
Join my e-mail list and get one article ๐๐๐ฒ๐ฟ๐ ๐ฆ๐ฎ๐๐๐ฟ๐ฑ๐ฎ๐ ๐บ๐ผ๐ฟ๐ป๐ถ๐ป๐ด โ datamachines.xyz/subscribe/
Every week I share real-world Data Science/Machine Learning content.
Follow me @paulabartabajo_ so you do not miss what's coming next.
Wanna help?
Like/Retweet the first tweet below to spread the wisdom โโโ