To train a Machine Learning model later, you need enough historical data (features, targets) in your Feature Store.
Run the feature script for a range of past dates to backfill enough training data.
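The backfill loop can be sketched like this. `compute_and_store_features` is a hypothetical stand-in for your actual feature script (fetch raw data, compute features and targets, push them to the Feature Store):

```python
# Backfill sketch: run the feature script once per past date to
# populate the Feature Store with (features, targets) for training.
from datetime import date, timedelta

def compute_and_store_features(day: date) -> None:
    # stand-in for the real feature script
    print(f"backfilling features for {day}")

def backfill(start: date, end: date) -> list:
    """Run the feature script for every date in [start, end]."""
    processed = []
    day = start
    while day <= end:
        compute_and_store_features(day)
        processed.append(day)
        day += timedelta(days=1)
    return processed

processed = backfill(date(2023, 1, 1), date(2023, 1, 7))
```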
Step 4: Model training script
1 → fetches historical (features, targets) from the Feature Store.
2 → trains and evaluates the best ML model possible for this data, e.g. XGBoost's XGBRegressor.
3 → stores the trained model in the Model Registry.
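Here is a minimal sketch of those 3 steps. The Feature Store and Model Registry clients are stand-ins (a hard-coded dataset and a plain dict), and a toy mean-predictor stands in for a real model like `xgboost.XGBRegressor`:

```python
# Training-script sketch: fetch -> train & evaluate -> register.

def fetch_training_data():
    # 1) fetch historical (features, targets) from the Feature Store
    #    (hard-coded stand-in data)
    X = [[1.0], [2.0], [3.0], [4.0]]
    y = [2.0, 4.0, 6.0, 8.0]
    return X, y

class MeanModel:
    """Toy stand-in for a real regressor (e.g. xgboost.XGBRegressor)."""
    def fit(self, X, y):
        self.mean_ = sum(y) / len(y)
        return self
    def predict(self, X):
        return [self.mean_ for _ in X]

def train_and_evaluate(X, y):
    # 2) train and evaluate (here: mean absolute error on the train set)
    model = MeanModel().fit(X, y)
    mae = sum(abs(p - t) for p, t in zip(model.predict(X), y)) / len(y)
    return model, mae

def save_to_registry(model, registry):
    # 3) store the trained model in the Model Registry (dict stand-in)
    registry["demand_model"] = model
    return registry

X, y = fetch_training_data()
model, mae = train_and_evaluate(X, y)
registry = save_to_registry(model, {})
```

In a real project you would split a validation set off the historical data and pick the model with the best validation error before registering it.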
Step 5: Automate execution of the feature script
Create a GitHub action to automatically run the feature script (from step 1) every hour.
GitHub Actions gives you serverless compute to run your code on a schedule. For free.
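A workflow sketch for the hourly run (file and script names are placeholders; adjust to your repo):

```yaml
# .github/workflows/feature_pipeline.yml
name: feature-pipeline
on:
  schedule:
    - cron: '0 * * * *'   # every hour, at minute 0
  workflow_dispatch:       # also allow manual runs
jobs:
  run-feature-script:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.11'
      - run: pip install -r requirements.txt
      - run: python feature_script.py
```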
Beautiful.
Step 6: Create a web app to show model predictions
Streamlit is a powerful Python library to develop and deploy web data apps.
Your app
1 → loads the model from the *Model Registry* and the latest features from the *Feature Store*,
2 → computes model predictions and shows them on a beautiful UI.
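The app's core logic, minus the UI, looks roughly like this. `load_model` and `load_features` are hypothetical stand-ins for your Model Registry / Feature Store clients; in the real Streamlit app you would finish with something like `st.line_chart(predictions)`:

```python
# Sketch of the prediction step the web app runs on each refresh.

class NaiveModel:
    """Stand-in model: predicts 1.1x the last observed value."""
    def predict(self, features):
        return [1.1 * row["last_value"] for row in features]

def load_model():
    # stand-in for fetching the trained model from the Model Registry
    return NaiveModel()

def load_features():
    # stand-in for fetching the latest features from the Feature Store
    return [{"last_value": 100.0}, {"last_value": 250.0}]

model = load_model()
features = load_features()
predictions = model.predict(features)  # what the UI would display
```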
BOOM!
Bonus
You can create another GitHub action to automate the model training script.
Why re-train the model?
Because ML model performance degrades over time, as production data drifts away from the data the model was trained on.
The best way to mitigate this is to regularly re-train the model, like once a week.
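Beyond a fixed weekly schedule, you can also trigger retraining when performance actually drops. A minimal sketch (the 10% tolerance is an arbitrary example, not a recommendation):

```python
# Drift check sketch: retrain when recent error exceeds the error
# measured at training time by more than `tolerance`.

def should_retrain(recent_mae: float, baseline_mae: float,
                   tolerance: float = 0.10) -> bool:
    """True when recent MAE is more than (1 + tolerance) x baseline MAE."""
    return recent_mae > baseline_mae * (1 + tolerance)

verdict_drifted = should_retrain(recent_mae=12.0, baseline_mae=10.0)
verdict_stable = should_retrain(recent_mae=10.5, baseline_mae=10.0)
```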
Wanna level up in ML/MLOps?
Join my e-mail list and get one article **every Saturday morning** → datamachines.xyz/subscribe/
Every week I share real-world Data Science/Machine Learning content.
Follow me @paulabartabajo_ so you do not miss what's coming next.
Wanna help?
Like/Retweet the first tweet below to spread the wisdom.
I used to think the Transformer was the best architecture to build LLMs.
I was wrong. Let me explain.
Don't get me wrong. The Transformer is **the most** revolutionary architectural design in the deep learning space of the last 10 years.
It has
> scaled model size and training budgets,
> extended the effective sequence length our models can process and use, and
> conquered every LLM eval benchmark out there.
Wanna learn to **build ML systems**?
Here are **3 real-world examples** you can build TODAY.
**Why ML systems and not just ML models?**
Because ML models are not enough in real-world ML projects.
They don't create value until you put them to work, by building a
-> Feature pipeline
-> Training pipeline
-> Inference pipeline
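The 3-pipeline architecture in miniature (all names and the toy model are hypothetical; each function stands in for a whole pipeline):

```python
# Feature pipeline:   raw data -> features
# Training pipeline:  features + targets -> trained model
# Inference pipeline: trained model + fresh features -> predictions

def feature_pipeline(raw_values):
    # toy feature: the average of the raw observations
    return [{"avg": sum(raw_values) / len(raw_values)}]

class ScaleModel:
    """Toy model: learns a single scale factor target / feature."""
    def fit(self, features, targets):
        self.scale_ = targets[0] / features[0]["avg"]
        return self
    def predict(self, features):
        return [self.scale_ * f["avg"] for f in features]

def training_pipeline(features, targets):
    return ScaleModel().fit(features, targets)

def inference_pipeline(model, fresh_features):
    return model.predict(fresh_features)

features = feature_pipeline([10.0, 20.0, 30.0])        # avg = 20.0
model = training_pipeline(features, targets=[40.0])    # scale = 2.0
preds = inference_pipeline(model, [{"avg": 25.0}])
```

The point of the split: the feature pipeline runs hourly, the training pipeline runs weekly, and the inference pipeline runs whenever predictions are needed, each on its own schedule and compute.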