Pau Labarta Bajo

@paulabartabajo_

Jul 25 • 6 tweets • 2 min read

Advice for ML beginners💡

GitHub Actions gives you *free* compute (within its free tier) that makes your life easier.

Here are 3 use cases for ML projects ↓

➡️ Continuous Integration and Deployment (CI/CD)

Machine Learning is software engineering. As such, it is crucial you automate:

→ code updates (aka integration), and
→ code releases to your production environment (aka deployment)

➡️ Batch feature pipelines

This is a program that runs on a cron-like schedule, fetches raw data from a data source (e.g. a data warehouse), computes ML features, and saves them to a storage service (e.g. a feature store).

Feature pipelines are present in every ML system.
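Here is a minimal sketch of the three steps above (fetch, compute, save) in plain Python. Everything in it is an assumption for illustration: the function names, the made-up purchase rows, and the in-memory list that stands in for a real feature store.

```python
from datetime import datetime, timezone
from statistics import mean


def fetch_raw_data():
    """Stand-in for a query against a data source (e.g. a data warehouse)."""
    # Made-up rows, for illustration only.
    return [
        {"user_id": 1, "amount": 10.0},
        {"user_id": 1, "amount": 30.0},
        {"user_id": 2, "amount": 5.0},
    ]


def compute_features(rows):
    """Aggregate raw rows into one feature row per user."""
    amounts_by_user = {}
    for row in rows:
        amounts_by_user.setdefault(row["user_id"], []).append(row["amount"])
    computed_at = datetime.now(timezone.utc).isoformat()
    return [
        {
            "user_id": user_id,
            "n_purchases": len(amounts),
            "avg_amount": mean(amounts),
            "computed_at": computed_at,
        }
        for user_id, amounts in sorted(amounts_by_user.items())
    ]


def save_features(features, store):
    """Stand-in for a write to a storage service (e.g. a feature store)."""
    store.extend(features)


def run_feature_pipeline(store):
    """One scheduled run: fetch raw data, compute features, save them."""
    features = compute_features(fetch_raw_data())
    save_features(features, store)
    return features
```

A scheduler (for example a GitHub Actions workflow with a cron trigger) would call `run_feature_pipeline` once per run.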

➡️ Inference pipelines

Batch scoring is one of the most popular ways to generate fresh predictions from an ML model.

A batch inference pipeline fetches recent features and a model artifact, generates predictions, and saves them in a storage layer.
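A batch inference run can be sketched the same way. This is only an illustration: the threshold "model", the made-up feature rows, and the list used as a storage layer are all hypothetical stand-ins, not a real feature store or model format.

```python
def load_recent_features():
    """Stand-in for reading the latest feature rows from storage."""
    # Made-up rows, for illustration only.
    return [{"id": "a", "x": 0.2}, {"id": "b", "x": 0.9}]


def load_model():
    """Stand-in for loading a trained model artifact."""

    def predict(row):
        # Toy rule in place of a trained model.
        return 1 if row["x"] > 0.5 else 0

    return predict


def run_inference_pipeline(prediction_store):
    """One scheduled run: load features and model, predict, save."""
    features = load_recent_features()
    model = load_model()
    predictions = [
        {"id": row["id"], "prediction": model(row)} for row in features
    ]
    prediction_store.extend(predictions)  # stand-in for the storage layer
    return predictions
```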

Wanna become a real-world ML engineer?

Join the Serverless ML Community ↓↓↓
serverless-ml.carrd.co

https://twitter.com/1408789941040058369/status/1683824501954338816

Wanna get more tweets like this?
→ Follow me @paulabartabajo_

Wanna help me spread the word?
→ Like/Retweet the first tweet below ↓↓↓

https://twitter.com/1408789941040058369/status/1683824501954338816

More from @paulabartabajo_

Pau Labarta Bajo

@paulabartabajo_

Jul 26

XGBoost is one of the most effective algorithms for time-series prediction.

But, you need to prepare your data carefully.

Here is a Python library to help you prepare your data ↓

@joaopcnogueira, one of my students from the Real World ML Tutorial, has built 𝘁𝘀𝟮𝗺𝗹, a Python library that lets you transform

- a time series dataset, into
- a training dataset, with features and targets

Enjoy it

And give it a star ⭐ on GitHub ↓
github.com/joaopcnogueira…
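To make the transformation concrete, here is a rough sliding-window sketch in plain Python. It is not the ts2ml API (the function name `make_supervised` and its parameters are hypothetical); it only illustrates the idea of turning a time series into features and targets.

```python
def make_supervised(series, n_lags, horizon=1):
    """Turn a time series into (features, targets) with a sliding window.

    Each feature row holds `n_lags` consecutive values; its target is the
    value `horizon` steps after the end of that window.
    """
    if n_lags < 1 or horizon < 1:
        raise ValueError("n_lags and horizon must be >= 1")

    features, targets = [], []
    last_start = len(series) - n_lags - horizon
    for start in range(last_start + 1):
        features.append(list(series[start:start + n_lags]))
        targets.append(series[start + n_lags + horizon - 1])
    return features, targets
```

With `series=[1, 2, 3, 4, 5]` and `n_lags=2`, the feature rows are `[1, 2]`, `[2, 3]`, `[3, 4]` and the targets are `3`, `4`, `5`.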

Wanna build your first real-world ML system?

Join the Real-World ML Tutorial + Community and get LIFETIME ACCESS to

→ 3 hours of video lectures 🎬
→ Full source code 👨‍💻
→ Discord private community 👨‍👩‍👦

Use code "NINJA" at checkout for a 20% discount

realworldmachinelearning.carrd.co


Pau Labarta Bajo

@paulabartabajo_

Jul 26

3 reasons why your XGBoost model does not work

And 3 ways to solve them
↓↓↓

1️⃣ You are overfitting the training data

This is common in highly non-stationary problems, like cryptocurrency price prediction.

Solution. Use cross-validation and hyperparameter tuning to adjust the model's bias-variance trade-off and get good out-of-sample metrics.
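For time-ordered data, one common way to cross-validate is a walk-forward (expanding window) split, where each fold trains on the past and tests on the block that follows. The sketch below is an assumption about how you might do it, not something the thread prescribes; the function name and parameters are hypothetical. scikit-learn's `TimeSeriesSplit` offers a similar scheme.

```python
def walk_forward_splits(n_samples, n_splits, test_size):
    """Yield (train_indices, test_indices) pairs in time order.

    Every fold trains on all samples before its test block, so the model
    is always evaluated on data that comes after its training data.
    """
    if n_splits < 1 or test_size < 1:
        raise ValueError("n_splits and test_size must be >= 1")
    first_test_start = n_samples - n_splits * test_size
    if first_test_start < 1:
        raise ValueError("not enough samples for the requested splits")

    for i in range(n_splits):
        test_start = first_test_start + i * test_size
        train_idx = list(range(test_start))
        test_idx = list(range(test_start, test_start + test_size))
        yield train_idx, test_idx
```

You would fit one model per fold for each hyperparameter setting and compare the average out-of-sample metric across folds.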

2️⃣ You are missing an essential feature in your dataset

You need more/better features to increase the signal-to-noise ratio in your data.

Solution. Pull more raw features and generate better ones through feature engineering.


Pau Labarta Bajo

@paulabartabajo_

Jul 19

Wanna train more ML models for less money? 💸

3 tips to optimize your ML budget 🧠↓

To build a Machine Learning product you need to spend money on 3 types of services:

→ Computing, like CPUs and GPUs so you can train and deploy your models.
→ Orchestration, to kick off the 3 pipelines of your system
→ Storage, to save features, models, and experiment runs

And the thing is, not all these services cost you the same.

→ Orchestration and storage are not expensive 💸
→ Computing, on the other hand, can get very expensive 💸💸💸💸💸


Pau Labarta Bajo

@paulabartabajo_

Jul 18

The most effective thing you can do to land an ML job is to

- pick a problem you care about
- build an ML solution, and
- release it to the public.

Here is an example to inspire you 🤗↓

The most effective way to learn and showcase your ML skills is to build a 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲 𝗠𝗟 𝗽𝗿𝗼𝗷𝗲𝗰𝘁 and publish

→ the source code on GitHub, and
→ a public working app

Here is an example ↓

𝗣𝗿𝗲𝗱𝗶𝗰𝘁𝗶𝗻𝗴 𝗡𝗕𝗔 𝗴𝗮𝗺𝗲 𝗿𝗲𝘀𝘂𝗹𝘁𝘀 𝘄𝗶𝘁𝗵 𝗠𝗮𝗰𝗵𝗶𝗻𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 🏀

@curiovana has built a complete ML app that

- fetches the list of upcoming NBA matches
- generates useful predictive features
- predicts these games' outcomes.

How did he do it?


Pau Labarta Bajo

@paulabartabajo_

Jul 17

Wanna learn enough git to be a data scientist?

A hands-on tutorial in 10 steps 👩🏽‍💻👨‍💻↓↓↓

#1 Create your project folder and cd into it

#2 Create a README file.

This is the first thing anyone visiting your repository will see.
You better have one. And you better make it pretty.


Pau Labarta Bajo

@paulabartabajo_

Jul 10

Looking for effective ways to learn MLOps?

Forget theory and get your hands on a real-world problem 🧠

Here is a project you can build (for free) using Python 👩🏽‍💻👨‍💻↓↓↓

Let's build an ML service to predict the price of Ethereum (ETH) in the next 1 hour, using Python 🐍 and serverless tools.

You will learn a lot, AND you might even make some money 💰

These are the steps to build this system ↓

Step 1: Feature generation script 🐍

1 → fetches raw data on actual ETH/USD trades from the Kraken API (docs.kraken.com/rest/)

2 → engineers new features (aka model inputs) and targets (aka model outputs) from the raw data

3 → stores these features in the *Feature Store*
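As a rough sketch of step 2, the code below turns raw trades into hourly closing prices, then into rows of lagged prices (features) and the next hour's price (target). It makes several assumptions: trades are simplified to `(unix_timestamp, price)` pairs rather than the actual Kraken response format, every hour has at least one trade, and the function names are hypothetical. Fetching from the API and writing to the feature store are left out.

```python
SECONDS_PER_HOUR = 3600


def hourly_close(trades):
    """Return [(hour_start_ts, last_price_in_that_hour), ...] sorted by hour.

    `trades` is an iterable of (unix_timestamp, price) pairs, a simplified
    stand-in for raw trade records.
    """
    closes = {}
    for ts, price in sorted(trades, key=lambda trade: trade[0]):
        hour_start = int(ts) // SECONDS_PER_HOUR * SECONDS_PER_HOUR
        closes[hour_start] = price  # later trades overwrite earlier ones
    return sorted(closes.items())


def build_rows(closes, n_lags):
    """Build one row per hour: previous `n_lags` closes -> this hour's close.

    Assumes `closes` has no missing hours.
    """
    rows = []
    for i in range(n_lags, len(closes)):
        hour, target = closes[i]
        lags = [price for _, price in closes[i - n_lags:i]]
        rows.append({"hour": hour, "lags": lags, "target": target})
    return rows
```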

