🦉DVC
Apr 20 • 14 tweets • 6 min read
Here's a comprehensive thread on how to build a seamless batch-scoring experience.

You will learn how to:

🤖 Design ML pipelines with @DVCorg

Training with @gitlab and CML

🧪 ML experiments with @Iterativeai Studio

🚢 Deployments with @ApacheAirflow

🧵 [1/6]
🤖 Design #ML pipelines using #DVC! Streamline your process with the following steps:

1️⃣ Data prep

2️⃣ Feature engineering

3️⃣ Model training

4️⃣ Model evaluation

DVC cuts run time by skipping stages whose inputs haven't changed, and tracks changes for a more efficient workflow!

Get started with 'dvc exp run' 🚀

🧵 [2/6]
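
As a rough illustration of the four stages above, here is a minimal Python sketch that assembles them into the `stages` / `cmd` / `deps` / `outs` layout used by a `dvc.yaml` file. The script names and data paths are hypothetical placeholders, not taken from the thread; only the stage order is.

```python
import json

# Stage order from the thread; the identifiers themselves are illustrative.
STAGES = ["data_prep", "feature_engineering", "train", "evaluate"]


def build_pipeline(stages, raw_input="data/raw.csv"):
    """Chain stages so each one depends on the previous stage's output."""
    pipeline = {"stages": {}}
    previous_out = raw_input  # assumed location of the raw dataset
    for name in stages:
        out = f"artifacts/{name}.out"  # hypothetical output path
        pipeline["stages"][name] = {
            "cmd": f"python src/{name}.py",  # hypothetical script
            "deps": [f"src/{name}.py", previous_out],
            "outs": [out],
        }
        previous_out = out
    return pipeline


pipeline = build_pipeline(STAGES)
print(json.dumps(pipeline, indent=2))
```

Because each stage lists the previous stage's output as a dependency, DVC can tell which stages a change affects and rerun only those.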
๐Ÿˆ Training with @gitlab and CML

๐Ÿš€ Set up train jobs using #GitLabCI & #CML! Trigger training jobs on new Merge Requests or commits with [exp] tags.
Achieve 2 goals:

1๏ธโƒฃ Control experiment triggering based on code/params changes

2๏ธโƒฃ Ensure merges into the main branch link to the latest model

Supercharge your workflow with DVC & CML!

๐Ÿงต [3/6]
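
The trigger rule above can be sketched as a small Python check, assuming GitLab's predefined CI variables `CI_PIPELINE_SOURCE` and `CI_COMMIT_MESSAGE`. The function name `should_train` is made up for illustration, and in a real project this condition would more likely live in the CI configuration itself.

```python
import os


def should_train(env):
    """Return True for Merge Request pipelines or commits tagged [exp]."""
    # GitLab sets CI_PIPELINE_SOURCE to "merge_request_event" for MR pipelines.
    is_merge_request = env.get("CI_PIPELINE_SOURCE") == "merge_request_event"
    # The [exp] tag is expected somewhere in the commit message.
    has_exp_tag = "[exp]" in env.get("CI_COMMIT_MESSAGE", "")
    return is_merge_request or has_exp_tag


if __name__ == "__main__":
    # Inside a CI job, the decision is read from the job's environment.
    print("train" if should_train(os.environ) else "skip")
```

Passing the environment in as a plain mapping keeps the rule easy to check locally before wiring it into a CI job.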
🧪 ML experiments with @Iterativeai Studio, with features such as:

✅ Automated experiment launches on code changes

✅ Version tracking for code & artefacts

✅ Easy-to-manage development process

✅ UI for launching new experiments

✅ Team collaboration

#DVC

🧵 [4/6]
🚢 Deployments with @ApacheAirflow

@ApacheAirflow helps schedule and run pipelines (DAGs) for various data engineering and machine learning purposes.

For our use case, we need a BashOperator task named 'run_scoring'.

🧵 [5/6]
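
A minimal sketch of such a DAG, assuming Airflow 2.4 or later. Only the `run_scoring` task name comes from the thread; the DAG id, the daily schedule, the script path, and its `--date` flag are hypothetical.

```python
from datetime import datetime

# Hypothetical path to the scoring script; not taken from the thread.
SCORING_SCRIPT = "src/run_scoring.py"


def scoring_command(script: str = SCORING_SCRIPT) -> str:
    """Build the shell command; Airflow renders {{ ds }} as the run's date."""
    # The --date flag is an assumed argument of the hypothetical script.
    return "python " + script + " --date {{ ds }}"


def build_dag():
    """Create a daily DAG with a single 'run_scoring' BashOperator task."""
    # Imported here so the helper above works without Airflow installed.
    from airflow import DAG
    from airflow.operators.bash import BashOperator

    with DAG(
        dag_id="batch_scoring",  # assumed name
        start_date=datetime(2023, 1, 1),
        schedule="@daily",  # assumed cadence
        catchup=False,  # do not backfill past dates
    ) as dag:
        BashOperator(task_id="run_scoring", bash_command=scoring_command())
    return dag
```

In an actual DAG file, assign the result at module level (`dag = build_dag()`) so the scheduler can discover it.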
Thanks for reading through this post! 🎉

Thanks to @mnrozhkov for creating a follow-through blog on this ❤️

You may check it out here 👇

iterative.ai/blog/automate-…

Congratulations!!!

You just learnt how to:

- Set up a Git repo.

- Connect it to the GitLab repo.

- Use Airflow to create a DAG for a batch-scoring app.

- Use DVC for dataset & model versioning.

- Use CML for experiment automation.

- Set up a production environment for real-time model deployment.
