Alright, kicking off my #DataAISummit talk on Model Monitoring with Apache Spark and @VertaAI in 5 mins! Catch it at this link databricks.com/session_na21/m… or follow along below! (1/n) #mlops #modelmonitoring #verta
At #SparkSummit 2017, we launched ModelDB and kickstarted the experiment management space. Excited to talk see the impact #VertaModelMonitoring will have on #MLOps! (2/n)
ML runs whole businesses today ranging from robo- advisors to fraud detection to Anti-Money Laundering to RPA. (3/n)
AI-ML doesn't always work as expected! FICO scores stopped working in the pandemic and threw a wrench into lending models (4/n)
We see this in the field everyday --whether lost revenue, whether misuse of models or incorrect pricing decisions leading to losses. (5/n)
So what's the goal of #ModelMonitoring? It's fairly simple -- Ensuring model results are consistently of high quality! (6/n)
What's hard about this? (1) Knowing when a model fails, (2) Finding the root cause, and (3) Remediating it. (7/n)
Why is it hard to know if a model has failed? First, ground-truth is very hard to come by, second, ground-truth is often time-shifted, and third, your model is usually only one part of the decision process. (8/n)
Next challenge is jungles. Pipeline jungles. A model has a ton of dependencies, is it the model or the dependencies that are causing the failure? (9/n)
Finally, monitoring is only useful if you can close the loop and take action. (10/n)
So do you need a monitoring tool or can you write ad-hoc jobs? Works for one model, but what about 20 models and 30 datasets? (11/n)
Why are we excited to solve #ModelMonitoring? Because it's a hard problem! (12/n)
Introducing #VertaModelMonitoring: flexible, customizable, and closing the loop (13/n)
How does it work? Ingest stats, ground-truth, and analyze, analyze, analyze (14/n)
Batch and live are no different, just some aggregation magic (and lots of perf optimizations!) (15/n)
My demo setup with Apache Spark ML -- pre-processing and then GBDT! (16/n)
Monitoring is easy: (1) define how you want to profile your data (use built-in profilers or define your own) and compute stats. This could be any statistic (17/n)
(2) Visualize and explore my stats!(18/n)
(3) Define alerts, ideally on every step of your pipeline! (19/n)
But alert fatigue is real. And so root cause analysis is crucial. With Spark ML pipelines (or any pipeline), we can trace back dependencies to find the underlying issue with the model pipeline. (20/n)
In summary! (21/n)
The framework is now available in Community Preview! Check it out here: info.verta.ai/model-monitori…

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Manasi Vartak

Manasi Vartak Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @DataCereal

28 May
Alright, let's live-tweet again! Super excited to live-tweet my colleague, @VertaAI's Cory Johannsen's talk about why AAP is Not the same as ML Monitoring! #MLOps databricks.com/session_na21/w…
What is APM (New Relic, DataDog, Elastic etc. etc.)? Tracking metrics about your system, aggregating, dashboarding, alerting (1/n) Image
APM lets you track health, perf, availability etc. the Golden signals (2/n) Image
Read 18 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(