Santiago Profile picture
Jan 28 8 tweets 2 min read
Can you guess what their biggest struggle is?

I regularly talk to companies using machine learning, from Fortune-500 to the ice cream parlor in the block around the corner.

Surprisingly, building models is not an issue for them.

Wanna guess? ↓
"Don't worry about the model" is what I usually get.

The real struggle? → "What do we do with this Jupyter notebook running the model that we built"?

It's not about building models. It's about making them useful.
Many feel like having a model is the end of the road.

In reality, it is just the beginning.

The fundamental hurdle is understanding what to do with them.
Deploying a machine learning model is challenging.

Some of the things you need to worry about:

• Scalability
• Reliability
• Monitoring
• Availability
• Versioning
• CI/CD/CT

The list goes on and on.
Unfortunately, there are very few people out there with the experience to do this successfully.

And it's tough to make any progress if you can't get your models into production!
I've been following @abacusai for a while.

They build a platform that 10x your ability to go from a notebook to a production system.

This is critical!
An end-to-end machine learning platform is game-changing for our industry.

Abstract the technical details and let the team focus on what matters.

More business, less repeatable, uninteresting, boring technical details.
I'm excited about 2022.

I think we will see massive progress on the MLOps front.

This will accelerate experimentation and adoption like never before.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Santiago

Santiago Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @svpino

Jan 29
When I started with machine learning, I always made the same mistake:

I confused a couple of metrics that look very similar but are entirely different.

Let's fix that for you.

2. When we train a machine learning model, we need to compute how different our predictions are from the expected results.

For example, if we predict a house's price as $150,000, but the correct answer is $200,000, our "error" is $50,000.
3. There are multiple ways we can compute this error, but two common choices are:

• RMSE — Root Mean Squared Error
• MAE — Mean Absolute Error

These have different properties that will shine depending on the problem you want to solve.
Read 15 tweets
Jan 24
I built a machine learning model that predicts whether your car will crash today.

And it's 99% accurate!

Here is the secret: ↓
This thread is the answer to this question.
Before getting into the details, let's jump right into the source code of my model:
Read 10 tweets
Jan 21
Occam's Razor:

Given two solutions with similar characteristics, the simplest and most direct one is the correct answer.

This thread answers the following question:
Option 3 is probably the simplest one to tackle first.

It talks about "the speed of the training process" and relates it to overtraining and overcomplicating results.

A quick training process doesn't necessarily reduce complexity. This option is not correct.
Read 7 tweets
Jan 21
Three deep learning myths:

1. A lot of math is needed
2. A lot of data is needed
3. An expensive computer is needed

If these are preventing you from starting, reconsider.

(Hat tip to the FastAI Course.)
Data Structures and Algorithms are an underrated set of skills for any software professional.

They are definitely very important!

That being said, I don't think they are absolute requirements for deep learning work.

Understanding the math underpinnings of anything you do will definitely open doors for you.

However, stating that you can't do deep learning unless you understand all of the math involved is not a serious statement.

Read 5 tweets
Jan 16
Using more features from your data never comes for free.

Let's talk about dimensionality.

2. Two days ago I asked this question.

Let's now analyze each option starting with Option 3 (probably the easiest one we can discard.)
3. Option 3 states that when we cut down the number of features, we need to "make up the difference" by adding more data.

Removing features reduces the number of dimensions in our data.

It concentrates the samples we have in a lower-dimensional space.
Read 12 tweets
Jan 14
The complexity of turning a Jupyter notebook into a production system is frequently underestimated.

Having a model that performs great on a test set is not the end of the road but just the beginning.

Fortunately, there's something for you here!

2. The productionization of machine learning systems is one of the most critical topics in the industry today.

There's been a lot of progress, and it's getting better, but for the most part, we are just at the beginning of this road.
3. Not only the space is still immature, but it's very fragmented.

Talk to three different teams, and it's very likely they all use different tools, processes, and focus on different aspects of the lifecycle of their systems.
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(