Here is an underrated machine learning technique that will give you important information about your data and model.
Let's talk about learning curves.
Grab your ☕️ and let's do this thing!
🧵👇
Start by creating a model. Something simple. You are still exploring what works and what doesn't, so don't get fancy yet.
We are now going to plot the loss (model error) vs. the training dataset size. This will help us answer the following questions:
▫️ Do we need more data?
▫️ Do we have a bias problem?
▫️ Do we have a variance problem?
▫️ What's the ideal picture?
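Before going question by question, here's a minimal sketch of how you could compute these curves with scikit-learn's `learning_curve`. The logistic regression and the synthetic dataset are just stand-ins; swap in your own model and data:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import learning_curve

# Toy dataset standing in for your real data.
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

# Evaluate the model at increasing training set sizes, with cross-validation.
sizes, train_scores, val_scores = learning_curve(
    LogisticRegression(max_iter=1000),
    X, y,
    train_sizes=np.linspace(0.1, 1.0, 5),
    cv=5,
    scoring="neg_log_loss",  # negated score, so flip the sign to get the loss
)

train_loss = -train_scores.mean(axis=1)
val_loss = -val_scores.mean(axis=1)

for n, tr, va in zip(sizes, train_loss, val_loss):
    print(f"{n:4d} samples -> train loss {tr:.3f}, val loss {va:.3f}")
```

Plot `train_loss` and `val_loss` against `sizes` (matplotlib works fine) and you have the two curves we'll be reading below.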
▫️ Do we need more data?
As you increase the training size, if both curves converge towards each other and stop improving, you don't need more data.
If there's room for them to continue closing the gap, then more data should help.
This one should be self-explanatory: if our errors have stopped improving after adding more data, it's unlikely that more of it will do any good.
But if we still see the loss improving, more data should help push it even lower.
▫️ Do we have a bias problem?
If the training error is too high, we have a high bias problem.
Also, if the validation error is too high, we have a problem with the bias: either low or high bias.
A high bias indicates that our model is not powerful enough to learn the data. This is why our training error is high.
If the training error is low, that's a good thing: our model can fit the data.
A high validation error indicates that our model is not performing well on the validation data. We probably have a bias problem.
To know in which direction, we need to look at the training error:
▫️ Low training error: low bias
▫️ High training error: high bias
▫️ Do we have a variance problem?
If there's a big gap between the training error and the validation error, we have high variance.
A very low training error paired with a much higher validation error also points to high variance.
High variance indicates that the model fits the training data too well (probably memorizing it.)
When testing with the validation set, we see the big gap: the model did great with the training set, but sucked with the validation set.
A couple more important points:
▫️ High bias + low variance: we are underfitting.
▫️ High variance + low bias: we are overfitting.
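The rules above can be turned into a tiny helper. This is just a sketch: `tolerance` (what counts as a "big gap") and `target` (what counts as "too high" an error) are made-up thresholds you'd tune for your own metric.

```python
def diagnose(train_error, val_error, tolerance=0.05, target=0.10):
    """Rough read of a learning curve's final errors.

    tolerance and target are illustrative thresholds, not universal values.
    """
    high_bias = train_error > target                    # model can't fit the data
    high_variance = (val_error - train_error) > tolerance  # big train/val gap

    if high_bias and high_variance:
        return "both high bias and high variance"
    if high_bias:
        return "underfitting (high bias)"
    if high_variance:
        return "overfitting (high variance)"
    return "looking good (low bias, low variance)"

print(diagnose(0.30, 0.32))  # high training error -> underfitting
print(diagnose(0.02, 0.25))  # big gap -> overfitting
print(diagnose(0.05, 0.07))  # both low -> looking good
```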
▫️ What's the ideal picture?
These are the curves you should be aiming for.
Both the training and validation errors converge to a low value.
Here is another chart that does an excellent job at explaining bias and variance.
You want low bias + low variance, but keep in mind there's always a tradeoff between them: you need to find a good enough balance for your specific use case.
If these threads help, then make sure to follow me, and you won't be disappointed.
And for even more in-depth machine learning stories, make sure you head over to digest.underfitted.io. The first issue is coming this Friday!
🐍
Here is a quick guide that will help you deal with overfitting and underfitting:
GPT-4o is slower than Flash, more expensive, chatty, and very stubborn (it doesn't like to stick to my prompts).
Next week, I'll post a step-by-step video on how to build this.
The first request takes longer (warming up), but things work faster from that point on.
A few opportunities to improve this:
1. Stream answers from the model (instead of waiting for the full answer.)
2. Add the ability to interrupt the assistant.
3. Run Whisper on a GPU.
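Item 1 is mostly a plumbing change: consume the answer chunk by chunk instead of waiting for the whole thing. A minimal sketch of the pattern, where `fake_model_stream` is a hypothetical stand-in for whatever streaming API the model exposes:

```python
import time

def fake_model_stream(prompt):
    """Stand-in for a streaming LLM API: yields the answer in chunks."""
    for chunk in ["Hello", ", ", "world", "!"]:
        time.sleep(0.01)  # simulate per-chunk network latency
        yield chunk

def answer(prompt):
    """Print each chunk as soon as it arrives, then return the full text."""
    pieces = []
    for chunk in fake_model_stream(prompt):
        print(chunk, end="", flush=True)  # user sees text immediately
        pieces.append(chunk)
    print()
    return "".join(pieces)

answer("hi")
```

The same loop is also where you'd hook in item 2: check an "interrupt" flag between chunks and break out early.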
Unfortunately, no local model supports text+images (as far as I know), so I'm stuck running online models.
The TTS API (synthesizing text to audio) can also be replaced by a local version. I tried, but the available voices suck (too robotic), so I kept OpenAI's.
I’m so sorry for anyone who bought the rabbit r1.
It’s not just that the product is non-functional (as we learned from all the reviews), the real problem is that the whole thing seems to be a lie.
None of what they pitched exists or functions the way they said.
They sold the world on a Large Action Model (LAM), an intelligent AI model that would understand applications and execute the actions requested by the user.
In reality, they are using Playwright, a web automation tool.
No AI. Just dumb, click-around, hard-coded scripts.
Their foundational AI model is just ChatGPT + scripts.
Rabbit’s founder lied in their marketing videos, during interviews, when he presented the product, and on Discord when answering questions from early supporters.
1. Mojo 🔥 went open-source
2. Claude 3 beats GPT-4
3. $100B supercomputer from MSFT and OpenAI
4. Andrew Ng and Harrison Chase discussed AI Agents
5. Karpathy talked about the future of AI
...
And more.
Here is everything that will keep you up at night:
Mojo 🔥, the programming language that turns Python into a beast, went open-source.
This is a huge step and great news for the Python and AI communities!
With Mojo 🔥 you can write Python code or scale all the way down to bare-metal code. It's fast!