Santiago Profile picture
I tell stories about technology and teach hard-core Machine Learning at YouTube:
fly51fly Profile picture ☀️ Leon-Gerard Vandenberg 🇳🇱🇨🇦🇦🇺 Math+e/acc Profile picture Zagani Ponda 🌬️侘寂 🌭 Profile picture Red Redux V Profile picture Dr. Hansom Davidoff Profile picture 89 subscribed
Mar 31 10 tweets 4 min read
What a week, huh?

1. Mojo 🔥 went open-source
2. Claude 3 beats GPT-4
3. $100B supercomputer from MSFT and OpenAI
4. Andrew Ng and Harrison Chase discussed AI Agents
5. Karpathy talked about the future of AI

And more.

Here is everything that will keep you up at night: Mojo 🔥, the programming language that turns Python into a beast, went open-source.

This is a huge step and great news for the Python and AI communities!

With Mojo 🔥 you can write Python code or scale all the way down to metal code. It's fast!…
Mar 13 14 tweets 4 min read
The batch size is one of the most important parameters when training neural networks.

Here is everything you need to know about the batch size:

1 of 14 Image I trained two neural networks.

Same architecture, loss, optimizer, learning rate, momentum, epochs, and training data. Almost everything is the same.

Here is a plot of their losses.

Can you guess what the only difference is?

2 of 14 Image
Jan 5 5 tweets 2 min read
I had an amazing machine learning professor.

The first thing I learned from him was how to interpret learning curves. (Probably one of the best skills I built and refined over the years.)

Let me show you 4 pictures and you'll see how this process flows:

1/5 Image I trained a neural network. A simple one.

I plotted the model's training loss. As you can see, it's too high.

This network is underfitting. It's not learning.

I need to make the model larger.

2/5 Image
Dec 21, 2023 4 tweets 2 min read
AI will be one of the most crucial skills for the next 20 years.

If I were starting today, I'd learn these:

• Python
• LLMs
• Retrieval Augmented Generation (RAG)

Here are 40+ free lessons and practical projects on building advanced RAG applications for production:

This is one of the most comprehensive courses you'll find. It covers all of LangChain and LlamaIndex.

And it's 100% FREE!

@activeloopai, @towards_AI, and @intel Disruptor collaborated with @llama_index to develop it.

Here is the link:

Oct 25, 2023 8 tweets 4 min read
The best real-life Machine Learning program out there:

"I have seen hundreds of courses; this is the best material and depth of knowledge I've seen."

That's what a professional Software Engineer finishing my program said during class. This is the real deal.

I teach a hard-core live class. It's the best program to learn about building production Machine Learning systems.

But it's not a $9.99 online course. It's not about videos or a bunch of tutorials you can read.

This program is different.

It's 14 hours of live sessions where you interact with me, like in any other classroom. It's tough, with 30 quizzes and 30 coding assignments.

Online courses can't compete with that.

I'll teach you pragmatic Machine Learning for Engineers. This is the type of knowledge every company wants to have.

The program's next iteration (Cohort #8) starts on November 6th. The following (Cohort #9) on December 4th.

It will be different from any other class you've ever taken. It will be tough. It will be fun. It's the closest thing to sitting in a classroom.

And for the first time, the next iteration includes an additional 9 hours of pre-recorded materials to help you as much as possible!

You'll learn about Machine Learning in the real world. You'll learn to train, tune, evaluate, register, deploy, and monitor models. You'll learn how to build a system that continually learns and how to test it in production.

You'll get unlimited access to me and the entire community. I'll help you through the course, answer your questions, and help with your code.

You get lifetime access to all past and future sessions. You get access to every course I've created for free. You get access to recordings, job offers, and many people doing the job you want to do.

No monthly payments. Ever.

The link to join is in the attached image and in the following tweet.
Image The link to join the program:
The cost to join is $385.

November and December are the last two iterations remaining at that price. The cost will go up starting in January 2024.

Today, there are around 800 professionals in the
Oct 2, 2023 8 tweets 3 min read
AI is changing how we build software.

A few weeks ago, I talked about using AI for code reviews. Many dismissed the idea, saying AI can't help beyond trivial suggestions.

You are wrong.

Here are a few examples of what you can do with @CodiumAI's open-source pull request agent: Image Here, the agent generated the description of a pull request.

It looks at every commit and file involved and summarizes what's happening automatically.

You can do this by using the "/describe" command. Image
Sep 21, 2023 5 tweets 2 min read
There is a considerable risk to start building with Large Language Models.

Prompt lock-in is a big issue, and I'm afraid many people will find out about it the hard way.

There's no cross-compatibility for many of your prompts. If you change your model, your prompts will stop working.

Here are two examples:

First, an application where an LLM generates marketing copy for a site. Here, you expect open-ended responses. A prompt like that will work across different models with little or no modifications. Use cases like this have high prompt portability.

Second, an LLM that interprets and classifies a customer request. This use case requires terse and structured responses. These prompts are model-dependent and have low portability.

Here is what makes matters worse:

The more complex the responses, the more time you need writing prompts and the less portable they are. In other words, the more you invest, the more you'll lock your implementation to one specific model.

What's the solution?

First, be careful how much you invest in writing prompts for a model that could stop working any day. Having to migrate to a different model will come at a steep cost.

Second, it's too early to understand how these models will evolve. Don't outsource too much to a Large Language Model. The more you do, the more significant the risk.

If you are using an LLM as part of a product, how are you protecting against this? The biggest issue is not whether the model has the capacity to answer a prompt.

The problem is about the variability of that answer. For example, this is an issue when you require a strictly formatted response.

You can solve a problem using GPT-3.5, GPT-4, and Llama 2. But, in many cases, you'll need different prompts for every one of these models.

That's the issue.

Sep 13, 2023 4 tweets 3 min read
I started freelancing at $8/hour.

It took a while, but I made $600,000 in Upwork alone. The last time I used the platform, I got paid $200/hr.

I started by building web applications. At some point, I started focusing on Machine Learning systems.

While on Upwork, I learned how to find jobs and get hired. I became a Top Rated Plus freelancer with 100% Job Success.

I've never met anyone with a closing rate higher than mine. I sent 79 proposals and closed 19 of them. If you don't think a 24% closing rate is high, you don't know Upwork.

A few months ago, I recorded a 1-hour video with everything I know about Upwork:

• How to structure your profile so clients can't ignore you.

• How to find the projects that everyone else misses.

• How to get hired, regardless of how many people apply.

• How to structure your proposals and cover letter.

I've been selling this course for $40, but today, I'm running an experiment:

The next 100 people who buy the course can do it for 50% off.

That's $20!

$20 to learn how to crack one of the most profitable online marketplaces for freelancers. I'm biased, but it sounds like a steal to me.

And I'll go one step further:

If you take my course and don't find it valuable, let me know, and I'll refund you. No questions asked.

Here is the link with the discount:

Remember: Only 100 copies will go for $20. After that, the course goes back to $40.

Whenever I post about this, people ask me to prove I'm not lying about my $600,000 earnings. It's a fair ask, so here is my Upwork profile:

To see my profile, log into the platform before.

Hope I can help you break free from the rat race!
A screenshot of my Upwork profile. Somebody asks a valid question in the replies:

Why would I sell this for $20 when I'm increasing competition for myself on the platform?

There are two reasons:

First, I'm not planning to use the platform anymore. I'm done with freelancing and selling my time for money. You could say, "I'm retired."

Second, freelancing is not a zero-sum game. More capable freelancers will lead to more work for everyone else, not less.

Here is a simple way to think about it:

I focused on taking the Machine Learning models to production using Amazon SageMaker.

I engaged with many clients who wanted to work with me but were too early. They needed 6 - 12 months to focus on other areas before being ready.

I would have benefited from more data engineers and data scientists helping these companies become ready for me. More freelancers would have been good for my business!
Jul 24, 2023 4 tweets 2 min read
Nothing beats FREE education!

Here is a free, 1-week cohort that will teach you how to build AI products using OpenAI.

It starts on August 14, and you can apply right now!

Here are the details you want to know: Image This cohort will teach you how to use OpenAI's API and ChatGPT to build an application from scratch.

It's completely free.

You can apply here: .

This will be a hands-on, technical course, and you should be familiar with Python to…
Jul 20, 2023 4 tweets 2 min read
Yes, GPT-4 seems to be getting worse.

But now we have new information. And well, it's complicated.

Yesterday, I posted about a study showing that GPT-4 success rate deciding whether a number is prime went from 97.6% in March to 2.4% in June.

The report also showed how the……
Is GPT-4 getting worse? Check the following post for more information about the reason we misinterpreted the original study:

@sayashk and @random_walker did an excellent job breaking down the original findings and ran the experiment that shows that GPT-4 was never good at………
Jul 19, 2023 9 tweets 3 min read
GPT-4 is getting worse over time, not better.

Many people have reported noticing a significant degradation in the quality of the model responses, but so far, it was all anecdotal.

But now we know.

At least one study shows how the June version of GPT-4 is objectively worse than……
Accuracy comparison between the March version of GPT-4 with the June version on the problem of determining whether a number is prime. In March, GPT-4 solved 97.6% of problems accurately, while in June, it solved only 2.4% of the problems. Here is the original paper:

And you can reproduce the results using this Google Colab:……
Jul 15, 2023 4 tweets 2 min read
Photography will never be the same.

In 10 minutes, you can turn your photo gallery into unlimited, amazing pictures. For free!

How much imagination do you have? Image Follow these steps to generate your photos:

1. Find a few photos of you. The more, the merrier.
2. Go to and get an API KEY.
3. Run the code in the notebook below (Upload your photos first.)

Here is the code:…
Jul 11, 2023 8 tweets 3 min read
How do you think companies are training their Large Language Models? Where do you think the data come from?

Web scraping.

This is one of the most valuable skills you can learn.

Here is how it works and how you can learn it for free: The Web Data Masterclass is a collection of videos about web data and how to collect it:

You'll find tutorials and how-tos from leading data scientists and engineers like @MariyaSha888, @ykilcher, @Jeffdelaney23, and @kunalstwt.

And it's free!…
Jul 9, 2023 4 tweets 2 min read
This is the unfortunate state of AI shitfluencing.

People with nothing to add to the conversation and zero originality pump content like this to farm followers.

I'm sad for everyone who believes them. I normally leave these people, but I’m not going to put up with lies and exaggerations that prey on people who don’t know better.

I’ve had more than a few conversations with students that want to quit or never learn programming because they read that AI killed the practice.……
Jul 7, 2023 4 tweets 2 min read
You can now fine-tune an LLM without writing a single line of code!

A breakthrough in the open-source LLM space that can increase the speed of AI development and adoption by an order of magnitude.

Let me start from the beginning:

A Large Language Model comes out of the factory……
Here is a link to an article with a step-by-step demo of fine-tuning a model without writing any code: .

Thanks to @monsterapis for partnering with me on this…
Jul 5, 2023 7 tweets 2 min read
Another deep learning breakthrough:

Deep TDA, a new algorithm using self-supervised learning, overcomes the limitations of traditional dimensionality reduction algorithms.

t-SNE and UMAP have long been the favorites. Deep TDA might change that forever.

Here are the details: Dimensionality reduction algorithms like t-SNE and UMAP have been around for a long time and are essential to analyze complex data.

Specifically, t-SNE is one of the most popular algorithms I've seen used in the industry.

Hinton and van der Maaten developed it in 2008.
Jun 28, 2023 4 tweets 2 min read
Every site in the world will have one of these:

I built a custom virtual agent that knows everything about my course. You can chat with it and ask any questions you want.

It took me 10 minutes, and I didn't write a single line of code!

Here is how you can do the same: You can try my application here: .

Ask any questions about my course. Some examples:

• What's the cost?
• How many sessions are there?
• What happens in Session 5?
• Who is the program for?

Here is how I built it:…
Jun 27, 2023 9 tweets 2 min read
While everyone is looking at Large Language Models, Object Detection is one of the most useful applications of Machine Learning.

Here is a 2-minute introduction to Object Detection: Object Detection helps us find objects in pictures.

We can do that by training a Machine Learning model with lots of example pictures until it can spot objects by itself.

There are two main ways a computer detects objects:
Jun 23, 2023 7 tweets 3 min read
I have 50,000 rows of data.

How long does cleaning, transforming, and exploring this information take?

1 billion people worldwide use spreadsheets. Some are fast, but most will need days to do it.

AI can do this in seconds. Here is how: I'm using @AkkioHQ.

They built one of the few no-code tools I've seen that feel like a game-changer for data manipulation.

If you want to follow along, create a free account here and upload your dataset. .

Let me show you 3…
Jun 22, 2023 5 tweets 2 min read
This new library will 2x your Machine Learning workflow.

It's open-source, and in 30 seconds, it'll find problems with your model you didn't know you had.

I've tried doing this by hand. It takes weeks.

Read on for the details: The library is Giskard.

It's fully open-source. Star the repository here: .

Attached you can see a report from Giskard.

With a few lines of code, you can get information that would take a lot of work to produce manually.…
Jun 21, 2023 5 tweets 2 min read
I just spoke to myself, and it was weird.

You can now clone your voice using a few seconds of audio, give the robot personality, and deploy a clone of yourself.

Here is a step-by-step process on how to do this in less than two minutes: I'm using @myshell_ai:

1. Record 60 seconds of audio using your phone
2. Focus on sound quality. Avoid background noises
3. Go to
4. Click on Create a Robot and give it a name
5. Upload your audio. Click on Clone

It will take a few seconds to complete.