Democratizing knowledge one keystroke at a time.
PhD in NLP, full-time professor, CS Department @UdeLaHabana.
Co-founder @syalia_srl.
Sep 20 β’ 6 tweets β’ 2 min read
LLMs cannot reason.
Despite their impressive capabilities, all LLMs, including OpenAI o1, are still fundamentally limited by design constraints that make them incapable of true, open-ended reasoning.
Let's break it down. 𧡠(1/5)
Reason 1: Stochastic Sampling.
LLMs rely on probabilities to pick the next token. Even when you fix the temperature, randomness is still built into the language modeling paradigm.
But logic is anything but random.
Jul 26, 2023 β’ 14 tweets β’ 3 min read
What is Machine Learning?
Here's a 3 min intuitive introduction explaining why this is the most powerful paradigm shift for conventional software development.
No math, no code, just intuitions. Let's dive in.
Conventional software is about automating stuff.
A client comes up with a problem, and to solve it, as a software developer, you must understand very precisely how a human would do it.
Then you can proceed to automate that process.
Jul 22, 2022 β’ 20 tweets β’ 4 min read
Clustering is a process for discovering relationships between objects, by placing them in different groups according to how similar they are with each other.
βGiven a set of objects, is there a natural, unbiased way to cluster them?
Meet the ugly duckling theorem.
𧡠1 of 20
Say we have three objects, two swans π¦’π¦’ and an ugly duckling π¦.
Obviously, the natural way to cluster them is by placing the two swans together and the duckling in a different group, right?
Well, it depends on which features you choose to look into.
2 of 20
Jun 29, 2022 β’ 24 tweets β’ 4 min read
I've spent the last couple of years disrupting traditional software companies with machine learning and data science ideas directly out of my group's core research.
I've found that most issues arise from three critical areas.
Here's what I've learned...
𧡠1/24
Most of the obstacles I've seen can be grouped in one of the following three categories:
1οΈβ£ the language
2οΈβ£ the development process
3οΈβ£ the expected results
Let's tackle them one by one.
2/24
Apr 12, 2022 β’ 12 tweets β’ 6 min read
AutoML is a growing subfield of machine learning, that aims to automate some of the most boring and time-consuming parts of designing, training, and deploying a machine learning pipeline.
Here are 10 open source AutoML tools you can start using today:π
βοΈauto-sklearn
Probably the most popular AutoML system, it sits on top of everyone's favourite ML framework, scikit-learn, and gives you a black-box AutoML wrapper that abstracts away most of scikit-learn's estimators.
It's probably the most important theoretical question in computer science, and it sounds weirdly abstract.
But deep down, it has a very intuitive explanation.
If you have heard of this and want to learn a bit more, read on...
π§΅π
Computer Science is all about finding clever ways to solve difficult problems.
We have found clever algorithms for a bunch of them: sorting stuff, finding shortest paths, solving equations, simulating physics...
But some problems seem to be way too hard π
Oct 19, 2021 β’ 6 tweets β’ 3 min read
If you're looking for an easy way to introduce more machine into your products and services, AutoML is a good bet.
β¨ Here's a short (and incomplete) list of open source and commercial AutoML systems you can start using today:
A drop-in replacement for scikit-learn that can train classifiers or regressors automatically. Based on Bayesian Optimization, and production-ready.
Oct 17, 2021 β’ 7 tweets β’ 2 min read
Hi π I'm Alejandro from Havana, Cuba π¨πΊ
By day, I'm a CS college professor and AI researcher. By night, I come to Twitter to explain things as simple as possible, but not simplistically.
β¨ Follow me if you're interested in technical and philosophical aspects of AI and ML.
𧡠I like to write threads, once or twice a week, on topics related with AI and CS in general.
βοΈ I also like to go deeper on some topics with short essays and technical guides. You can check them all at apiad.net.
Here's my Twitter schedule: π
Oct 3, 2021 β’ 7 tweets β’ 2 min read
Hey folks π!
π Happy Sunday to everyone! How about we make a round of presentations? Let's meet and follow as many cool people as possible today!
Tell me who you are and why should we follow you. I'll start π
π I'm Alejandro from Havana, Cuba π¨πΊ
By day, I'm a CS college professor and AI researcher. On the nights, I enjoy explaining things as simple as possible, but not simplistically.
β¨ Follow me if you're interested in AI, machine learning, and computer science in general.
Oct 1, 2021 β’ 25 tweets β’ 5 min read
Solving a machine learning problem requires more than choosing the right model.
One of he most challenging and possibly critical design decisions in any ML process is figuring out which evaluation metrics to use.
Let's talk about choosing the right metric in ML.
π§΅π
βοΈ This is the last of 5οΈβ£ threads I shared with you this week on foundational topics in machine learning.
If you want a recap, or haven't seen the rest, check out the pinned tweet @AlejandroPiad.
Sep 28, 2021 β’ 18 tweets β’ 3 min read
Hey folks π!
This is the second thread in a series on the foundations of Machine Learning.
β Today, I want to start discussing the different types of ML flavours we can find.
This is a very high-level overview of the different ML paradigms.
π§΅π 1/18
Last time we talked about how Machine Learning works.
π Basically, it's about having some source of experience E for solving a given task T, that allows us to find a program P which is (hopefully) optimal w.r.t. some metric M.
Sep 27, 2021 β’ 22 tweets β’ 4 min read
Hey folks π!
This is first thread in a Twitter series about the foundations of Machine Learning. To begin, I want to answer this simple question.
β What is Machine Learning?
This is my preferred way of explaining it.
π§΅π 1/22
Machine Learning is a computational approach to problem-solving with four key ingredients:
1οΈβ£ A task to solve T
2οΈβ£ A performance metric M
3οΈβ£ A computer program P
4οΈβ£ A source of experience E
Sep 26, 2021 β’ 7 tweets β’ 2 min read
Hey folks π!
π Happy Sunday to everyone! How about we make a round of presentations? Let's meet and follow as many cool people as possible today!
Tell me who you are and why should we follow you. I'll start π
I'm Alejandro π¨πΊ. I'm passionate about explaining things as simple as possible, but not simplistically.
π I teach programming, compilers, and AI in the computer science major at the University of Havana.
Sep 24, 2021 β’ 8 tweets β’ 2 min read
For most of our history on Earth, human-to-human communication has been ephemeral.
We didn't need to care too much for what we said, as most of it wasn't supposed to be remembered.
This has changed in the last 30 years.
We now have, for the first time ever, the technology to store every one of our thoughts forever.
Whatever we say in public can be recalled years later and used to support (or contradict) any claims about ourselves, our past, our character.
Sep 23, 2021 β’ 22 tweets β’ 5 min read
Hey folks, today is a Theory Thursday π§!
The perfect excuse to share with you random bits of theory from some dark corner of Computer Science and make it as beginner-friendly as possible.
Today I want to talk about *Algorithmic Complexity*.
π§΅π 1/22
To get started, take a look at the following code. How long do you think it will take to run it?
Let's make that question more precise. How long do you think it will take to run it in the worst-case scenario?
π 2/22
Sep 22, 2021 β’ 19 tweets β’ 4 min read
Grad school is hard. You need to be both open to explore new and crazy ideas and able to laser-focus on one of them at a time.
These are three healthy research habits I wish I had known when I started my PhD.
π§΅π 1/19
Grad school is one of the most intense moments in a researcher's career.
You have to go through a cycle of ideation, experimentation, and writing, as fast as you can, multiple times during those few years, which risks burning you out.
π 2/19
Sep 21, 2021 β’ 16 tweets β’ 4 min read
Hey folks, today is a Technical Tuesday π€!
Let's talk about practical technologies that you can use today to improve your existing workflows.
In this thread I will tell you about *AutoML*
π§΅π 1/16
AutoML stands for *Automated Machine Learning*.
It encompasses a bunch of technologies and paradigms to gradually automate the process of creating machine learning solutions.
π‘ AutoML is about raising the abstraction level in ML and reducing the grunt work.
π 2/16
Sep 20, 2021 β’ 11 tweets β’ 2 min read
Hey folks, today is a Mindblowing Monday π€―!
A day to share with you amazing things from every corner of Computer Science.
Today I want to talk about Generative Adversarial Networks.
π§΅π 1/11
π¬ But let's begin with some eye candy.
Take a look at this mind-blowing 2-minute video and, if you like it, then read on, I'll tell you a couple of things about it...
<>
π 2/11
Jul 16, 2021 β’ 7 tweets β’ 2 min read
Most of the decisions that we remember making are which-decisions.
Which-decisions are the ones we have to make explicitely, when given two or more options, and it is impossible not to choose.
Which recipe to cook, which brand of phone to buy, which career to study.
π𧡠1/7
Their explicit nature forces us to think and weigh their implications consciously before choosing.
We can make the wrong choice, sure. But, in hindsight, more often than not we tell ourselves that we chose the best we could with the information we had.
π 2/7
Jul 14, 2021 β’ 30 tweets β’ 5 min read
The literature review is one of the most important, but often the most dread boring part of any research. I know, I've been there too.
In this thread, I will show you some techniques to make this process easier, and maybe even a little fun.
π𧡠1/30
β οΈ Disclaimer: this advice is based on personal experience from me and my close colleagues. I work in Computer Science, specifically in machine learning.
Every field is different, but I believe this advice is useful for anyone at least in quantitative fields.
π 2/30
Jun 30, 2021 β’ 14 tweets β’ 3 min read
Convincing an audience to care for your idea, project, product, or business requires being persuasive when writing and talking.
In this thread you'll learn about the VSN-C framework, a simple way to structure a talk or writen piece for maximum persuasion.
π§΅π 1/14
Patrick Winston, legendary professor of AI at MIT, was a marvelous lecturer and communicator.
Part of his magic recipe was the VSN-C framework, a way of structuring an exposition or talk for maximizing the chance of convincing your audience.