Josep Ferrer
Jul 28, 2024 • 10 tweets • 3 min read
The Transformer architecture, clearly explained 👇🏻
Today I'm starting a new series of threads to simplify the concept of Transformers and what's behind the natural-language abilities of LLMs.

Let's start with the basics of the Transformer architecture:

The encoder/decoder concept. 🧠✨
1๏ธโƒฃ ๐—ช๐—›๐—”๐—ง ๐—œ๐—ฆ ๐—” ๐—ง๐—ฅ๐—”๐—ก๐—ฆ๐—™๐—ข๐—ฅ๐— ๐—˜๐—ฅ?
A Transformer is a neural network that excels at understanding the context of sequential data and generating new data from it.

They are the first to rely solely on self-attention, without using RNNs or convolution. Image
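To make "self-attention" concrete, here's a minimal pure-Python sketch of scaled dot-product attention over toy 2-d token vectors. In a real Transformer, Q, K, and V come from learned projection matrices and the vectors have hundreds of dimensions; this just shows the mechanism.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(Q, K, V):
    """Scaled dot-product self-attention.

    Q, K, V are lists of equal-length vectors (one per token).
    Each output vector is a weighted mix of ALL value vectors,
    so every token's representation reflects the whole sequence.
    """
    d = len(K[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in K]
        weights = softmax(scores)  # attention weights, sum to 1
        out.append([sum(w * v[i] for w, v in zip(weights, V))
                    for i in range(len(V[0]))])
    return out

# three toy token embeddings; here Q = K = V for simplicity
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context_aware = self_attention(tokens, tokens, tokens)
```

No recurrence, no convolution: every token attends to every other token in one shot, which is exactly the design choice the thread highlights.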
2๏ธโƒฃ ๐—ง๐—ฅ๐—”๐—ก๐—ฆ๐—™๐—ข๐—ฅ๐— ๐—˜๐—ฅ ๐—”๐—ฆ ๐—” ๐—•๐—Ÿ๐—”๐—–๐—ž ๐—•๐—ข๐—ซ
Imagine a Transformer for language translation as a BLACK BOX. ๐ŸŽฉ
โ€ข Input: A sentence in one language.
โ€ข Output: Its translation.

But what happens inside this black box? Let's find out! ๐Ÿ” Image
3๏ธโƒฃ ๐—˜๐—ก๐—–๐—ข๐——๐—˜๐—ฅ/๐——๐—˜๐—–๐—ข๐——๐—˜๐—ฅ architecture
โ€ข Input: Spanish sentence ยฟDe quiรฉn es?
โ€ข Encoder: Transforms it into a structured format capturing its essence.
โ€ข Decoder: Receives this encoded data and generates the translation.
โ€ข Output: The translated sentence: Whose is it? Image
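The encode-then-decode flow above can be sketched as a toy pipeline. Both functions here are hypothetical stand-ins (a real encoder outputs one context-aware vector per token, and a real decoder predicts tokens with a trained network); the point is the shape of the loop, where the decoder generates one token at a time while consulting the encoder's output.

```python
def encode(source_tokens):
    """Stand-in encoder: maps input tokens to a 'structured format'.
    Here we just tag each token so the data flow is visible."""
    return [("ctx", tok) for tok in source_tokens]

def decode(memory, next_token):
    """Stand-in decoder: generates output autoregressively,
    consulting the encoder's memory at every step."""
    output = []
    token = "<start>"
    while token != "<end>":
        token = next_token(memory, output)  # next-token prediction
        if token != "<end>":
            output.append(token)
    return output

# hypothetical next-token rule standing in for a trained decoder
TRANSLATION = ["Whose", "is", "it?"]
def toy_step(memory, so_far):
    return TRANSLATION[len(so_far)] if len(so_far) < len(TRANSLATION) else "<end>"

memory = encode(["¿De", "quién", "es?"])
print(decode(memory, toy_step))  # → ['Whose', 'is', 'it?']
```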
4๏ธโƒฃ ๐—ง๐—›๐—˜ ๐—”๐—ฅ๐—–๐—›๐—œ๐—ง๐—˜๐—–๐—ง๐—จ๐—ฅ๐—˜ BEHIND THE TRANSFORMERS
Each encoder and decoder is made up of layers. Here's how they work:
โ€ข Encoders: Process the input sequentially, layer by layer.
โ€ข Decoders: Take the encoded data and generate the output step by step. Image
Both use self-attention and feed-forward neural networks, enabling the generation of natural language.
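The layer-by-layer structure can be sketched in a few lines. This is a deliberately toy version: `mix` averages positions as a stand-in for real learned self-attention, `feed_forward` uses scalar weights instead of learned matrices, and layer normalization is omitted for brevity. What it does show faithfully is the two sub-layers per encoder layer, each wrapped in a residual (add) connection, stacked N times (6 in the original paper).

```python
def feed_forward(x, w1=2.0, w2=0.5):
    """Toy position-wise feed-forward net: expand, ReLU, project back.
    Real layers use two learned weight matrices; scalars stand in here."""
    hidden = [max(0.0, xi * w1) for xi in x]  # ReLU non-linearity
    return [hi * w2 for hi in hidden]

def mix(vectors):
    """Stand-in for self-attention: blend every position into each one,
    so each token's vector picks up context from the whole sequence."""
    n = len(vectors)
    avg = [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]
    return [avg for _ in vectors]

def encoder_layer(xs):
    """One encoder layer: self-attention sub-layer, then feed-forward,
    each with a residual (add) connection."""
    attended = mix(xs)
    xs = [[a + b for a, b in zip(x, y)] for x, y in zip(xs, attended)]
    return [[a + b for a, b in zip(x, feed_forward(x))] for x in xs]

def encoder(xs, n_layers=6):
    """Encoders process the input layer by layer; 6 is the paper's depth."""
    for _ in range(n_layers):
        xs = encoder_layer(xs)
    return xs
```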

Tomorrow we'll break down the architecture of these two core elements: the encoder and the decoder.
Do you want to understand the Transformer architecture?
Then go check my latest article about Transformers 👇🏻

aigents.co/data-science-b…
If you are interested in...
• Python 🐍
• SQL 💾
• ML/MLOps 🛠
• LLMs & NLP 🗣
• DataViz 📊
• AI Engineering ⚙️

Then follow me → @rfeers
Did you like this post?

Then join my freshly started DataBites newsletter to get all my content right in your inbox every week! 🧩

👉🏻 databites.tech