Post

How to get URL link on X (Twitter) App

On the Twitter thread, click on or icon on the bottom
Click again on or Share Via icon
Click on Copy Link to Tweet
Paste it above and click "Unroll Thread"!
More info at Twitter Help

Tivadar Danka

@TivadarDanka

Nov 3, 2022 • 18 tweets • 6 min read • Read on X

Scrolly

Behold one of the mightiest tools in mathematics: the camel principle.

I am dead serious. Deep down, this tiny rule is the cog in many methods. Ones that you use every day.

Here is what it is, how it works, and why it is essential.

First, the story.

The old Arab passes away, leaving half of his fortune to his eldest son, third to his middle son, and ninth to his smallest.

Upon opening the stable, they realize that the old man had 17 camels.

This is a problem, as they cannot split 17 camels into 1/2, 1/3, and 1/9 without cutting some in half.

So, they turn to the wise neighbor for advice.

The wise man says "hold my camel", and solves the problem by lending one to the boys.

Now the stable has 18. The eldest son takes 9 home, while the middle and smallest son leaves with 6 and 2, as their father wished.

The wise man takes his camel back, and everybody is happy.

Thus, the camel principle is born: adding and subtracting the same quantity doesn't change the equality, but can help in the computation.

In mathematics, you cannot live without this principle.

I'll show you two examples.

The first one is the quadratic equation.

Its solution formula is one of the few things that everybody remembers from high school. Even if they are woken up in the middle of the night.

This formula is derived from the camel principle. Let me show you how!

After factoring out 𝑎 from the equation, we notice that the famous identity

(α + β)² = α² + 2αβ + β²

might help to factor the quadratic equation into a product.

To achieve that, we apply the camel principle!

After adding and subtracting the same quantity, the terms with 𝑥 factor into a product.

This leads straight to the solution formula.

There is an alternative version of the camel principle, performing a similar feat: multiplying and dividing with the same quantity.

This doesn't change the equality either.

To illustrate, let's look at derivatives, the main engine behind mathematics, physics, and optimization.

(And tons of other fields that allowed technology to get where it is now.)

How would you calculate the derivative of a composite function?

This is a quintessential question. Without this, you don't have backpropagation, gradient descent, and thus neural networks.

(At least until someone invents a clever alternative. But that'll take a while.)

You guessed right: the camel principle!

(At least, the second version, where you multiply and divide with the same quantity.)

After the camel principle is applied, the limit can be carried out termwise.

(For those with the eagle's eyes: yes, the denominator can be zero. You can epsilon-delta your way out of that, but I won't do it here.)

And thus, we have the chain rule, one massive pillar of science and technology.

This is what we use to perform backpropagation, enabling us to train our neural networks in a reasonable time.

The lesson here: tiny mathematical curios such as the camel principle are often dismissed as "lacking any applications".

However, such short-sightedness frequently leads astray.

By understanding atoms, you are able to build skyscrapers.

https://twitter.com/TivadarDanka/status/1588131890040434688

If you have enjoyed this explanation, share it with your friends and give me a follow! I regularly post deep-dive explainers such as this.

Understanding mathematics will make you a better engineer, and I want to help you with that.

https://twitter.com/TivadarDanka/status/1588131890040434688

• • •

Missing some Tweet in this thread? You can try to force a refresh

This Thread may be Removed Anytime!

Twitter may remove this content at anytime! Save it as PDF for later use!

More from @TivadarDanka

Tivadar Danka

@TivadarDanka

Jan 8

First, the story:

The old Arab passes away, leaving half of his fortune to his eldest son, third to his middle son, and ninth to his smallest.

Upon opening the stable, they realize that the old man had 17 camels.

This is a problem, as they cannot split 17 camels into 1/2, 1/3, and 1/9 without cutting some in half.

So, they turn to the wise neighbor for advice.

Read 18 tweets

Tivadar Danka

@TivadarDanka

Jan 1

The single most undervalued fact of linear algebra: matrices are graphs, and graphs are matrices.

Encoding matrices as graphs is a cheat code, making complex behavior simple to study.

Let me show you how!

If you looked at the example above, you probably figured out the rule.

Each row is a node, and each element represents a directed and weighted edge. Edges of zero elements are omitted.

The element in the 𝑖-th row and 𝑗-th column corresponds to an edge going from 𝑖 to 𝑗.

To unwrap the definition a bit, let's check the first row, which corresponds to the edges outgoing from the first node.

Read 18 tweets

Tivadar Danka

@TivadarDanka

Dec 11, 2025

To unwrap the definition a bit, let's check the first row, which corresponds to the edges outgoing from the first node.

Read 18 tweets

Tivadar Danka

@TivadarDanka

Dec 9, 2025

Matrix multiplication is not easy to understand.

Even looking at the definition used to make me sweat, let alone trying to comprehend the pattern. Yet, there is a stunningly simple explanation behind it.

Let's pull back the curtain!

First, the raw definition.

This is how the product of A and B is given. Not the easiest (or most pleasant) to look at.

We are going to unwrap this.

Here is a quick visualization before the technical details.

The element in the i-th row and j-th column of AB is the dot product of A's i-th row and B's j-th column.

Read 17 tweets

Tivadar Danka

@TivadarDanka

Nov 23, 2025

The single biggest argument about statistics: is probability frequentist or Bayesian?

It's neither, and I'll explain why.

Buckle up. Deep-dive explanation incoming.

First, let's look at what is probability.

Probability quantitatively measures the likelihood of events, like rolling six with a dice. It's a number between zero and one. This is independent of interpretation; it’s a rule set in stone.

In the language of probability theory, the events are formalized by sets within an event space.

The event space is also a set, usually denoted by Ω.)

Read 33 tweets

Tivadar Danka

@TivadarDanka

Nov 19, 2025

To unwrap the definition a bit, let's check the first row, which corresponds to the edges outgoing from the first node.

Read 18 tweets

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Share this page!

Enter URL or ID to Unroll

Tivadar Danka

Try unrolling a thread yourself!

More from @TivadarDanka

Tivadar Danka

Tivadar Danka

Tivadar Danka

Tivadar Danka

Tivadar Danka

Tivadar Danka

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!