Deedy

@deedydas

Nov 23 • 12 tweets • 2 min read

NVIDIA's $7B Mellanox acquisition was actually one of tech's most strategic deals ever.

The untold story of the most important company in AI that most people haven't heard of

1/12

Most people think NVIDIA = GPUs. But modern AI training is actually a networking problem.

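A single 80GB A100 holds at most ~40B parameters in FP16, and that is weights alone; gradients and optimizer state during training cut it to a few billion. Training large models requires splitting them across hundreds of GPUs.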
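
A rough check of this memory arithmetic, as a minimal sketch. It assumes an 80GB A100, 2 bytes per parameter for FP16 weights, and about 16 bytes per parameter for mixed-precision Adam training; both byte counts are assumptions, and activation memory is ignored.

```python
# Back-of-the-envelope only: byte counts per parameter are assumptions, not measurements.
GB = 1e9

def max_params(memory_gb: float, bytes_per_param: float) -> float:
    """Largest parameter count whose state fits in the given memory."""
    return memory_gb * GB / bytes_per_param

# Weights only in FP16: 2 bytes per parameter.
weights_only = max_params(80, 2)

# Mixed-precision Adam (assumed): FP16 weights (2) + FP16 gradients (2)
# + FP32 master weights (4) + FP32 momentum (4) + FP32 variance (4) = 16 bytes.
training = max_params(80, 16)

print(f"weights only: {weights_only / 1e9:.0f}B params")
print(f"training:     {training / 1e9:.0f}B params")
```

Under these assumptions a model larger than a few billion parameters already has to be sharded across GPUs for training.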

2/12

Enter Mellanox.

They were early leaders in RDMA (Remote Direct Memory Access), which lets one machine read and write memory on another with almost no CPU overhead. Before RDMA, moving data between machines was a massive bottleneck.

3/12

The secret sauce is in Mellanox's InfiniBand.

While Ethernet switches add roughly 200-400ns of latency per hop, InfiniBand switches add ~100ns. For distributed AI training where GPUs constantly sync gradients, this 2-3x latency difference is massive.
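
To see when a latency gap like this matters, here is a minimal sketch using the standard alpha-beta cost model for ring all-reduce. The thread does not say which collective is used, so ring all-reduce is an assumption, as are the 1024-GPU count, the 25 GB/s link, and the use of 100ns and 300ns as the full per-step latency (real per-message latency also includes NIC and software overhead).

```python
def ring_allreduce_seconds(n_gpus, message_bytes, latency_s, bytes_per_s):
    """Alpha-beta cost model: 2*(N-1) steps, each sending 1/N of the message."""
    steps = 2 * (n_gpus - 1)
    chunk_bytes = message_bytes / n_gpus
    return steps * (latency_s + chunk_bytes / bytes_per_s)

def slowdown(message_bytes, n_gpus=1024, bytes_per_s=25e9):
    """Ratio of all-reduce time at 300ns vs 100ns per-step latency (illustrative values)."""
    slow = ring_allreduce_seconds(n_gpus, message_bytes, 300e-9, bytes_per_s)
    fast = ring_allreduce_seconds(n_gpus, message_bytes, 100e-9, bytes_per_s)
    return slow / fast

print(f"1 MB gradients:  {slowdown(1e6):.2f}x slower")
print(f"10 GB gradients: {slowdown(10e9):.4f}x slower")
```

In this model small, frequent syncs are latency-bound and feel the full gap, while very large transfers are bandwidth-bound and barely notice it.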

4/12

Mellanox didn't just do hardware.

Their software support for GPUDirect RDMA (developed with NVIDIA) lets GPUs talk directly to network cards, bypassing the CPU & system memory. This cuts latency another ~30% vs traditional networking stacks.

5/12

NVIDIA's masterstroke: Integrating Mellanox's ConnectX NICs directly into their DGX AI systems.

The full stack - GPUs, NICs, switches, drivers - all optimized together. No one else can match this vertical integration.

6/12

The numbers are staggering:
- HDR InfiniBand: 200Gb/s per port
- Quantum-2 switch: 400Gb/s per port
- Switch port-to-port latency: ~100ns
- GPU memory bandwidth matching: ~900GB/s
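
The list mixes gigabits (Gb/s) and gigabytes (GB/s), so a quick conversion helps compare them. This is a minimal sketch; the helper name is hypothetical and the 900 GB/s figure is taken from the list as stated.

```python
def gbit_to_gbyte(gbit_per_s: float) -> float:
    """Convert gigabits per second to gigabytes per second (8 bits per byte)."""
    return gbit_per_s / 8

links = {"HDR InfiniBand": 200, "Quantum-2 port": 400}  # Gb/s, from the list above
GPU_SIDE_GBYTE_PER_S = 900  # GB/s, the thread's figure

for name, gbit in links.items():
    gbyte = gbit_to_gbyte(gbit)
    ratio = GPU_SIDE_GBYTE_PER_S / gbyte
    print(f"{name}: {gbyte:.0f} GB/s per port, {ratio:.0f}x below 900 GB/s")
```

Per port, the network is still well over an order of magnitude slower than the GPU-side figure, which is why systems use several NICs per node.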

7/12

Why it matters: Training SOTA-scale models requires:
- 1000s of GPUs
- Petabytes of data movement
- Sub-millisecond latency requirements
Without Mellanox tech, it would take literally months longer.

8/12

The competition is playing catch-up:
- Intel dropped Omni-Path (spun off to Cornelis Networks)
- Broadcom/Ethernet still has higher latency
- Cloud providers mostly stuck with RoCE
NVIDIA owns the premium AI networking stack

9/12

Looking ahead: CXL + Mellanox tech will enable even tighter GPU-NIC integration.

We'll see dedicated AI networks with sub-50ns latency and Tb/s bandwidth. The networking advantage compounds.

10/12

In the AI arms race, networking is the silent kingmaker.

NVIDIA saw this early.

The Mellanox deal wasn't about current revenue - it was about controlling the foundational tech for training next-gen AI.

11/12

Next time you hear about a new large language model breakthrough, remember: The GPUs get the glory, but Mellanox's networking makes it possible.

Sometimes the most important tech is invisible.

12/12
