Deedy Profile picture
Nov 23 12 tweets 2 min read Read on X
NVIDIA's $7B Mellanox acquisition was actually one of tech's most strategic deals ever.

The untold story of the most important company in AI that most people haven't heard of

1/12 Image
Most people think NVIDIA = GPUs. But modern AI training is actually a networking problem.

A single A100 can only hold ~50B parameters. Training large models requires splitting them across hundreds of GPUs.

2/12
Enter Mellanox.

They pioneered RDMA (Remote Direct Memory Access) which lets GPUs directly access memory on other machines with almost no CPU overhead. Before RDMA, moving data between GPUs was a massive bottleneck.

3/12
The secret sauce is in Mellanox's InfiniBand.

While Ethernet does 200-400ns latency, InfiniBand does ~100ns. For distributed AI training where GPUs constantly sync gradients, this 2-3x latency difference is massive.

4/12
Mellanox didn't just do hardware.

Their GPUDirect RDMA software stack lets GPUs talk directly to network cards, bypassing CPU & system memory. This cuts latency another ~30% vs traditional networking stacks.

5/12
NVIDIA's master stroke: Integrating Mellanox's ConnectX NICs directly into their DGX AI systems.

The full stack - GPUs, NICs, switches, drivers - all optimized together. No one else can match this vertical integration.

6/12
The numbers are staggering:
- HDR InfiniBand: 200Gb/s per port
- Quantum-2 switch: 400Gb/s per port
- End-to-end latency: ~100ns
- GPU memory bandwidth matching: ~900GB/s

7/12
Why it matters: Training SOTA scale models requires:
- 1000s of GPUs
- Petabytes of data movement
- Sub-millisecond latency requirements
Without Mellanox tech, it would take literally months longer.

8/12
The competition is playing catch-up:
- Intel killed OmniPath
- Broadcom/Ethernet still has higher latency
- Cloud providers mostly stuck with RoCE
NVIDIA owns the premium AI networking stack

9/12
Looking ahead: CXL + Mellanox tech will enable even tighter GPU-NIC integration.

We'll see dedicated AI networks with sub-50ns latency and Tb/s bandwidth. The networking advantage compounds.

10/12
In the AI arms race, networking is the silent kingmaker.

NVIDIA saw this early.

The Mellanox deal wasn't about current revenue - it was about controlling the foundational tech for training next-gen AI.

11/12
Next time you hear about a new large language model breakthrough, remember: The GPUs get the glory, but Mellanox's networking makes it possible.

Sometimes the most important tech is invisible.

12/12

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Deedy

Deedy Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @deedydas

Nov 19
Everyone thinks this is an exaggeration but there are so many software engineers, not just at FAANG, who I know personally who literally make ~2 code changes a month, few emails, few meetings, remote work, < 5 hours/ week, for ~$200-300k.

Here are some of those companies:
Oracle
Salesforce
Cisco
Workday
SAP
IBM
VMware
Intuit
Autodesk
Veeva
Box
Citrix
Adobe
The “quiet quitting” playbook is well known:

- “in a meeting” on slack
- scheduled slack, email, code at late hours
- private calendar with blocks
- mouse jiggler for always online
- “this will take 2 weeks” (1 day)
- “oh, the spec wasn’t clear”
- many small refactors
- “build is having issues”
- blocked by another team
- will take time bcuz like “race condition”
- “can you create a jira for that?”
Read 5 tweets
Oct 29
Programming languages have widely varying ability to communicate logic succinctly.

If you look at character counts of 10 basic programs in each, Java has 2x higher entropy than Python.

Very different result for natural spoken languages. Image
Heres a sample of 5 algorithms that were compared and their respective character counts. Image
When you extend the analysis to 100 programs, C / Java are nearly 2.5-3x less information dense than Python.

In other words, Python is the most token efficient way for LLMs to program in Image
Read 4 tweets
Oct 22
Height in China has exploded faster than any country in history at 1.75cm/decade!

Studies show the average male height over the last ~50yrs went from 5'6" to 5'9", leaving only Lebanon, Russia and Turkey are higher in Asia.

This is proof of how economy affects genetics.

1/4 Image
Causes of growth:
— Socioeconomic improvements
— One-child policy may have led to concentration of resources
— Increased protein and dairy consumption.

The 2.5cm/decade growth in the 70s far outdoes the 1cm/decade the west saw in the 1870-1970 period!

2/4 Image
Image
Women in China prefer guys over 180cm (5' 11"). Dating and height-increasing drugs is a part of this story too.

GeneScience makes >$1B selling Jintropin, which is HGH, along with Anhui Anke which can cost $8.6k-$24k/yr and is a booming market.

3/4 Image
Read 5 tweets
Oct 16
Compilers was was known to be the hardest CS class at Cornell which was hard as it is.

We were handed a 8-page PDF at the start of sem for a language spec we'd be implementing by the end of sem, split into 6 parts.

On part 5, the median was a 0/100 and most the class failed. Image
For those curious, here's the rest of the spec. Part 1: Image
Image
Image
Image
Part 2: Image
Read 4 tweets
Oct 9
BREAKING: One of India's most massive hacks is happening right now!

~31M rows of Star Health Insurance data — name, DOB, address, phone, PAN card and salary for Indians is selling it for $150k.

Hacker claims CISO Amarjeet Khurana sold him the data.

Nothing is private in India. Image
You can buy and see a sample of the data here: starhealthleak.stImage
Just to remind anyone wondering what the consequence of a data leak like this is:

— Identity theft
— Financial fraud
— Targeted scams
— Hacking other accounts
— Phishing attempts
— Account takeovers
— Extortion

It's really a long list..
Read 4 tweets
Oct 9
Nobel Physics Prize winner Geoff Hinton's middle name is Everest.

It comes from his great-great-granduncle Sir George Everest, once British surveyor-general of India, after whom Mount Everest was named.

Talk about lofty family expectations.
His lineage is full of shockers.

His great-great-grandfather? George Boole, the namesake of "boolean" that is essential to computers and the Information Age. Image
His great-grandfather coined the word "tesseract" Image
Read 7 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(