NVIDIA's $7B Mellanox acquisition was actually one of tech's most strategic deals ever.
The untold story of the most important company in AI that most people haven't heard of
1/12
Most people think NVIDIA = GPUs. But modern AI training is actually a networking problem.
A single A100 can only hold ~50B parameters. Training large models requires splitting them across hundreds of GPUs.
2/12
Enter Mellanox.
They pioneered RDMA (Remote Direct Memory Access) which lets GPUs directly access memory on other machines with almost no CPU overhead. Before RDMA, moving data between GPUs was a massive bottleneck.
3/12
The secret sauce is in Mellanox's InfiniBand.
While Ethernet does 200-400ns latency, InfiniBand does ~100ns. For distributed AI training where GPUs constantly sync gradients, this 2-3x latency difference is massive.
4/12
Mellanox didn't just do hardware.
Their GPUDirect RDMA software stack lets GPUs talk directly to network cards, bypassing CPU & system memory. This cuts latency another ~30% vs traditional networking stacks.
5/12
NVIDIA's master stroke: Integrating Mellanox's ConnectX NICs directly into their DGX AI systems.
The full stack - GPUs, NICs, switches, drivers - all optimized together. No one else can match this vertical integration.
6/12
The numbers are staggering:
- HDR InfiniBand: 200Gb/s per port
- Quantum-2 switch: 400Gb/s per port
- End-to-end latency: ~100ns
- GPU memory bandwidth matching: ~900GB/s
7/12
Why it matters: Training SOTA scale models requires:
- 1000s of GPUs
- Petabytes of data movement
- Sub-millisecond latency requirements
Without Mellanox tech, it would take literally months longer.
8/12
The competition is playing catch-up:
- Intel killed OmniPath
- Broadcom/Ethernet still has higher latency
- Cloud providers mostly stuck with RoCE
NVIDIA owns the premium AI networking stack
9/12
Looking ahead: CXL + Mellanox tech will enable even tighter GPU-NIC integration.
We'll see dedicated AI networks with sub-50ns latency and Tb/s bandwidth. The networking advantage compounds.
10/12
In the AI arms race, networking is the silent kingmaker.
NVIDIA saw this early.
The Mellanox deal wasn't about current revenue - it was about controlling the foundational tech for training next-gen AI.
11/12
Next time you hear about a new large language model breakthrough, remember: The GPUs get the glory, but Mellanox's networking makes it possible.
Sometimes the most important tech is invisible.
12/12
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Everyone thinks this is an exaggeration but there are so many software engineers, not just at FAANG, who I know personally who literally make ~2 code changes a month, few emails, few meetings, remote work, < 5 hours/ week, for ~$200-300k.
Here are some of those companies:
Oracle
Salesforce
Cisco
Workday
SAP
IBM
VMware
Intuit
Autodesk
Veeva
Box
Citrix
Adobe
The “quiet quitting” playbook is well known:
- “in a meeting” on slack
- scheduled slack, email, code at late hours
- private calendar with blocks
- mouse jiggler for always online
- “this will take 2 weeks” (1 day)
- “oh, the spec wasn’t clear”
- many small refactors
- “build is having issues”
- blocked by another team
- will take time bcuz like “race condition”
- “can you create a jira for that?”
Height in China has exploded faster than any country in history at 1.75cm/decade!
Studies show the average male height over the last ~50yrs went from 5'6" to 5'9", leaving only Lebanon, Russia and Turkey are higher in Asia.
This is proof of how economy affects genetics.
1/4
Causes of growth:
— Socioeconomic improvements
— One-child policy may have led to concentration of resources
— Increased protein and dairy consumption.
The 2.5cm/decade growth in the 70s far outdoes the 1cm/decade the west saw in the 1870-1970 period!
2/4
Women in China prefer guys over 180cm (5' 11"). Dating and height-increasing drugs is a part of this story too.
GeneScience makes >$1B selling Jintropin, which is HGH, along with Anhui Anke which can cost $8.6k-$24k/yr and is a booming market.