James Wang Profile picture
Aug 20, 2019 9 tweets 3 min read Read on X
1/ Cerebras just built a chip with 50x the transistor count, 1,000x the memory and 10,000x the bandwidth of Nvidia’s flagship GPU. One such 'chip' could replace an entire rack of Nvidia GPUs.

What the heck is going on?
2/ It’s no coincidence that the fastest AI chip today—Nvidia’s V100—is also the largest @ 815mm^2. More area = more cores and memory.

But why not make the chip even bigger? Two reasons..
3/ First, the ASML lithography machines have a reticle size of 858mm^2. V100 is basically at the limit of what today's manufacturing allows.
4/ Second, defect rate. The larger the chip, the greater the chance for chip defects. As chip size grows, the defect rate -> 100%, yield -> 0%. It’s very hard to make large chips and still have enough working ones to sell.
5/ So what did Cerebras do? They made the largest chip ever—a chip the size of an entire wafer.

If this were pizza, Nvidia/Intel/AMD make 30cm pies and then cut them up and sell tiny 2cm^2 slices. Cerebras sells a giant uncut 21cm^2 slab.
6/ How did Cerebras get around the reticle limit and defect rate?

Their chips still use a conventional photomask and expose the wafer sections at a time. But they worked with TSMC to lay additional wires to wire all the chips together so they can work as a whole.
7/ Cerebras deals with defects through build-in redundancy. The chip has extra cores and I/O lanes built-in. If one core is a dud, data moves around it just as traffic is re-routed around a closed city block. According to Cerebras, only 1.5% of the hw is for redundancy.
8/ Why is one giant chip better than a bunch of tiny ones?

Because you can fit the whole neural network on one chip rather than dice it up and spread it across dozens of GPUs. It is easier to program, way faster, and far more power efficient.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with James Wang

James Wang Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @draecomino

Aug 26, 2022
Going to attempt this absurdly technical recipe from the Contra cookbook tonight and live tweet the process. Let’s begin!
First, make bay leaf oil. Dried bay leaf is notorious for having little flavor. But fresh bay is very good. The oil takes on a piney/eucalyptus note. Gorgeous color too.
I’m using skate wing to make the sauce. It needs to be rich and milky so I’m adopting a Chinese technique - fry and rapid boil to emulsify the fat. It’s still a bit thin so I might reduce more.
Read 10 tweets
May 15, 2022
This year in the markets - learnings and outlook twitter.com/i/spaces/1PlKQ…
Books recommendations on the talk:

amazon.com/Idea-Factory-G…
Read 4 tweets
Oct 16, 2021
I’m going to have a fun TIME tonight.
love this UI for the avax bridge
good news, DAI is sent from the ethereum side.
bad news, it's nowhere to be found on the avax side.
#futureoffinance
Read 7 tweets
May 31, 2021
ETH, with a little help from @LidoFinance and @CurveFinance, can generate 12% yield. But where does this yield come from?

Let's break down the ponzu recipe. 🧫👇 Image
The Ethereum blockchain gives rewards to computers that validate transactions. If you hold ETH, you can validate transactions. The easiest way to do this is to use a service like Lido. The yield is currently ~6%. lido.fi Image
Normally when you stake your ETH, your ETH is locked up. Lido gives you staked ETH (stETH) tokens in return. This makes you ETH liquid and allows you to do stuff with them.
Read 6 tweets
Aug 25, 2020
The public cloud makes it easy for anyone to start a software company—but at a cost—your margins now belong to AWS. Thread:👇
There are three ways of paying for software infrastructure:
1. have your customer pay for it (cheap)
2. build your own data center (somewhat costly)
3. rent from a public cloud (very costly)
The cheapest infra is no infra. This is the classic enterprise software model: the customer buys your sw to run on their own hw. Selling pure sw yields the highest margins in industry: 90%+.
Read 8 tweets
May 13, 2020
1/ How James Cameron’s Terminator 2 predicted modern AI chips and sparked the debate on AI safety. An appreciation thread.👇
2/ This is the chip that powers the T-800. Based on its appearance and commentary from chief architect Miles Dyson, the movie makes three predictions about future processors: 1) neural net acceleration 2) multi-core design 3) 3D fabrication.

Let’s look at these claims.
3/ Among the many technologies Cameron could have picked for Terminator, neural network processor was spot on. Neural net is the breakout technology of the past decade. As of 2020, there are ~100 companies building neural net processors with annual revenues exceeding $5 billion.
Read 8 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(