Stephen Shankland Profile picture
Aug 17, 2020 49 tweets 18 min read Read on X
I'm tuning in to #HotChips2020 today, the 32nd year of the IEEE conference. My favorite graphic so far is AMD's 8-core Ryzen 4000 family of chips, the subject of a later presentation today. Thread/ Image
I watched the quantum computing presentations yesterday from IBM, Intel, Microsoft, and Google. Pretty interesting if you can handle the high gorpiness factor. The technology is very green, but is maturing. Here's that thread:
Right now we're starting with server chip news from Intel and IBM. Intel Ice Lake Xeon processor for servers using 2 chip sockets (Ice Lake-SP). Vs last-gen Cascade Lake, it's got 18% more instructions per clock tick performance boost. Image
IBM's Power10 server chip will arrive in servers in a little more that a year. It's enormous, with 18 billion transistors and 602mm2 area. Each core has 8 threads, and the high-end model will have 15 cores. (16 with one spare!) Built with Samsung 7nm process. Image
Power10-based servers can handle up to 2 petabytes of memory. Holy mackerel. (DDR4 to start but can be upgraded to DDR5 later.) You can up up to 16 processors into a single system. Big Iron! Image
Big Blue has Big Ideas about sharing enormous piles of memory within a pod of servers. "Power 10 open the door to memory disaggregation," says Bill Starke. Image
Now IBM's Brian Thompto is talking performance gains vs today's Power9 server chip. Estimates based on pre-silicon analysis predict some big boosts. Also notable 2.6x increase in performance per watt for better energy efficiency. Image
Expect Power10 to deliver 10X supercomputing speed (Top500 supercomputer ranking uses Linpack as benchmark) and better AI performance too (Resnet is AI image recognition software). Image
Now we're on to Marvell's upcoming Thunder X3 server chip, one of the upstart contenders vs Intel and AMD scale-out servers (the kind that are stacked up by the thousands in data centers). #HotChips2020 Image
Marvell's Rabin Sugumar promises 30% performance increase in single-thread performance for ThunderX3 vs ThunderX2 at a given clock speed. Image
Overall, though, expect a sizable performance increase. Frequency increase helps. Image
Now Anthony Saporito at IBM talking about IBM's chip for the latest z15 mainframe. Like all things mainframe it starts with a mainframe no-really-it's-still relevant pitch. "Programs writing for the first mainframes written back in 1964 still run on today's models." Image
Mainframes are good for computing loads with very high transaction volume, like credit card purchases or hotel reservations. IBM mainframes actually use 2 chips, the CP for computing and SP for storage. They're both honking big processors. #HotChips2020 Image
IBM mainframes have lots of error detection & correction features. The system can even roll back a chip to previous known-good state or automatically migrate one core's entire state to a backup core. IBM blasts its machines with proton beams to make sure they work. Neat!
Back when I wrote about servers a lot, it was a big deal when shiny new Linux came to hoary old IBM mainframes. Now it's pretty ordinary. Linux runs in virtual machine compartments organized by the hypervisor, but now there's a new ultravisor beneath for better security controls. Image
Now onto AMD Ryzen 4000 series chips, aka "Renoir," presented by is architect, Sonu Arora. 8 cores, more I/O, 2X performance per watt is a pretty big improvement. 156mm2 which is 1/4 the die area of those big IBM server chips. Image
Performance increased a bunch vs earlier "Picasso" chip design. Instructions per clock tick up 15%. Single-thread performance up 25% at the same 15W power limit, and multithread speed up 200%. #HotChips2020 Image
Ryzen's Vega graphics got a big boost, too, which will be important with Intel's Tiger Lake competitor getting Xe graphics. Not as big a boost vs the Xe jump, but a major decrease in chip surface area required. Image
Renoir consumes 59% less power during app execution than AMD's earlier Picasso, in part because it spends more time in low-power states. That means better battery life — or better options for boosting CPU/GPU performance when needed. Image
Ryzen 4000 has built-in support for USB 3.2 (most often delivered via USB-C ports these days). (Vs USB 4 and Thunderbolt 4 in Intel Tiger Lake). USB Image
"I'm so proud of what we have achieved with Renoir." –architect Sonu Arora. Image
Now onto Intel Tiger Lake, the new mobile chip coming this fall. Xavier Vera: "The top goal was higher performance for the same power budget," for power levels of 9W to 65W. A challenge to balance power-constrained multithread with power-unconstrained single-thread workloads. Image
Intel last week detailed lots of Tiger Lake improvements based on new SuperFin manufacturing technology: cnet.com/news/intel-tig…

But here are the core improvements to transistors & metal layers above in today's slides. Expect a significant perf boost vs current Ice Lake chips. ImageImage
You want block diagrams? We got block diagrams! This is an Intel Tiger Lake example with four Willow Cove cores, but Intel can employ different core counts. It integrates Thunderbolt 4, USB 4, DisplayPort 1.4, 6 cameras, 8K video, and PCIe Gen4. #HotChips2020 Image
You probably can't understate the importance of Xe graphics to Intel Tiger Lake. Integrated graphics have been meh but Xe should mean less need for discrete AMD or Nvidia GPUs. (Intel will offer discrete Xe GPUs, too — a first option this year and a gamer option in 2020.) Image
With PCIe Gen4 built straight into the Tiger Lake CPU, you'll be able to directly attach an SSD to the CPU. Image
Intel likes the idea of its "non-coherent fabric," but coincidentally those are the exact words my editor used to criticize my writing style.

(OK not really.)
DVFS power management means Intel can slosh around computing priorities to get the most work done for a given level of power consumption. DVFS states change frequently but Intel wouldn't say how frequently. Image
This is peeking ahead to this afternoon's talk on Intel's Xe graphics arriving with Tiger Lake, but here's a diagram showing how much chip real estate the GPU takes on the SOC. It's outlined in orange. Image
Keynote from Intel's Raja Koduri starts with a tribute to Fran Allen, who did a lot of work with compilers (which translate human-written languages into machine code). "She was the pioneer of this idea 'no transistor left behind.'" — the title of Koduri's talk. Image
I wonder, with Apple now moving from Intel chips to its own Arm-design chips, whether Intel execs will be wearing Apple Watches less often when on stage. (This is Intel's @Rajaontheedge.) Image
Demand for compute is doubling every 3-4 months these days, Koduri says. And of course we're generating piles of data to store and process. "We need more capacity and bandwidth at every level of the memory hierarchy." Image
On to Moore's Law, which people have predicted for decades will die soon. Koduri: "We definitely haven't explored the full entitlement of Moore's Law." Image
Koduri cites Jim Keller (who recently left Intel) vision for increasing transistor density by a factor of 50, perhaps over a decade. "We firmly believe there is a lot more transistor density to come. We are a persistent bunch. We are very good at compounding gains 1% at a time." Image
Koduri emphasizes that we need general-purpose computing performance increase, not just specialists (read: CPUs, not just GPUs and AI accelerators)
Architecture impact = performance X generality. Image
One of the tributes to Jim Killer that Koduri offered: a back-of-the-envelope calculation that a rack of servers could have a million CPU cores to keep Fortnite gamers happy with their cloud gaming. #HotChips2020 Image
Ninja programmers are great, but Intel has oneAPI initiative to write software abstraction layers to shield programmers from the complexities having of lots of processor types — CPU, GPU, AI, etc. Goal: to increase their productivity, use every transistor effectively. Image
Then (adapted from famed Hennessy and Patterson processor textbook): Software has fun on top of the hardware. Image
Koduri updates the cartoon for modern computing: Image
Xbox Series X processor is all about the graphics (as you might expect). #HotChips2020 Image
Here are the hardware details of the chip for the game console. Image
You can push Moore's Law, but it'll cost you, Microsoft says. The Xbox chips have had about the same area across three generations, but the cost went way up, in particular with new TSMC 7nm manufacturing process. Image
You can push Moore's Law, but it'll cost you, Microsoft's Jeff Andrews says. The Xbox chips have had about the same area across three generations, but the cost went way up, in particular with new TSMC 7nm manufacturing process. Image
There's a growing gap between shader calculation needs and memory size and bandwidth, says Microsoft's Mark Grossman. Image
One way to fix the problem is to reduce processing for areas that are lower priority, for example because of little color-value change compared to neighboring pixels. Like so many compression/efficiency moves, it's all about throwing out data humans won't know is missing. Image
Small amount of extra silicon area on the Xbox Series X processor boosts AI/machine learning tasks 3x-10x for jobs like super-resolution graphics or character behavior. Image
Microsoft promises slick and groovy graphics for the Xbox Series X, of course. No video demo, but here's a still shot they offered up. Image
But personally, I'm more interested in this high-resolution graphic. Amazing to realize what happens when you can plop down 15.3 billion transistors onto something the size of a skinny postage stamp. That's it for me and @hotchipsorg 32 tonight. Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Stephen Shankland

Stephen Shankland Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @stshank

Oct 14, 2022
Is the Pixel 7 Pro as good as a "real" camera? With @madebygoogle bragging of its "pro-level zoom" and AI smarts, I wanted to see how close it got to my $10,000 of DSLR gear. Answer: closer than any other phone. Here's my assessment & a thread. 1/ cnet.com/tech/mobile/i-…
You won't get as much detail from a 12-megapixel Pixel 7 Pro photo as from a 30-megapixel Canon 5D Mark IV in ordinary light, but it does well with color and sharpness. Left is P7P, right is 5D4, the second pair of images cropped to equivalent framing for pixel peeping. 2/
The zoom range of the Pixel 7 Pro is remarkable. It shoots natively at 0.5x, 1x, 2x, 5x, and 10x. It has clever processing tricks for intermediate levels and for reaching 30x (zoom fusion, Super Res Zoom). Here's one collection showing the full range. 3/
Read 15 tweets
Oct 12, 2022
Interesting brain-machine interface research from UC Davis' Weijian Yang. Elon Musk's Neuralink is going for electrodes, but Yang is working on using photons to stimulate specific neurons more precisely. Can now manipulate up to 50 specific neurons simultaneously in a mouse brain
Now moving from light to sound. Mikhail Shapiro from Caltech, using ultrasound to image brain activity — based on blood flow and microbubbles (from cyanobacteria). Also less invasive than electrodes. Much higher resolution than functional MRI (magnetic field) for imaging.
Ways to actually communicate with neurons using ultrasound: pushing, heating, or cavitation. "Ultrasound does directly activate the neurons," Shapiro says. (In culture, not in vivo.)
Read 7 tweets
Sep 22, 2022
Qualcomm CEO @cristianoamon just said @Qualcomm's future automotive business is now $30 billion worth of deals, up from $19B a couple months ago. "ADAS is becoming more material in our pipeline," referring to advanced driver assistance systems like lane keeping, auto braking.
Two of Qualcomm's car chip customers making appearances at the company's automotive day saying they're happy with Snapdragon car chips: General Motors CEO Mary Barra and Stellantis CEO Carlos Tavares. 2/
Qualcomm's @nakulduggal, head of the automotive business, boasts of @qualcomm's technology like AI processing & CV2X communications and its global presence. "Having all this this under one roof is not something I can imagine any other company having
access to." 3/
Read 10 tweets
Sep 22, 2022
After tons of pixel peeping, I say the iPhone 14 Pro is a big step up that serious photographers will appreciate. I love the 48-megapixel sensor for detail & cropping. Image quality like dynamic range is noticeably better. My assessment (and a thread). 1/ cnet.com/tech/mobile/wh…
Here's one example comparison, showing the same night view with iPhone 13 Pro and 14 Pro (12 megapixels, night mode), overexposed +1.3EV in editing to show better where noise is a problem. 13 Pro at left shows worse noise, detail and color. 2/
The 48-megapixel sensor means you can print photos much larger, crop in more, and rotate to fix crooked horizons. Here's a comparison of the print size of 12MP vs 48MP compared to a 5.8-inch-tall iPhone 14 Pro. 3/
Read 13 tweets
Sep 6, 2022
Interesting stats from @California_ISO from today's extremely hot weather and overtaxed electrical grid. You can see here power demand spiked at 50GW at 5:50 p.m. PT, while solar generation was tailing off for the day. caiso.com/TodaysOutlook/… thread 1/
Natural gas is what kept the lights on and the air conditioners running tonight. Natural gas power plans supplied about 26GW of power steadily since 6 p.m. PT. 2/
Solar supplies a *lot* of power to California during the day, but it fades as power usage rises in the evening, when prices soar (leading to the famous "duck curve," not shown here). Wind is comparatively small. 3/
Read 18 tweets
Sep 5, 2022
Wondering, without much of a sense of optimism, whether upgrading from Quicken Willmaker 2021 to 2022 will fix the bugs that I've wasted hours fighting today. (v2023 comes out in October and apparently goes online only. How long until it's only available via subscription?)
The biggest problem is I can't divide property in unequal shares among various people. The UI changes the % numbers I've already entered in crazy ways.

And here's the error message if I try deleting the duplicate entry for myself in the contacts section.
Since it's the last non-online version, I decided to upgrade. Couldn't get on the website (only not-yet-available v2023) so bought via Amazon download. The update fixed my biggest problem, yay! But, weirdly, disowned one of my kids, so I have to re-enter stuff.
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(