The @awscloud explanation of their outage earlier this week has been posted.

aws.amazon.com/message/12721/
Because this is incredibly dense and technical, let me try to simplify it. I'm sure I will be condescendingly corrected if I get this wrong...
"We made a change internally that caused a bunch of internal things to become extremely chatty, like AWS employees defending the company if someone says something even slightly unflattering on Twitter."
"All of the chatty stuff made it really hard to understand what was going on because everything started behaving like CDK evangelists and never shutting the hell up for one goddamned second. As a result, engineers had to basically guess what was broken."
"It's always DNS, so they started there. It was not DNS, which is one for the record books. They then focused on moving traffic service by service, which eventually cleared the congestion issues."
"The thing that broke was basically 'the internal AWS network' which is kind of important as it turns out. A lot of things fell over as a result. We have a lot of learning to do about this newly discovered behavior and its triggers. We are deeply sorry."
As @lizthegrey points out, "making changes to DNS to mitigate" appears to be homed through us-east-1; using @awscloud for DNS looks like it may be A Mistake You Should Avoid as a result. This is important, concerning, and more than a smidgen disappointing as a customer.
In all, it's a solid writeup that does an admirable job of transparency.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Corey Quinn

Corey Quinn Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @QuinnyPig

30 Nov
Welcome to my first-ever livetweet of an @aselipsky #reinvent keynote as a part of my requinnvent.com coverage. It's 8:30, sarcastically loud, I haven't slept, and it's time to see what our @awscloud friends have worked on all year long.
A reminder: Snarking about companies is usually okay; snarking about people (presenters, etc) is not. Punch up, not down. Be kind.

The failure mode of "clever" is "asshole."
We begin with a jarring transition from "loud rock" to "easy listening combined with a Windows screensaver theme." Clearly the #reinvent graphics refresh was delayed due to... I dunno, not having a bad enough team name or something.
Read 64 tweets
29 Nov
Getting into the @awscloud #reinvent partner keynote:

Staff: “Sorry, employees (orange lanyards) aren’t allowed in keynotes.”

Me: “It’s red. The lighting is super odd here.”

Once again I snuck past the “no Corey allowed in keynotes” rule!
And now I will livetweet the @awscloud partner keynote in this thread. #reInvent
@awscloud And now @dougyeum takes the stage and thanks us all for being here in person even though he won't be. "I'm moving to a new role within Amazon. Later, hosers."
Read 50 tweets
29 Nov
Now talking about and now the first launch of #reinvent coming from Bill Vass, VP and The Ultimate AWS Bill.
….but he might be referred to as “Dances With Robots” if the #reinvent music kicks in again.
“Build AI enabled robots” but I just wanted to build a website… ☹️
Read 8 tweets
24 Nov
I swear, @Quinnypiglet has some quotable moments. Some examples in a holiday thread...
@Quinnypiglet To Ethel, our "dog":

"Sit down, you rude bastard." Harsh, but fair.
@Quinnypiglet "Daddy, close your damn mouth, stop talking, and eat."

Honestly, everyone says this sooner or later.
Read 8 tweets
23 Nov
It'd be rude of me not to test out @_msw_ and company's fine work.

A thread.
Had to add a route table to the subnet that spoke ipv6. Had to also force traffic over @tailscale since @sonic doesn't offer ipv6 natively without tunneling yet, but then it "Just worked" since the emergence node has ipv6 on it. I'm in!
Let's start with security and update the thing. apt says "working" but it's been a suspiciously long time for that to be, y'know. Accurate.
Read 11 tweets
6 Nov
I need a Twitter plane game to play since it's no longer working hours. (Fun fact! On this Seattle --> SFO flight I once read and reviewed @RealGeneKim's entire "Unicorn Project" manuscript before landing.)

Game: name a company and I'll assign them a more appropriate motto.
"We named our company after the most expensive city in the US because we don't do subtle."
When your snack cupboard has its own loading dock.
Read 85 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us on Twitter!

:(