1/n OK, let me explain what's going on with the Facebook right now.

First, let's talk "routing". The Internet is a mesh of routers that forward packets. Packets go from source through a series of routers until they reach their destination.
2/ Your packets follow a path or route through the network. Here's the path my packets are currently taking going to twitter.
3/ Routers in the core of the Internet need to know the location of every IP address on the Internet. In this manner, they know which direction to forward a packet so that it reaches it's destination.
4/ IPv4 addresses are 32-bits in size, so roughly 4-billion possible IPv4 addresses. In theory, Internet routers could track each address separately, but that would take a lot of work. So instead, routers track IP addresses by subnet -- a range of addresses at the same location.
5/ The average subnet is 4-thousand addresses in size. This means instead of 4-billion entries in a routing table, there are only around a million. Note that different subnets vary in size, some are much larger, some are smaller.
6/ An IP address is thus split into two sections, the first bits at the front (on average, the first 20 bits) that are meaningful for routers. This is the "prefix". The remaining bits are only used once the packet reaches the target subnet.
7/ When you own address space, you must advertise your network prefix to your neighboring routers, who in turn, announce your route to their neighboring routers, and so on until the entire Internet knows your location.
8/ If you stop announcing the location of your prefix, well, then routers on the Internet stop forwarding packets to your network. They forget about you, and packets sent to you go nowhere.

Don't do that.
9/ Facebook did that.
10/ Now let's talk about DNS. We generally don't refer to machines on the Internet by their IP address, we refer to them by their name, like "twitter.com" or "facebook.com".

Our apps use DNS underneath to convert names to IP addresses.
11/ Router announcements (using BGP) and name lookups (using DNS) represent the logical structure of the Internet. When things fail, it's usually DNS and sometimes BGP.
12/ Facebook put their DNS servers inside it's own address space (instead of locating them elsewhere). Thus, because of the BGP problem, there's also a DNS problem.
13/ Not that it matters. If the DNS servers were functioning, they'd simply point to the offline address space, so attempts to contact Facebook still wouldn't work for routing reasons. But right now, the unreachability shows up as DNS reasons.
14/ DNS works by contacting intermediate servers called "resolvers". It's likely your local ISP's resolver (like 75.75.75.75 for Comcast) or a public resolver like Google's 8.8.8.8 or CloudFlare's 1.1.1.1.
15/ Since a billion people are running multiple apps trying to reach the Facebook every minute, they are now overloading resolvers trying to get an answer. And getting none, since Facebook's servers are down.
16/ A property of DNS is that successful answers are "cached" for a period of time. Once you get an answer, you probably won't ask again for some time. The amount of time is included in the response, such as "this answer is good for another hour).
17/ When lookups fail, there's no good caching of that failure. Indeed, you'll likely want to know the answer as soon as Facebook's servers become available again, rather than waiting hours or days before trying to lookup facebook.com again.
18/ So resolvers are now heavily loaded, meaning that a Facebook failure is causing failures throughout the rest of the Internet as DNS fails.
19/ But here's the FUN part: it's not just Facebook's DNS that's failed, but their internal network as well. Reports are that employees are locked out of their own buildings.
rawstory.com/even-facebook-…
20/ You can imagine right now that employees are no applying sledge hammers to concrete walls to make a hole so they can get into the server rooms to fix the BGP problem, because their badges don't work.
21/ In theory, fixing this BGP problem should be a quick fix. But when your entire infrastructure is interdependent on itself, then there's a lot of impediments to fixing the core problem.
22/ It's easy to point fingers and laugh, but in truth, problems at this scale are huge, and answers only obvious in hindsight.
23/ CloudFlare has a post on this blog.cloudflare.com/october-2021-f…
24/ Ah, not sledgehammers, but angle grinders.
25/ Ah, apparently they did have physical security issues, but did not destroy things, so this story is less fun.

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Robᵉʳᵗ Graham #PcapsOrItDidntHappen

Robᵉʳᵗ Graham #PcapsOrItDidntHappen Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @ErrataRob

1 Oct
October is "Cybersecurity Awareness Month", which is where the cybersecurity industrial complex will try to scare you about the evil witchcraft of hackers without providing any useful advice on how to protect yourself.
They will tell you to be suspicious of suspicious-looking links and not click on them, as if this were practical advice and that it's your failure for not doing this one simple things.
They will tell you to keep everything up-to-date on patches, as if that's a practical thing. Sure, you desktop and phone are pretty easy to keep up-to-date, but when was the last time you cared about updating the software in your TV?
Read 5 tweets
25 Sep
1/ There's no educating conspiracy-theorists like this guy, but nonetheless I'm going to try. This thing as a simple, rational explanation, though I'm not sure I can make it simple enough for conspiracy theorists.
2/ Cendyne is a company that does marketing for hotel companies, including sending out mass email.

When they get a customer, they often register a new domain on behalf of that customer, like mjh-email.com, hyatt-email.com, deniham-email.com.
3/ Their apps, such as those used to schedule events at venues like hotels, will do lookups on those domains. They do so in various ways, like "mail1.mjh-email.com" or also "mjh1.contact-client.com" -- both of which point to 66.216.133.36.
Read 12 tweets
23 Sep
1/ The post by @briankrebs is garbage. It's typical conspiracy-theory nonsense that seeks anomalies that can't otherwise be explained (except by the conspiracy).

It should try harder to explain them. In the following tweets I show you how.
2/ Take this part of the article, an inexplicable SPF entry that looks nothing like any of the other Trump Organization domains.

Yes, but it looks exactly like other domains that Cendyne manages on behalf of client hotel companies.
3/ It's Cendyne who registered the domain, not the Trump Organization. The domains are for sending bulk email, for which they use Listrak machines, which all have similar configurations.
Read 7 tweets
23 Sep
1/n In two days, they'll present the Maricopa audit live at 4pm Eastern. I plan on live tweeting it, as responses to this tweet, so you can bookmark this and check back Friday.

I'm certain there will be no value to my tweets, so you probably shouldn't.
2/n The report leaked early, so naturally I read it and wrote up a response discussing the cybersecurity bits.
blog.erratasec.com/2021/09/check-…
3/n Most of the news about the Cyber Ninjas is concerned about whether the results come out right (Biden vs. Trump). This is probably the most important part.

But my expertise is in the cybersecurity parts.
azcentral.com/story/news/pol…
Read 53 tweets
22 Sep
Wow. I was wrong with this tweet. So I deleted it and made a snapshot of it.

Conventional wisdom is that SSDs don't need defragmentation, which mostly right, meaning partly wrong.

Windows knows it's an SSD when asked to "defrag" and does what's appropriate. Image
I just asked Windows to defragment my SSD, which took less than a second, because all it did was make sure any "trims" needing done were fully completed. It knew the difference between SSD and rotating disk and did what was needed. Image
Read 5 tweets
22 Sep
Sooo.....

You have two choices:
#1 fall back on the "experts have debunked it". I'm an expert, I've debunked it.
#2 spend considerable amount of time understanding the issue so that you can competently debate it and answer questions, which frankly, isn't worthy your time
The short answer is this: the forensics investigators looked only at the C: boot drive, not the D: data drive were records are preserved. Thus, they could not have said whether or not records were correctly preserved according to state law.
Secondly, it's not a valid forensics report, because among other things, they violate forensics ethics by not putting their name on it and redacting information without disclosing the fact of redaction to the reader.
Read 13 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!

Follow Us on Twitter!

:(