i hope all of you tech leaders constantly asking me what it looks like, how hard it is, is it worth it? -- listen to this, 👉please👉 honeycomb.io/resources/eaze…
when @captfuzzbucket was hired, deploys took *four* hours every day of trying and failing and trying and failing to get their code out. every day. 🥶

frankly, they didn't realize that there was a better way, *period*.
"success is a catastrophe you have to survive" -- @ceejbot quote (destined for a sticker)
they had absolutely no visibility into what was happening. the answers were nowhere. ANY data at all would be a help.
at minute 18. they are talking about how soul killingly awful these systems are to support; they burn good people out, and fast. 😕
"but WHY? It wasn't doing that much! Something's very wrong here."
And then came 4/20.
EIGHT. HOURS. [ed: cringing in solidarity]
So that's when @ceejbot had the leverage and the exec attention to bring in honeycomb, and spend the time integrating it.
Ben started pumping ELB logs into honeycomb. it took 10 minutes, and right away...
* 403s spiking *everywhere* (microservices just bein flappy)
* random bursts of 500s (when driver shifts changed)
* random spikes of 400s for no apparent reason
* random spikes of 500s for no apparent reason
so they instrumented and rolled honeycomb out on their node services, racked up some wins, and took a deep breath and realized: they had to rewrite the monolith. it was unsalvageable. 😬 [ed: omgg been there]
❄️twice.❄️
no one remained from the team who built it, and the current team didn't really understand the codebase very well. they had to start by documenting it all.
❄️1❄️ understand the monolith, redesign a new architecture, and begin building it up using a whoel new stack
❄️2❄️ also keep the monolith limping along. DO NOT DIE.
both of which kicked off with honeycomb and instrumentation.
because holy hell they were DDoSing themselves in so many unbelievable and creative ways 😍❄️😍❄️😍
"Logging is phenomenal, but you have to know what you're looking for.. at the time you write the code."
ben "lol -- again, the only thing ops had to do was just point the logs at honeycomb and walk away. this is the best part for me.."
AND IT SAVED MY HISTORY, so i could go back and look up what i tried the last time!
... alright y'all i just realized there is an ACTUAL TRANSCRIPT at the bottom of the page with the webinary, so i do not know why the fuck i am tweeting all the quotes when i coudl be in bed sleeping. lmao
not many systems stories have a happy ending, so you are in for a treat. 🐝🌈