Robin Moffatt 🍻🏃🥓 Profile picture
DevEx Engineer at @Decodableco. Doing fun stuff with data and open source. 🌐 https://t.co/WparjfmCF5 🔗 Mastodon: @rmoff@data-folks.masto.host

Oct 5, 2022, 20 tweets

#Current22 @AdiPolak talking about chaos engineering

A scary list of all the things that could go wrong with data flows #Current22

“Disagree and commit” h/t @matryer #Current22

What can we learn from the software world of chaos engineering and apply it to the world of data flows?

Principles of chaos engineering

#Current22

Comparing steady state meaning in devops/SRE world to that in data #Current22

“The data isn’t wrong; your expectation of the data is wrong”

Chaos engineering - varying real world events. In data context this could be schematic changes, data corruption, fubar with partition deletion…

#Current22

No one wants to work weekends…

#Current22

Testing in production. Which in the data world means using production data 😱 #Current22

The stages of a data product #Current22

“git for data” with @lakeFS

#Current22

@lakeFS is an open source project, written in Go. It uses copy-on-write to efficiently provide duplicate copies of files. #Current22

Using quality check hooks to protect production data

Live demo time at #Current22

Creating a branch of data

“High performance yarn” is “fiction” 😆 #Current22

What if the join doesn’t work as we intended it?

Uh oh, we’ve got nulls

Now what do we do? Throw away the null data? Try and replace the values? How about just rolling back to before we made the change. #Current22

Now we fix the join and do it properly. We do the same data checks again to confirm it.

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling