- Run tests during regular hours so folks are around
- Avoid important dates
- Make smallest change to prove/disprove hypothesis
- Make rollback easy
- People, practices & process
- Applications
- Platforms
- Infrastructure
Credit goes out to @russmiles for the model
- Misconfigured timeout
- Misconfigured error handling
- Missing fallback
- Missing regional failover
Most common? Latency and performance.
- Latency
- Error rate
- Yield
- Harvest* - not sure about this one, didn't pick up on the specifics