📙 Amazon Web Services — a practical guide
created at July 13, 2016, 5:30 p.m.
A collection of postmortems. Sorry for the delay in merging PRs!
created at Aug. 3, 2015, 12:20 a.m.
Compilation of public failure/horror stories related to Kubernetes
created at Jan. 19, 2019, 10:42 a.m.
A curated list of Chaos Engineering resources.
created at July 26, 2017, 5:05 p.m.
The list of continuous integration services and tools
created at Oct. 28, 2014, 7:07 a.m.
A collection of postmortem templates
created at May 29, 2017, 10:41 a.m.
A curated list of Site Reliability and Production Engineering Tools
created at March 9, 2020, 7:36 a.m.
Run Book / Operations Manual template for modern software systems
created at July 9, 2016, 7:23 p.m.
Tips and tricks for getting through on-call
created at Nov. 7, 2016, 3:46 a.m.
A lifecycle model for describing incident management
created at Nov. 4, 2016, 4:40 p.m.