Postmortem Templates
This is a collection of postmortem templates derived from various sources such as the Site Reliability Engineering book, The Practice of Cloud System Administration book and other online resources.
Template List
- Template from Site Reliability Engineering book
- Template from The Practice of Cloud System Administration book
- Template from Google Developers Blog post
- Template from Azure status history posts
- Template from Michael Kehoe's blog post
- Template from the Real-World SRE book
- Template from Elastic Cloud incident report
Load templates automatically
You can load the postmortem templates automatically, without copying and pasting from the files or writing the structure by hand every time you author an incident report.
Vim
You can add the following line to your .vimrc file:
au BufNewFile postmortem-*.md 0r ~/postmortem-templates/templates/postmortem-template-srebook.md
Emacs
You can add the following line to your .emacs file:
(add-hook 'text-mode-hook (lambda () (when (and (string-prefix-p "postmortem-" (buffer-name)) (= (point-max) (point-min))) (insert-file-contents "~/postmortem-templates/templates/postmortem-template-srebook.md"))))
In both cases the filename pattern is postmortem-*. For example, if you create a file named postmortem-api-outage-2017-05-29.md, the predefined template is loaded into that file automatically. You can replace both the postmortem template and the pattern to match your needs.
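If you use another editor, the same idea can be approximated with a small script. The following is a minimal sketch, not part of this repository: the function name new_postmortem, its parameters, and the DEFAULT_TEMPLATE location are assumptions chosen to mirror the path and filename pattern used in the editor snippets above.

```python
import shutil
from datetime import date
from pathlib import Path

# Assumed template location, mirroring the path used in the Vim and Emacs snippets.
DEFAULT_TEMPLATE = (
    Path.home() / "postmortem-templates" / "templates" / "postmortem-template-srebook.md"
)


def new_postmortem(slug, template=DEFAULT_TEMPLATE, dest_dir=".", day=None):
    """Create postmortem-<slug>-<YYYY-MM-DD>.md from the template and return its path.

    `slug` is a short, hypothetical incident label such as "api-outage".
    """
    day = day or date.today()
    target = Path(dest_dir) / f"postmortem-{slug}-{day.isoformat()}.md"
    if target.exists():
        # Like the editor hooks, only fill in new files; never overwrite a report.
        raise FileExistsError(f"{target} already exists")
    shutil.copyfile(template, target)
    return target
```

Calling new_postmortem("api-outage") would create a file such as postmortem-api-outage-<today's date>.md in the current directory, pre-filled with the template. Pass a different template path to use one of the other templates.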
Examples
- Google Compute Engine Incident #16007
- Azure status history
- Buildbucket Postmortem: 6% builds lost on 2015-04-22
- Google API infrastructure outage incident report
- GitLab - Postmortem of database outage of January 31
- Summary of the Amazon DynamoDB Service Disruption and Related Impacts in the US-East Region
- Elastic Cloud Incident Report: February 4, 2019
Postmortem resources
- Learn out of mistakes. Postmortems to the rescue.
- A collection of post-mortems
- Blameless PostMortems and a Just Culture
- A Tale of Postmortems
- Building a Blameless Post-Mortem Culture with Jason Hand
- The infinite hows
- Failure is Always An Option: How a Blameless Culture Leads to Better Results
- How to write an Incident Report / Postmortem
- SysAdvent - Day 1 - Why You Need a Postmortem Process
- Etsy’s Debriefing Facilitation Guide for Blameless Postmortems
- Writing Your First Postmortem
- How to Write Great Outage Post-Mortems
- Postmortem Action Items: Plan the Work and Work the Plan
- Site Reliability Engineering resources
- Postmortem culture: how you can learn from failure
- Postmortem Culture: Learning from Failure
- re:Work- Postmortem discussion template
- Wheel of Misfortune Game
- Post-Incident Review Template by VictorOps
- Atlassian Incident Handbook: Incident Postmortems
- Collection of Kubernetes Failure Stories
- Best engineering practices: how to fix a bug?
- Beginners Guide to Incident Postmortems