Skip to content

πŸ” Post-Mortem & Incident Review ​

Learning from failures and incidents without blame.

Purpose ​

A post-mortem is a structured review after an incident or project failure. The goal is to learn and prevent recurrence, not to assign blame.

Post-mortem process ​

  1. Timeline β€” Reconstruct what happened, when, and who was involved
  2. Root cause analysis β€” Use the 5 Whys to dig deeper
  3. Impact assessment β€” What was affected? Users, revenue, reputation?
  4. What went well β€” What worked during the response?
  5. What went wrong β€” Where did things break down?
  6. Action items β€” Specific, assigned, time-bound improvements

5 Whys example ​

Problem: Deployment caused 2h outage
Why? β†’ Config file was wrong
Why? β†’ It wasn't reviewed before deploy
Why? β†’ No review step in the deploy process
Why? β†’ Deploy process was never formalized
Why? β†’ Team grew fast, process didn't scale
β†’ Action: Formalize deploy checklist with mandatory review

Post-mortem template ​

FieldContent
Date...
SeverityP1 / P2 / P3
Duration...
SummaryOne paragraph
TimelineChronological events
Root cause5 Whys result
ImpactUsers, systems, business
Action itemsWhat, who, by when

Best practices ​

  • Hold the post-mortem within 48 hours
  • Blameless culture β€” focus on systems, not individuals
  • Share findings broadly to prevent similar issues elsewhere
  • Follow up on action items in the next sprint
  • Archive post-mortems as organizational knowledge

Pergame Knowledge Base