Sunday 8th November, 2015
9:00am to 12:30pm
Outages and incidents are inevitable in large scale and complex systems. Fixing the underlying technical problems is a challenge, but even more challenging is identifying and fixing any underlying systemic technical or organizational issues that are making incidents more frequent, more severe, or more costly to resolve.
This hands-on workshop will explore techniques for analyzing and learning from a set of incident postmortems to learn about what types of insights are waiting to be discovered. Postmortems and incident reports for analysis will be provided, but attendees are encouraged to bring some of their own.
Sign in to add slides, notes or videos to this session