•  

Lost Art of Troubleshooting

A session at DevOpsDays Baltimore

There are a lot of great things about the cloud, but the "destroy and rebuild" philosophy which is really good for building a continuous delivery pipeline, really sucks when applied to troubleshooting production problems. When your application goes haywire, the most valuable engineering skill is not the the ability to bring up a copy of your system or even the knowledge of a your technology stack (although it doesn't hurt). It is the skill of understanding and solving problems.

Finding the root cause of the issue and mitigating it with minimal disruption in production is a must-have skill for engineers responsible for managing and maintaining production systems, which nowadays includes ops, dbas and devs alike. In this talk I will discuss the skills required to troubleshoot complex systems, traits that prevent engineers from being successful at troubleshooting and discuss some techniques and tips and trick for troubleshooting complex systems in production.

About the speaker

This person is speaking at this event.
Leon Fayer

@papa_fire

Coverage of this session

Sign in to add slides, notes or videos to this session

Tell your friends!

When

Date Tue 7th March 2017

Short URL

lanyrd.com/sfqwtt

View the schedule

Share

See something wrong?

Report an issue with this session