Architecting for Failure in a Containerized World

A session at QCon San Francisco 2016

The container revolution is upon us, and with it comes a new toolbox for building systems that are robust in the face of failure. Created in just the past few years, this set of tools demands that we rethink our approach to architecting for failure. When we do, we reap the benefits of architectural models that make it much simpler to reason about and handle failures of all types.

In this talk, we'll explore this toolbox and how its tools can be used to best effect. We will consider not only Docker but also the failure and recovery models built into orchestration systems such as Mesos and Kubernetes, which are often used with Docker. We will talk about how to recognize and recover from failures involving different parts of our application, including data persistence and active user operations. We will look in detail at how failure and recovery work in a real application and at the implications this has for system architecture.

This all comes together into an architectural framework that lets us model the various types of failures that we might see and be confident that our systems can recover from them.

About the speaker

Tom Faulhaber

Date Wed 9th November 2016
