by Sid Anand
In 2008, Netflix began to see traction in its new mode of video delivery -- video streaming to devices in the home and in your pocket. As part of this transition, we are witnessing a shift in our traffic patterns and in the expectations of our customers regarding availability. Specifically, as we become indistinguishable from TV, we cannot afford service downtimes, planned or otherwise. To complicate matters further, our systems operate in AWS, where we have less control over networking, persistence, virtualization, etc... In an effort to build a highly-available system using a sometimes-unavailable cloud, Netflix must adopt new deployment paradigms and new ways of testing reliability. From red-black deployments to the Simian army to reliable Cassandra clusters to new features in our platfrom service layer and more, we are investing in reliability across the board.
14th–18th November 2011