Surviving Black Friday - a Resilience Engineering Tale

A session at DevconTLV March

Tuesday 22nd March, 2016

4:55pm to 5:25pm (JMT)

The 'Black Friday fail' is the greatest fear of every major online retailer. Since downtime equals money, and in Black Friday it means quite a lot of money.

But the sad truth is that a failure of a service is inevitable, especially in a large distributed system. So how can we survive a failure of a service when it inevitably fails.
* In this lecture I will show how failures in large systems differs from failures in small systems.
* Will show examples of resilience engineering.
* Why simulate failures, and how to do it in your system.
* How to use gradual rollout, circuit breakers and automatic fallback to protect your system.
* The importance of failing fast, and failing silently.
* And the misconceptions we all have on how a large scale website failure unfolds.

About the speaker

This person is speaking at this event.
Omri Fima

Technical Lead Sears

Next session in Operations Track

5:25pm TBA

1 attendee

  • omri fima

Sign in to add slides, notes or videos to this session

Sign in to track this session

Tell your friends!


Time 4:55pm5:25pm JMT

Date Tue 22nd March 2016


Operations Track, Tel Aviv Cinematheque

Short URL


View the schedule


See something wrong?

Report an issue with this session