Data Science in the Cloud @StitchFix

A session at QCon San Francisco 2016

Stitch Fix is an online clothing retailer that not only focuses on delivering personalized clothing recommendations for our customers, but also applies the output of data science to automate numerous other business functions through the delivery of forecasts, predictions, and analyses via a robust API layer. We rely heavily on the ability for applied mathematics & statistics and our human decision makers to synergistically work; doing this well requires us to merge art & science together. However with over eighty data scientists in residence, it can be challenging to support so many different needs from an infrastructure perspective.

In this talk we’ll cover how we use the cloud to enable over 80 data scientists to be productive.
Specifically we’ll cover our infrastructure for:

Prototyping ideas, algorithms and analyses.
How we set up & keep schemas in sync between hive, presto, redshift & spark and make access easy for our data scientists.
How we productionize recommendation algorithms & our patterns for gracefully degrading and still serving fashion recommendations if something breaks down in our ETL.

About the speaker

This person is speaking at this event.
Stefan Krawczyk

My Professional Persona: Stitch Fixer, Ex-Idibon, http://Ex-Nextdoor.com Eng., Ex-Linkedin Eng., Stanford MSCS Grad bio from Twitter

Coverage of this session

Sign in to add slides, notes or videos to this session

Tell your friends!


Date Tue 8th November 2016

Session Hash Tag


Short URL


Official session page


View the schedule


See something wrong?

Report an issue with this session