Tuesday 16th June, 2015
3:00pm to 3:30pm
Uber is running businesses in 279 cities across 53 countries. In US, we covers 64% of the population. Every month, we created 50,000 driver jobs. Every day, we facilitates 1+ million trips globally. Uber has over ten engineering, business and data scientists teams which run all kinds of different data applications with all sorts of different performance characteristics and SLA. In this talk, we will present how we design and build this multi-tenancy architecture by Spark as a Platform while we scale our cluster size rapidly to match our business growth. We will discuss how we are leveraging Spark, SparkSQL, Spark Streaming, MLlib and IPython Notebook with Spark to build data applications efficiently in this shared environment. We will show why Spark make it a lot easier and we will also share our lessons from this on-going journey.
Sign in to add slides, notes or videos to this session