Apache Spark is a nascent big data framework that complements Apache Hadoop to offer fast, scalable access to big data. In recent months it's gained a lot of traction in industry as it has become an Apache incubation project. In this talk, you'll hear the good, bad and the voice beyond the hype.
This talk is about how to make Apache Spark cloud friendly, the kinds of jobs that are perfect for it and the kinds of performance and scale you can expect. Expect to see a bunch of demos on Microsoft Azure showing you the power of this framework for all aspects of big data, including interactive querying of terrabyte-scale datasets, machine learning and streaming messages in real-time.
Founder of the UK Azure Group, Microsoft Azure MVP and Insider. Purveyor of open source goodness. Slowly helping to make Apache big data projects a better experience in Microsoft's cloud.
Sign in to add slides, notes or videos to this session