Spark foundations: Prototyping Spark use cases on Wikipedia datasets

A session at Strata + Hadoop World in New York 2016

Monday 26th September, 2016

9:00am to 5:00pm (EST)

The real power and value proposition of Apache Spark is in building a unified use case that combines ETL, batch analytics, real-time stream analysis, machine learning, graph processing, and visualizations. Brian Clapper employs hands-on exercises using various Wikipedia datasets to illustrate the variety of ideal programming paradigms Spark makes possible. By the end of the training, you’ll be able to create proofs of concept and prototype applications using Spark.

The course will consist of about 50% lecture and 50% hands-on labs. All participants will have access to Databricks Community Edition after class to continue working on labs and assignments.

About the speaker

This person is speaking at this event.
Brian Clapper

Senior Instructor and Application Engineer at Databricks bio from LinkedIn

Sign in to add slides, notes or videos to this session

Tell your friends!


Time 9:00am5:00pm EST

Date Mon 26th September 2016

Short URL


Official session page


View the schedule


See something wrong?

Report an issue with this session