Apache Spark for Big Data Processing

A session at SpringOne 2GX 2015

Wednesday 16th September, 2015

4:30pm to 6:00pm (EST)

Big Data Track
Slides: http://www.slideshare.net/Spring...
Today, we live in the world of Big Data. Hadoop and MapReduce are highly dominant in the domain of large scale data processing. However, the MapReduce model shows its limits for various types of treatment, especially for highly iterative algorithms frequently encountered in the field of Machine Learning.

Spark is an in-memory data processing framework that, unlike Hadoop, provides interactive and real-time analysis on large datasets. Furthermore, Spark has a more flexible programming model and gives better performance than Hadoop.

In this talk, we aim at giving a portrait of Spark and at browsing its ecosystem, in particular Spark Streaming and MLlib with a concrete example. We will also show how you can use Spark with Spring XD, allowing you to take advantage of the strengths in each platform.

About the speakers

This person is speaking at this event.
Ludwine Probst

Data Engineer - Maths - Machine Learning - Duchess France Leader - GDG Leader

This person is speaking at this event.
Ilayaperumal Gopinathan

Software Engineer at Pivotal Inc

Sign in to add slides, notes or videos to this session

Tell your friends!


Time 4:30pm6:00pm EST

Date Wed 16th September 2015

Short URL


Official event site


View the schedule


See something wrong?

Report an issue with this session