Solving classical data analytic task by using modern distributed databases

A session at GOTO Copenhagen 2015

Tuesday 6th October, 2015

11:30am to 12:20pm (CET)

NoSQL databases have a limited query languages that are not suitable for analytical request. The classical solution provided by most of them is a Hadoop integration. That is not fast. Thus a number of fast distributed, parallel query/computation engines appears recently to fix Hadoop performance problems.
The presentation will show how to solve classical data analytic task by using modern distributed databases and in-memory engines using as example Spark and Cassandra. It will cover following topics:
Apache Spark benefits, architecture and Scala API. (Don't be afraid of Scala, we are here to help you)
Load and store data from Cassandra NoSQL database
Data enrichments and joins
Spark Machine learning and graph algorithms
Target audience: Software engineers and solution architects using or planning to use NoSql products for analytics, particularly Cassandra and Spark.

About the speaker

This person is speaking at this event.
Artem Aliev

Software Engineer at DataStax

Sign in to add slides, notes or videos to this session

Tell your friends!

View the schedule


See something wrong?

Report an issue with this session