by Nathan Marz
Cascalog is a Clojure data processing library for querying Big Data on top of Hadoop or analyzing data on a local computer from the REPL. It combines the conciseness of a DSL with the power of a general-purpose programming language. Cascalog is inspired by Datalog and blends logic programming with functional programming.
In this workshop, we’ll learn the basics of Cascalog. We’ll cover the API, inner and outer joins, functions, aggregators, negations, how the query planner works, how to create custom operations for queries, and how to read from diverse data sources such as HDFS and MySQL databases.
The workshop alternates short lectures with interactive problem-solving sessions in which you’ll work on problems that apply the new concepts from the lectures.
18th–20th September 2011