The MapReduce framework promises to make processing large data sets very easy. The approach scales well across many computing nodes and integrates easily with existing systems. This session introduces the basic techniques and ideas behind MapReduce, followed by hands-on examples using Apache Hadoop, a major implementation of MapReduce, and Hadoop's streaming functionality, which lets users write processing jobs not just in Java but in any programming language, including PHP.
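To give a rough idea of the model before the session, here is a minimal word-count sketch in Python. It is not taken from the talk: the names `mapper`, `reducer`, and `word_count` are illustrative, and the whole pipeline runs in one local process, with a plain sort standing in for Hadoop's shuffle phase.

```python
from itertools import groupby
from operator import itemgetter


def mapper(lines):
    """Map step: emit a (word, 1) pair for every word in the input."""
    for line in lines:
        for word in line.split():
            yield word.lower(), 1


def reducer(sorted_pairs):
    """Reduce step: sum the counts for each word.

    Expects the pairs to be sorted by key, so that all pairs for one
    word arrive next to each other.
    """
    for word, group in groupby(sorted_pairs, key=itemgetter(0)):
        yield word, sum(count for _, count in group)


def word_count(lines):
    """Run map, sort (a local stand-in for the shuffle), and reduce."""
    return dict(reducer(sorted(mapper(lines))))


# Hypothetical sample input, used only to demonstrate the pipeline.
sample = ["the quick brown fox", "the lazy dog"]
for word, count in sorted(word_count(sample).items()):
    print(f"{word}\t{count}")
```

With Hadoop streaming, the map and reduce steps would typically live in two separate scripts that read lines from standard input and write tab-separated key/value lines to standard output, which is why any language that can do so, PHP included, can be used.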
Chief @Agavi fanboy; opinionated lover of rockets, sharks & laser beams; RC helicopter pilot; world traveler; software engineer; firm believer in logic & reason (bio from Twitter)