Tuesday 23rd October, 2012
9:00am to 12:30pm
Software testing is hard enough on its own, but it becomes especially challenging when you’re doing large-scale, distributed data processing. This tutorial will combine lecture and instructor-led demonstrations to explain how you can verify that your code performs exactly as you intended.
This session will focus on four key topics:
Unit testing: Proving that a single piece of code works in isolation
Integration testing: Verifying that these units work correctly in conjunction with one another
Performance testing: Ensuring that the code runs at the expected speed and scale
Diagnostics: Extracting information from Hadoop that can help you isolate problems in your code
We will also discuss several problems developers commonly introduce into their code, as well as ways to recognize and solve them.
Curriculum Developer, Cloudera, Inc.