Hadoop makes it possible to extract business intelligence from extremely large, heterogeneous data sets that were previously impractical to store and process in traditional data warehouses. The challenge for many companies now is bridging the gap between the data in the data warehouse and the data in Hadoop. In this talk we'll discuss some steps that Orbitz has taken to bridge this gap, including examples of how Hadoop and Hive are used to aggregate data from large data sets, and how that data can be combined with relational data to create new reports that provide actionable intelligence to business users.
Husband. Father. Pet-wrangler. Partner Engineering team at Cloudera. Co-founder of Chicago Hadoop User Group and Chicago Big Data meetup.
5 more videos from Chicago Data Summit