Saturday 26th June, 2010
2:15pm to 3:00pm
A website like Orbitz generates millions of searches each day. Storing and processing the ever-growing volume of data generated by those searches becomes prohibitive with traditional systems such as relational databases. This presentation details how Orbitz is using new tools such as Hadoop and Hive to meet these challenges. We’ll discuss how Hadoop and Hive are being used to analyze search data in order to optimize the products shown to users and to detect trends in search keywords. This includes tasks such as using Hadoop to extract and transform data and using Hive to perform statistical analysis on that data.
Husband. Father. Pet-wrangler. Software Engineer at Cloudera. Co-author of Hadoop Application Architectures for O'Reilly.