by Ramesh Venkataramaiah and Jonathan Seidman
A website like Orbitz generates millions of searches each day. Storing and processing the ever-growing volumes of data generated by all of those searches becomes prohibitive though traditional systems such as relational databases. This presentation details how Orbitz is using new tools such as Hadoop and Hive to meet these challenges. We’ll discuss how Hadoop and Hive are being used to analyze search data in order to optimize the products shown to users and detect trends in search keywords. This includes such tasks as using Hadoop to extract and transform data, and using Hive to perform statistical analysis on that data.
26th June 2010