Tuesday 23rd October, 2012
9:00am to 12:30pm
Why should you care about HBase? Especially if you already know and use Riak/MongoDB/Cassandra/etc?
Well, HBase is inspired by Google’s battle-hardened “BigTable” architecture and is known to be one of the most scalable distributed databases around. For some perspective, the largest Riak and MongoDB clusters in production are measured in the dozens of nodes – while HBase clusters with hundreds of nodes aren’t unusual.
Even if you aren’t running a Top 100 site (yet), HBase could still be very useful to you since it’s strongly consistent and easy to reason about.
However, for all of HBase’s strengths, it is difficult to get started with. Instead of just running one daemon on each server in your cluster, you’ll have the fun of configuring HDFS (which requires you to setup a Namenode, backup Namenode, and datanodes), Zookeeper quorum (which should be at least 3 dedicated servers), and HBase itself (which will require a master, backup master, and regionservers). And that’s assuming you don’t want MapReduce or security features (which each require several more servers).
After this workshop you will:
Solutions Architect @ Cloudera, Drummer, Photographer, Dog lover bio from Twitter
Sign in to add slides, notes or videos to this session