Using HBase

A session at Strata New York 2012

Tuesday 23rd October, 2012

9:00am to 12:30pm (EST)

Why should you care about HBase? Especially if you already know and use Riak/MongoDB/Cassandra/etc?

Well, HBase is inspired by Google’s battle-hardened “BigTable” architecture and is known to be one of the most scalable distributed databases around. For some perspective, the largest Riak and MongoDB clusters in production are measured in the dozens of nodes – while HBase clusters with hundreds of nodes aren’t unusual.

Even if you aren’t running a Top 100 site (yet), HBase could still be very useful to you since it’s strongly consistent and easy to reason about.

However, for all of HBase’s strengths, it is difficult to get started with. Instead of just running one daemon on each server in your cluster, you’ll have the fun of configuring HDFS (which requires you to setup a Namenode, backup Namenode, and datanodes), Zookeeper quorum (which should be at least 3 dedicated servers), and HBase itself (which will require a master, backup master, and regionservers). And that’s assuming you don’t want MapReduce or security features (which each require several more servers).

After this workshop you will:

  • Have a high level understanding of HBase’s internal architecture
  • Setup a running HBase server on your laptop
  • Learn how to setup and benchmark a production-ready HBase cluster from scratch
  • Learn the fundamentals of HBase’s client APIs and use them to build a link shortener on HBase
  • Learn how to configure and tune HBase to work with your hardware for your needs (write heavy workload? lots of random reads?)
  • Learn about common HBase issues, and debug a real world production problem

About the speakers

This person is speaking at this event.
Matteo Bertozzi

Distributed Storage Layer Developer, Guitarist, ... bio from Twitter

This person is speaking at this event.
Amandeep Khurana

Solutions Architect @ Cloudera, Drummer, Photographer, Dog lover bio from Twitter

Sign in to add slides, notes or videos to this session

Tell your friends!


Time 9:00am12:30pm EST

Date Tue 23rd October 2012

Short URL


View the schedule


See something wrong?

Report an issue with this session