by Sid Anand
For the past 3 years, Netflix has been building a popular subscription-based service to stream movies and TV shows to game consoles, mobile devices, BluRay players, digital TVs, etc… With tens of millions of paying customers, Netflix has firmly established itself as a household brand in the US. Few people are aware that, while aggressively expanding our market and products, we have also moved our web and data infrastructure to Amazon Web Services. We currently use a large array of AWS’s offerings and deliver >90% of our web traffic from the cloud. While we have moved a significant portion of our web infrastructure to the cloud, the migration of our data has followed a slightly slower pace. Where we once solely relied on relational databases such as Oracle and MySQL, today we use a combination of technologies, including but not limited to SimpleDB, S3, Cassandra, and HBase. We also leverage open source caching technology like Memcached and Squid. This talk will detail the current evolution of Netflix’s cloud-based data infrastructure and specifically its use of open source technology.
Between the NoSQL movement and new cloud offerings, it seems there are new storage options popping up every day. How do you select which one is the best for your project? The truth is that it's unlikely one option is best for all your needs. This session walks you through the various options considered by one startup and how it selected five separate storage engines - and has no regret doing so!
In this workshop, one of the core MongoDB committers will present the fundamental principles of MongoDB, how to set up and interact with the database, and what to consider when building applications using a document-based data model.
Redis is an entry in the new breed of nosql databases. But it takes a different approach that makes it much more interesting then most of the other key/value stores in the same category. Come learn what makes redis so useful that it seems everyone is adding it to their toolbox.
by Tom White
Apache Whirr is a way to run distributed systems - such as Hadoop, HBase, Cassandra, and ZooKeeper - in the cloud. Whirr provides a simple API for starting and stopping clusters for evaluation, test, or production purposes. This talk explains Whirr's architecture and shows how to use it.
by Brian Aker
Many people view topics like Map/Reduce and queue systems as advanced concepts that require in-depth knowledge and time consuming software setup. Gearman is changing all that by making this barrier to entry as low as possible with an open source, distributed job queuing system.
25th–27th July 2011