A talk about how scaling foursquare using MongoDB and Scala.
Solr, an open source enterprise search server, scales very well within an index (vertical scaling). It is when you have multiple indexes (horizontal scaling) that it starts to get hairy, which happens a lot when you are hosting a cloud based solution for multiple users. In this session we will discuss these issue as well as the techniques of how to overcome them in-depth.
by Jeff Kirkell
The popularity of NoSQL opens up an endless array of possible uses but also causes its own set of problems. Riak, a NoSQL offering created by Basho solves this by claiming to have no single point of failure. Proving this goes a long way to dispelling the concerns within an enterprise to begin adopting a non-relational solution.
by Bradley Holt
CouchDB is a document-oriented database that uses JSON documents, has a RESTful HTTP API, and is queried using map/reduce views. Each of these properties alone, especially MapReduce views, may seem foreign to developers more familiar with relational databases. This tutorial will teach web developers the concepts they need to get started using CouchDB in their projects. CouchDB’s RESTful HTTP API makes it suitable for interfacing with any programming language. CouchDB libraries are available for many programming languages and we will take a look at some of the more popular ones.
by Rob Treat
The open source database landscape has never been in more turmoil, and yet the popularity of Postgres continues to grow and grow. Get up to speed on what you need to know to administer the world's most advanced open source database, including installation, configuration, tuning, and how best to use PostgreSQL's community resources; with special focus on Postgres 9 and the upcoming 9.1 release.
Algorithms are getting raunchier, tools more potent and competitions more intimate! Let us mix analytics tools (like R & Mahout) and a dash of algorithmics to work on BigData Analytics competitions and see if the answer is always 42. In the process we will explore and apply a few good algorithms, to the Heritage Health competition …
You have an idea for an app. Great! First you have to munge and maintain the data. Did you know there is one data API to pull clean, updated data from multiple sources? It slices, it dices, it serves out data on geo, social & more! And you don't need even touch MySQL. Mash up some data with the Infochimps Data Scientists Jacob Perkins, Dhruv Bansal and Ham the Incredible Coding Chimp.
This hands-on tutorial aims at learning the basics of the important machine learning algorithms in Mahout. It aims to help you get it up and running on a Hadoop cluster. Mahout is open source implementation of a collection of algorithms designed from ground up to sift through terabytes of data and help bring out important patterns which are otherwise not in the reach of standard tools.
25th–27th July 2011