A talk about scaling foursquare using MongoDB and Scala.
by Andy Blyler and Lindsay Snider
Solr, an open source enterprise search server, scales very well within an index (vertical scaling). It is when you have multiple indexes (horizontal scaling) that it starts to get hairy, which happens a lot when you are hosting a cloud-based solution for multiple users. In this session we will discuss these issues in depth, as well as the techniques for overcoming them.
by Jeff Kirkell
The popularity of NoSQL opens up an endless array of possible uses but also causes its own set of problems. Riak, a NoSQL offering created by Basho, addresses this by claiming to have no single point of failure. Proving this claim goes a long way toward dispelling the concerns that keep an enterprise from adopting a non-relational solution.
by Bradley Holt
CouchDB is a document-oriented database that uses JSON documents, has a RESTful HTTP API, and is queried using map/reduce views. Each of these properties alone, especially map/reduce views, may seem foreign to developers more familiar with relational databases. This tutorial will teach web developers the concepts they need to get started using CouchDB in their projects. CouchDB’s RESTful HTTP API makes it suitable for interfacing with any programming language. CouchDB libraries are available for many programming languages, and we will take a look at some of the more popular ones.
Several techniques and approaches to scaling CouchDB will be discussed. You will learn about CouchDB’s peer-based, bi-directional replication, how to handle conflicts, and the concept of eventual consistency. We will discuss how CouchDB relates to other SQL alternatives, what problems CouchDB is good at solving, the trade-offs as compared to other tools, and what makes CouchDB unique. We will also cover authentication, authorization, show functions, and list functions. We will take a brief look at ways of distributing CouchDB, including to mobile devices and as CouchApps (web applications built using CouchDB, JavaScript, and HTML5).
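For readers new to the map/reduce views mentioned in this abstract, the idea can be sketched in a few lines of plain Python. This is only an in-memory illustration under stated assumptions, not CouchDB's actual API: real views are JavaScript functions stored in design documents and queried over HTTP. The sample documents and the names `map_tags`, `reduce_sum`, and `query_view` are hypothetical.

```python
from collections import defaultdict

# Hypothetical JSON-like documents, as they might be stored in a database.
docs = [
    {"_id": "post-1", "type": "post", "tags": ["couchdb", "json"]},
    {"_id": "post-2", "type": "post", "tags": ["couchdb", "replication"]},
    {"_id": "comment-1", "type": "comment", "tags": ["ignored"]},
]


def map_tags(doc):
    """Map step: yield one (key, value) row per tag of each post.

    Plays the role of calling emit(tag, 1) inside a view's map function.
    """
    if doc.get("type") == "post":
        for tag in doc.get("tags", []):
            yield tag, 1


def reduce_sum(values):
    """Reduce step: collapse all values emitted for one key into a total."""
    return sum(values)


def query_view(documents, map_fn, reduce_fn):
    """Run the map step over every document, then reduce the rows per key.

    This approximates a grouped view query; results are ordered by key.
    """
    grouped = defaultdict(list)
    for doc in documents:
        for key, value in map_fn(doc):
            grouped[key].append(value)
    return {key: reduce_fn(values) for key, values in sorted(grouped.items())}


tag_counts = query_view(docs, map_tags, reduce_sum)
print(tag_counts)
```

The sketch recomputes everything on each call, which keeps it short; it says nothing about how CouchDB stores or updates view results.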
by Rob Treat
The open source database landscape has never been in more turmoil, and yet the popularity of Postgres continues to grow and grow. Get up to speed on what you need to know to administer the world's most advanced open source database, including installation, configuration, tuning, and how best to use PostgreSQL's community resources, with special focus on Postgres 9 and the upcoming 9.1 release.
Algorithms are getting raunchier, tools more potent and competitions more intimate! Let us mix analytics tools (like R & Mahout) and a dash of algorithmics to work on BigData Analytics competitions and see if the answer is always 42. In the process we will explore and apply a few good algorithms to the Heritage Health competition …
by dhruvpatil and winnie hsia
You have an idea for an app. Great! First you have to munge and maintain the data. Did you know there is one data API to pull clean, updated data from multiple sources? It slices, it dices, it serves out data on geo, social & more! And you don't even need to touch MySQL. Mash up some data with the Infochimps Data Scientists Jacob Perkins, Dhruv Bansal and Ham the Incredible Coding Chimp.
by Robin Anil and Ted Dunning
This hands-on tutorial teaches the basics of the important machine learning algorithms in Mahout and aims to help you get it up and running on a Hadoop cluster. Mahout is an open source implementation of a collection of algorithms designed from the ground up to sift through terabytes of data and help bring out important patterns that are otherwise out of reach of standard tools.