Get Lanyrd on your mobile (iPhone, Android and more) - check it out here

OSCON Data 2011 schedule

Tuesday 26th July 2011

Wednesday 27th July 2011

  • Consistency or Bust - Breaking a Riak Cluster

    by Jeff Kirkell

    The popularity of NoSQL opens up an endless array of possible uses but also causes its own set of problems. Riak, a NoSQL offering created by Basho solves this by claiming to have no single point of failure. Proving this goes a long way to dispelling the concerns within an enterprise to begin adopting a non-relational solution.

    At 10:40am to 12:10pm, Wednesday 27th July

    In Oregon Ballroom 203, Oregon Convention Center

    Coverage slide deck

  • Learning CouchDB

    by Bradley Holt

    CouchDB is a document-oriented database that uses JSON documents, has a RESTful HTTP API, and is queried using map/reduce views. Each of these properties alone, especially MapReduce views, may seem foreign to developers more familiar with relational databases. This tutorial will teach web developers the concepts they need to get started using CouchDB in their projects. CouchDB’s RESTful HTTP API makes it suitable for interfacing with any programming language. CouchDB libraries are available for many programming languages and we will take a look at some of the more popular ones.

    Several techniques and approaches to scaling CouchDB will be discussed. You will learn about CouchDB’s peer-based, bi-directional replication, how to handle conflicts, and the concept of eventual consistency. We will discuss the how CouchDB relates to other SQL alternatives, what problems CouchDB is good at solving, the trade-offs as compared to other tools, and what makes CouchDB unique. We will also cover authentication, authorization, show functions, list functions. We will take a brief look at ways of distributing CouchDB including to mobile devices and CouchApps (web applications built using CouchDB, JavaScript, and HTML5).

    At 10:40am to 12:10pm, Wednesday 27th July

    In Oregon Ballroom 204, Oregon Convention Center

    Coverage slide deck

  • Pro PostgreSQL 9

    by Rob Treat

    The open source database landscape has never been in more turmoil, and yet the popularity of Postgres continues to grow and grow. Get up to speed on what you need to know to administer the world's most advanced open source database, including installation, configuration, tuning, and how best to use PostgreSQL's community resources; with special focus on Postgres 9 and the upcoming 9.1 release.

    At 1:40pm to 3:10pm, Wednesday 27th July

    In Oregon Ballroom 204, Oregon Convention Center

  • The Hitchhiker’s Guide to A Kaggle Competition

    by Krishna Sankar

    Algorithms are getting raunchier, tools more potent and competitions more intimate! Let us mix analytics tools (like R & Mahout) and a dash of algorithmics to work on BigData Analytics competitions and see if the answer is always 42. In the process we will explore and apply a few good algorithms, to the Heritage Health competition …

    At 1:40pm to 3:10pm, Wednesday 27th July

    In Oregon Ballroom 203, Oregon Convention Center

    Coverage slide deck

  • Cook Up a Data Mashup on the Fly with Infochimps

    by dhruvpatil and winnie hsia

    You have an idea for an app. Great! First you have to munge and maintain the data. Did you know there is one data API to pull clean, updated data from multiple sources? It slices, it dices, it serves out data on geo, social & more! And you don't need even touch MySQL. Mash up some data with the Infochimps Data Scientists Jacob Perkins, Dhruv Bansal and Ham the Incredible Coding Chimp.

    At 4:10pm to 5:40pm, Wednesday 27th July

    In Oregon Ballroom 204, Oregon Convention Center

  • Hands On Mahout - Mammoth Scale Machine Learning

    by Robin Anil and Ted Dunning

    This hands-on tutorial aims at learning the basics of the important machine learning algorithms in Mahout. It aims to help you get it up and running on a Hadoop cluster. Mahout is open source implementation of a collection of algorithms designed from ground up to sift through terabytes of data and help bring out important patterns which are otherwise not in the reach of standard tools.

    At 4:10pm to 5:40pm, Wednesday 27th July

    In Oregon Ballroom 203, Oregon Convention Center

    Coverage slide deck