Introduction, basic schema design and a scalable message bus.
We will build an IRC server based on MongoDB. In this first session we will review how the server works and how to manage the necessary data. We will look in detail at how to build a Message Bus that is the backbone of our IRC server.
by Steve Pousty
You have seen the stuff that FourSquare has done with spatial and you want some of that hotness for your app. But, where to start? Have no fear - by the end of this session you will have all the pieces necessary to write your own location based app. The OpenShift platform already has MongoDB plus the spatial bits installed, so there is no need to find a VPS or convince your IT dude to install stuff. What's OpenShift? It's Red Hat's free auto-scaling Platform as a Service. This session will start with a quick intro on firing up an OpenShift instance with MongoDB. Then we will load some data into MongoDB, show you how to handle spatial data, do some command line spatial operations, and finally plugin in some code to build a simple "Find the National Park Application". When you go home you will be able to amaze your friends and supervisors with some spatial magic goodness you can control.
Getting insights out of MongoDB is easy if you know how to program. But what if you don't? In this talk, we introduce Analytica, a software product with an Excel front-end that makes it easy to report, analyze, and graph your data directly from MongoDB. We will show you how to build reports, create graphs, and deploy Analytica without impacting your production deployment.
by Kyle Banker
MongoDB has been designed for versatility, but the techniques you might use to build, say, an analytics engine or a hierarchical data store might not be obvious. In this talk, we'll learn about MongoDB in practice by looking at hypothetical application designs (based on real-world designs, of course). Topics to be covered include schema design, indexing, transactions (gasp!), trees, what's fast, and what's not. Sprinkled with tips, tricks, shoots, ladders, and trap doors, you're guaranteed to learn something new in this interdisciplinary talk.
by Richard Kreuter
Deployment, replication and monitoring.
Now that we've built our app, we will look at how to deploy it in production. We will design and deploy a replica set to support a highly available backend for the server. We'll also show you how to monitor your server, both from the shell and using the MongoDB Monitoring Service (MMS). We'll also try to break our cluster and show how the service stays running throughout failures as well as how to recover from catastrophic failures.
Understanding the concerns and goals of running multiple datacenter in a hot-failover or hot-hot-hot configuration can be challenging. Come to this talk to learn the basics while finish up with a much better understanding how to deploy MongoDB across the world. We will work through deploying multi data center for read/write locality, increased availability and improved systems isolation, for disaster preparedness with recovery. In addition we'll look at the MongoDB roadmap and planned enhancements around data center awareness.
You have a need for MongoDB, and you’d like to run it in the cloud…but can it be done quickly? It’s much easier than you might think. In this presentation, Claudio Gentile from RightScale (Leaders in Cloud Management) will discuss real world use cases for social gaming applications using MongoDB, and will showcase how easy it is to deploy Mongo as part of a 3-tier cloud architecture in just minutes. The audience will learn:
MongoDB supports write-ahead journaling (by default) to facilitate fast crash recovery and consistency in database files after that crash. In this session, we'll give an overview of on-disk persistence with MongoDB, journaling, and discuss the internals of journaling and the storage engine.
by Ian White
Sailthru is a behavioral communication and analytics platform focusing on email, onsite, and mobile personalization. Sailthru stores terabytes of data for hundreds of millions of end users and billions of messages, all in MongoDB. Since we presented last year, we've had 10x growth and experienced lots of challenges scaling on AWS. We've learned lots of lessons about building and designing for MongoDB at scale, and we'll share them.
Build a scalable message logging service and then shard it - Live!
We will extend the IRC server to log the full chat history. This requires a scalable backend to store an infinitely growing volume of messages. We'll look at several ways of designing the message storage and the limitations and tradeoffs of each. We will then deploy our updated IRC server and upgrade from a replica set to a sharded cluster without any downtime.
by Ben Sabrin
Adoption of MongoDB has accelerated tremendously among developers in the past 18 months, and many large enterprises have now deployed MongoDB in reliable and large scale production environments. However, for many developers, it remains a challenge to convince production teams and business stakeholders to adopt an open source technology that has not been certified yet by their IT teams. This session will provide you with the compelling arguments to reassure business and production teams such as:
by Deep Kapadia
This is a follow up talk to Jake Porway's talk at MongoNYC 2011 where he presented Project Cascade as one of R&D Labs Projects backed by MongoDB. Since then Project Cascade has moved to a scalable production level system. The talk will cover challenges and lessons learnt in moving MongoDB from a research project to a production environment.
In this session we’ll talk through a series of examples to distill some of our best operational practices. The format of this talk is an interactive and fun adventure through some real-world cases that come from real systems and large deployments. This session will touch on backups, network availability, performance pitfalls, indexing/schema-design, log management, monitoring and alerting along with some good examples of diagnostic techniques with a goal of finding good solutions.
We will explore the various options for backups, go through the rich information available through the monitoring service (MMS), and observe the automated chunk split and automated chunk migration across shards. We will also provoke several failures in the cluster and see how to recover.
by Dan Pasette
In this talk we will explain the elements of a successful MongoDB deployment and focus on the where, how and why they fit into the development process.
by Jon Hoffman
At foursquare we store almost all our live data within sharded mongodb clusters. We handle thousands of requests per second via our web site and api and each of those may generate several dozen mongo queries. This talk will address the practical details of our configuration, optimizations we've found useful, and the operational techniques we use to keep things running smoothly.
One of the most awesome benefits of Mongo is that its schemaless. This means you can do some really interesting things in your models such as dynamically adding attributes to the model without requiring a migration. The simplest example is adding createdat or updatedat timestamps. We'll go over some cool examples and discuss best practices. We'll cover best practices to avoid downtime in production while migrating to a different data representation.
by Buck Heroux
MongoDB is helping Next Big Sound change how the music industry uses big data. By marrying social media data with traditional sales data, Next Big Sound creates a central platform for all of the music industries needs with MongoDB working hard behind the scenes. This use case analysis will hit key points such as cluster architecture, document schema, some growing pains, and how Next Big Sound uses MongoDB to store rich event data as well as index hundreds of thousands of artists by their network distribution and geolocation.
by Richard Kreuter
MongoDB supports a wide range of indexing options to enable fast querying of your data. In this talk we’ll cover how indexing works, the various indexing options, and cover use cases where each might be useful.
by Sridhar Nanjundeswaran
by Chris Westin
We're working on a new aggregation framework for MongoDB that will introduce a new aggregation system that will make it a lot easier to do simple tasks like counting, averaging, and finding minima or maxima while grouping by keys in a collection. The new aggregation features are not a replacement for map-reduce, but will make it possible to do a number of things much more easily, without having to resort to the big hammer that is map-reduce. After introducing the syntax and usage patterns for the new aggregation system, we will give some demonstrations of aggregation using the new system.
by Dan Pasette
At 5ivePoints, we have a MongoDB database with 85+million US voters, every one of them geocoded. Our application allows supporters of political campaigns across the country to pull up nearby voters on their mobile phone to canvas. Moreover, campaigns can create custom filters using any combination of voter attributes, and we provide data-driven filters that update in near real-time based on campaign results. But with a collection of this size, each index is on the order of GBs, and un-indexed queries aren't feasible. I'll discuss how we are able to support flexible user queries - integrating both geo-spatial and other attributes - with only 4 indexes, minimizing page faults caused by pulling rarely-used indexes into memory.
by Kiril Savino
GameChanger is a giant data pipeline where high throughput on dynamic content is the top priority. We route live sports data in and back out to content generation systems, phones, dynamic web viewers, mobile apps, and email, peaking out at 10s of 1000s of constantly changing data streams. Our play-by-play collection will soon pass 1 billion 1k-sized, sport-variable documents- it's not something you can just query. We've had to re-learn how to lay out data in the document world: you must validate and maintain data differently, change access patterns, and cater to the strengths of the platform.
23rd May 2012