by Dwight Merriman and Eliot Horowitz
Last bugs are finished, testing is complete, business is ready. What do you do next? In this talk we will cover the topics to ensure that you are prepared for a successful launch of your MongoDB based product, including: Machine Sizing - How much CPU, Memory, Disk should I use for my MongoDB? Backup / Restore procedures - What are my options and what do I need to do? Load Testing and Capacity Planning - How much resource is my MongoDB going to use? When do I need to add replicas and shards? Monitoring - What should I be watching and how do I know if things are running correctly?
An unconference session for Java developers at MongoSV.
You’ve probably heard that MongoDB uses Memory Mapped Files and supports Journaling. But how exactly does it all fit together? In this advanced internals talk, we’ll dive into the kernel of MongoDB and look at how the databases uses these functions to persist your data. Expect low level database internals, memory hierarchies, and durability models.
by Paul Pederson
This is an overview of the MongoDB architecture: the storage engine for bson and btrees, the query evaluator, the replication design, and the sharding design. The talk is more broad than deep, and it is intended to give an effective overview of the mongodb design decisions and implementation strategies.
by Richard Kreuter
Open Q & A with the MongoDB development team
This talk will introduce the features of MongoDB by walking through how one can building a simple location-based application using MongoDB. The talk will cover the basics of MongoDB's document model, query language, map-reduce framework and deployment architecture.
There’s lots of talk in the NoSQL space about using the “right tool for the right job”. In this talk, Dwight Merriman, co-founder of 10gen and the MongoDB project will talk about the history of MongoDB, going back to his years of building high performance systems at DoubleClick through to today. Dwight will also cover the design decisions that went into creating MongoDB and the use cases it’s particularly well suited to.
by Curt Stevens
With over 1400 mongo instances deployed we have learned much about how and what to monitor, how to triage incidents and we have developed a set of in-house tools to help us with diagnosis and forensic analysis of day to day issues. We will share some insights we have gained in a year of building & maintaining a reliable and scalable mongo service for a variety of Disney products.
by Greg Pendler, Dale Russell and Brian Knox
Today's real-time world leaves nothing to wait. With more data to process, and less time at hand, pairing MongoDB with Fusion-io flash memory solutions is a winning combination. Meet a couple of customers – Kontera and Aggregate Knowledge – and learn how you can harness the power of MongoDB along with the speed of NAND flash memory today!
by Kenny Gorman
This talk goes over various performance tuning techniques used in real world examples from our various implementations of MongoDB at Shutterfly. We will cover various techniques including usage of the profiler, query tuning, monitoring for performance, data-modeling, data locality. I will also discuss our implementation of Facebook Flashcache for MongoDB.
by Kyle Banker
An unconference session for Ruby developers at MongoSV.
Two very different MongoDB implementations; both with substantial outcomes. 1) Scaling MongoDB to calculate football team picks, and ranking in realtime, as the results unfold. 2) Leveraging MongoDB's flexible schema to reduce complexity and speed performance for XML data feeds. In both of these examples, we will review the architecture deployed on EC2, and discuss how MongoDB was a crucial component of the optimal solution.
by Richard Kreuter
One of the challenges that comes with moving to MongoDB is figuring how to best model your data. While most developers have internalized the rules of thumb for designing schemas for RDBMSs, these rules don't always apply to MongoDB. The simple fact that documents can represent rich, schema-free data structures means that we have a lot of viable alternatives to the standard, normalized, relational model. Not only that, MongoDB has several unique features, such as atomic updates and indexed array keys, that greatly influence the kinds of schemas that make sense. Understandably, this begets good questions: Are foreign keys permissible, or is it better to represent one-to-many relations withing a single document? Are join tables necessary, or is there another technique for building out many-to-many relationships? What level of denormalization is appropriate? How do my data modeling decisions affect the efficiency of updates and queries? In this session, we'll answer these questions and more, provide a number of data modeling rules of thumb, and discuss the tradeoffs of various data modeling strategies.
by Paul Pederson
This talk describes the life-cycle of a query, starting from when it arrives at mongod:27017, and moves through the query parser, query optimizer, query evaluator, touches the journaling subsystem and the storage engine, then returns a cursor to the result set. This talk focuses on the non-sharded configuration.
by Jared Rosoff
MongoDB supports replication for failover and redundancy. In this session we will introduce the basic concepts around replica sets, which provide automated failover and recovery of nodes. We'll show you how to set up, configure, and initiate a replica set, and methods for using replication to scale reads. We'll also discuss proper architecture for durability.
by Tony Tam
This talk described how Wordnik is taking advantage of the cloud-readiness of MongoDB to improve uptime and reduce costs against it's huge and dynamic English language corpus.
by Eduard Koller, David Makogon and Sridhar Nanjundeswaran
A brief overview of building and deploying applications using MongoDB Replica Sets on Windows Azure followed by an interactive session to discuss recommended configurations and tips (and tricks).
by Raghurama Bhat and Biren Gandhi
Consumer social web is seeing lot of momentum and traction around various NoSQL solutions, including MongoDB. Enterprise world is not lagging too much behind this time around. A number of Enterprise Social Software (aka ESS or E2.0) systems are embracing MongoDB, node.js and other alternative stacks as a replacement for their traditional J2EE architectures. Biren Gandhi, an early architect at Facebook and Zynga will share his recent experiences with MongoDB as part of Cisco's Enterprise Collaboration Software - Quad.
by Daniel Pasette, Mike Dirolf and Bernie Hackett
An unconference session for Python developers at MongoSV.
With over 180,000 projects and over 2 million users, SourceForge has tons of data about people developing and downloading open source projects. Until recently, however, that data didn't translate into usable information, so Zarkov was born. Zarkov is system that captures user events, logs them to a MongoDB collection, and aggregates them into useful data about user behavior and project statistics. This talk will discuss the components of Zarkov, including its use of Gevent asynchronous programming, ZeroMQ sockets, and the pymongo/bson driver.
by Richard Kreuter
Open Q & A with the MongoDB development team
by Mark Atwood
Drop in on this technical talk with OpenShift Evangelist and all-around Open Source guy Mark Atwood. In this talk you'll be able to follow along on your own laptop as we deploy a Twitter-clone Python app built using the Bottle web framework with a MongoDB back-end. Nothing complicated, nothing convoluted. We'll use Git to push code and OpenShift to host our app and MongoDB. We'll take a crack at how to preform snapshots (backups and restores) and effectively tail MongoDB logs while we are at it. If you don't have one, maybe even two MongoDB apps running on the cloud by the end of the talk, without having to pull out your credit card, you're doing it wrong. MongoDB on OpenShift is Big and it's FREE!
Foursquare is a service that makes the real world easier to use. Our production datastore, built on MongoDB, has scaled to more than 15 millon users, more than a billion check-ins, and millions of check-ins per day. This talk will discuss the systems and procedures we use to maintain uptime and performance when running MongoDB in the dynamic, virtualized environment of Amazon EC2.
MongoDB supports a wide range of indexing options to enable fast querying of your data. In this talk we’ll cover how indexing works, the various indexing options, and cover use cases where each might be useful.
An unconference session for PHP developers at MongoSV.
by Kyle Banker
MongoDB has been designed for versatility, but the techniques you might use to build, say, an analytics engine or a hierarchical data store might not be obvious. In this talk, we'll learn about MongoDB in practice by looking at four hypothetical application designs (based on real-world designs, of course). Topics to be covered include schema design, indexing, transactions (gasp!), trees, what's fast, and what's not. Sprinkled with tips, tricks, shoots, ladders, and trap doors, you're guaranteed to learn something new in this interdisciplinary talk.
by Michael Kocher
Technological innovation - faster computers, more efficient solar cells, more compact energy storage - is often enabled by materials advances. Yet, it takes an average of 18 years to move new materials discoveries from lab to market. This is largely because materials designers operate with very little information and must painstakingly tweak new materials in the lab. Computational materials science is now powerful enough that it can predict many properties of materials before those materials are ever synthesized in the lab. By scaling materials computations over supercomputing clusters, we have computed some properties of over 80,000 materials and screened 25,000 of these for Li-ion batteries. The Materials Project is making these materials and their properties available to scientists around the world through a sophisticated web interface. MongoDB is at the core of the Materials Project architecture. It is used to schedule and track quantum mechanical calculations of materials properties on supercomputers, to store and search the results of these computations, and to perform advanced analytics on the computed materials properties.
Open Q & A with the MongoDB development team
by Sridhar Nanjundeswaran and Robert Stam
An unconference session for C# developers at MongoSV.