by Doug Judd
Over the past five years, there has been an explosion of new NoSQL database technologies whose primary innovation has been the ability to break capacity barriers by harnessing the collective power and resource of large clusters of server class PCs. In this keynote address, Doug Judd, the original creator of Hypertable, will present a past, present, and future look at scalable NoSQL databases, providing insights into the forces that led to their creation, a review of the current state of the field, and factors that will influence their future evolution.
by Mitch Pirtle
Itching to try your favorite new NoSQL technology with your long-time favorite CMS? What‘s this going to do to your code? How will this affect your development process, and what will this look like in the data center? Joomla founder Mitch Pirtle goes through several case studies to demonstrate some unexpected impact of plugging NoSQL in to relational applications.
by Bernd Ocklin
MySQL Cluster is the high availability, low latency storage engine with autosharding and real-time capabilities, used worldwide to ensure that hundreds of millions of mobile phone users are always reachable.
This mature and proven technology is gaining rapid adoption in the web due to its simple scaling model, performance/and integrated HA that are needed to ensure that web services can evolve rapidly.
MySQL Cluster has many attributes that make it ideal for new generations of dynamic, highly scalable applications, including:
This session looks at the existing NoSQL access methods for MySQL as well as the latest developments for the MySQL Cluster storage engine. You can get the best of both worlds – persistence, consistency, rich queries, high availability, scalability and simple, flexible APIs and schemas for agile development.
Learn how cluster as a globally distributed database engine is used in mobile phone networks, how web services profit from its scaleable fully event driven architecture.
by Dirk Bartels
The session focuses on the evolving value proposition NoSQL-database types such as key-value, document, and graph/object stores have and the influence of enterprise application architectures and requirements. After a brief introduction about the principles and the evolution of NoSQL, the presentation discusses critical requirements to make NoSQL work for enterprise application development in contrast to web applications.
The session addresses project managers, software architects and database developers.
by Chris Harris
Business moves fast, very fast. It’s driven by making decision based on facts, but how do you deal with facts that are hours or days old? Periodic data warehouse loads and ETL process no longer scale to meet the increasing business needs in the web economy. Today, modern organizations are combining high performance operational data stores like MongoDB with traditional business intelligence RDBMS solutions to create a hybrid than solves the business need to have fast accurate data for decision making with the ability to interpret and analyze data in every way.
This talk will try to answer the questions
Riak provides a solid storage foundation to build highly concurrent applications. But its simplistic approach to resolving and detecting concurrent modifications to data leave quite a few people scratching their heads. We’ll look at how you can build applications and data structures to fully utilize Riak‘s strengths while still allowing for eventually consistent data.
With property graph databases and NoSQL coming into fashion over the last years, the interest in graph algorithms, graph processing and recommender systems has increased considerably. In this talk, Peter Neubauer is outlining the differences in approach between imperative scripting of graph traversals with Gremlin and nested iterators over a data structure, and declarative techniques like Cypher, a pattern matching language over Neo4j.
Peter will show basic recommendation algos, spreading activation and min/max flow calculations on live demo graphs. Be aware that things might break, but fun and pun is guaranteed.
by Doug Judd
Rediff.com India (Nasdaq: REDF) is one of India‘s top Internet portals, providing email, search, news, entertainment and shopping services to India and the global Indian community. With over 100 million registered users, Rediff.com is one of the largest Indian Internet portals and one of the top email providers worldwide. Rediffmail is Rediff.com‘s popular email service and has experienced steady growth ever since it was launched in 1998. Architected for high availability, Rediffmail is a geo-distributed application served out of three separate data centers. By 2011, the request load generated by the application had begun to overwhelm the underlying storage system, causing frequent outages. Metadata updates in connection with inbox management turned out to be the primary culprit, generating over 75 percent of the storage system request volume. In late 2011, the Rediffmail engineering team re-architected the system on top of Hypertable, solving the inbox management problem and eliminating the associated outages. In this talk, Doug Judd, the CEO of Hypertable Inc., will present the new Rediffmail architecture and describe how it leverages the Hypertable scalable database as a core component.
by Axel Morgner
This session offers a close look at the implementation and benefits of utilizing a graph database, namely Neo4j, to serve as a CMS back-end. As we will see, the classic CMS ORM approach has several drawbacks, such as the inherent complexity of mapping CMS content to tables, or the inflexibility of relational databases for semi-structured CMS data. Graph databases address these challenges by directly storing CMS data as a graph structure in a natural and trivial way, and allowing to alter the CMS schema at runtime. Along with a live Demonstration the talk will provide a short introduction of general CMS use-cases and requirements, and will cover the specific strategy and data model used to build open-source CMS structr, which can be applied to other cases as well.
Apache Cassandra is a distributed database built to handle massive amounts of data on large clusters of commodity servers. This talk will present the storage engine at the core of Cassandra, motivating the use of a structure similar to a Log-Structured Merge Tree rather than of a usual B-Tree and its implications for the data model. We will also introduce most of the current features of that engine (secondary indexes, integrated caching, TTL etc.) including recent developments introduced in Cassandra 1.0 like compression/checksumming and the new leveled compaction.
This talk will give an introduction to ArangoDB. ArangoDB is a new open source NoSQL database, whose goal it is to be a powerful universal database. It allows flexible data modelling: as key-value pairs, documents or graph data. It allows complex queries through an elegant query language. An embedded JavaScript interpreter allows you to add functionality to the database for transactions, complex manipulations of the objects, treating AvocadoDB as a complete application server – database stack. ArangoDB guarantees durability through MVCC with append-only journals and zero-administration availibility through synchronous replication. It utilizes main memory and SSDs to deliver maximal performance. Even though ArangoDB is still a young project, it has already attracted international attention, especially in Japan and the USA. The community is currently working on APIs for Python, D, Ruby, Java and node.js.
Recommendation engines have changed a lot during the last years and the last big change is NoSQL, especially Graph Databases. With this presentation we intend to show how to build a Graph Processing technology, based on our experience in doing that for environments like Digital Libraries and Movies and Digital Media.
First, we will introduce the state of the art on context aware Recommendation Engines, with special interest on how people are using Graph Processing, NoSQL, systems to scale this kind of solutions. After an introduction to the ecosystem, the next step is to have something to work with. So we will show the audience how to build a Recommendation Engine with a few steps.
The demonstration part will be made using the next technology stack: Sinatra as a simple web framework. Ruby as a programming language. OrientDB, Neo4j, Redis, etc. as a NoSQL technology stack. The result of our demonstration will be a simple engine, accessible through a REST API, to play and extend, so that attendants can learn by doing.
In the end our audience will have a full introduction to the field of Recommendation Engines, with special interest on Graph Processing, NoSQL, systems. Based on our experience making this technology for large scale architectures, we think the best way to learn this is by doing it and having an example to play with.
Developing a business web application is still a long process in 2012. Model-Driven Development is at the heart of:
The Wakanda platform – with its NoSQL object datastore - intends to let you create such model-driven applications. The presentation will explain and show how to create the application model, with its business and security rules, coded once, then made available everywhere without being “bypassable”. You will see enhancements to the maintenance and debug process, as everything is centralized in the model on the server-side (while maintaining potential pre-validation on the client to decrease the load on the server).
To add even more consistency, the same language is used everywhere: JavaScript.
Salvatore Sanfilippo will present the new Redis 2.6 released for the first time.
The new Lua scripting feature and the other major changes in this release will be explained showing use cases and comparing the solutions with the ones available in Redis 2.4.
by Luca Garulli
Today, many companies are already using a NoSQL solution to handle a piece of data. What is the next step? Probably the adoption of NoSQL as a total replacement of Relational DBMS. To reach this Goal, NoSQL solutions have to support multiple models and features, that are already available in Relational DBMS. The challenge has started.
Spring always provided sophisticated support for various Java data access technologies. The lately invented Spring Data project now takes the next step and introduces a consistent programming model for non-relational data stores and helps implementing data access layers in a consistent and easy-to grasp fashion - for both the NoSQL stores as well as more traditional APIs like JPA.
The talk introduces the umbrella project, foundational concepts and abstractions and dives down into specialties of particular modules using MongoDB and Neo4J as examples.
by Sebastian Cohnen and Timo Derstappen
Whether IT is CouchDB, Redis, Riak or MongoDB we have investigated, implemented, tested and run several NoSQL databases at Adcloud. Although we are happy with most of our Decisions, we have learned our lessons, which we would like to share. We try to choose the right database for the right job. But sometimes it is hard to define the job.
NoSQL is about to reach the tip of the hype cycle. Therefore, it is time to take a break and thoroughly analyze the NoSQL trend.
This talk will trace NoSQL back to the early days of database approaches and will then reflect on NoSQL‘s current state and future: Is NoSQL as a technological approach to data storage just a feature or is it a unique category of its own? Is NoSQL a fundamentally new thing that will stay as a trend? What are current NoSQL systems lacking? Will there be a technology or trend that could disrupt NoSQL?
by Matt Casters
With the arrival of a new armada of NoSQL databases chances Are increasing constantly that you will have to integrate data from one of them. So in this talk I‘ll be going over crazy data integration between various relational and NoSQL databases.
Obviously, I‘ll be giving a short introduction Pentaho Data Integration (Kettle), the graphical programming and design involved. I‘ll also be giving a short overview of the NoSQL landscape so that you can get an idea of what‘s going on in that space.
by Peter Idestam-Almquist
New real-time applications require databases that can process larger volumes of transactions than what is possible with current market-leading relational databases. And while more recent NoSQL databases are able to scale out so that they can handle extremely large volumes of data, they cannot guarantee consistency. While this is acceptable for some types of data such as social network bulletin boards, it does not work for most business critical data such as stock quantities or money. But what if there was a happy medium? A database that could process millions of transactions on a single machine, while maintaining the consistency that these business critical applications require? There is! In this presentation, Peter will describe a different kind of a NewSQL database, which will process millions of ACID-compliant transactions while scaling in on a single machine to fully ensure consistency. He will also delve into the technical differentiators between traditional relational databases, NoSQL and NewSQL databases and what applications they are best-suited for.
by Jan Lehnardt
Couchbase Server 2.0 is a love child of memcached and Apache CouchDB. This talk introduces the Couchbase architecture, and features it also demonstrates integration challenges as well as our solutions. Couchbase is an Apache 2.0 licensed open-source project, based on the existing memcached, Membase and CouchDB technologies, that aims to solve your data storage problems in a flexible, fast and reliable manner. In this talk, you will learn how it all works.
Couchbase provides clustered storage management, in-memory, disk-based and hybrid working sets, a high-performance key-value store with dynamic query capabilities as well as cross-cluster replication this allows you to mix and match what aspects of CAP your application needs. All which a familiar memcached-compatible APIs and no need for you to adjust your applications.
The Cassandra distributed database has added many new features over the last year based on real-world needs of developers at Twitter, Netflix, Openwave, and others building massively scalable systems. This talk will cover the motivation and use cases behind features such as secondary indexes, Hadoop integration, SQL support, bulk loading, and more.
by Tim Lossen
Wooga creates casual social games, and we approach each new game as a greenfield development project. This gives us the chance to experiment with different approaches and to try out promising new technologies.
In this talk, I will explain how our back-end architecture has evolved over time, and how our search for the "perfect backend stack"" has (so far) progressed through three distinct phases: from sharded relational database, over various NoSQL databases, to the question: “Do we really need a database, after all?”
I will share what worked well, what didn‘t, what lessons we have learned on the way - and which other directions we would like to explore in the future.
Automatic encryption of data at rest, RESTful web services, JSON support: The Rocket U2 databases (UniData & UniVerse) provide an enterprise NoSQL solution already in use everywhere from emergency systems to major financial institutions. Come learn about this proven, high-performance database technology running in thousands of organizations around the globe.
by Luca Garulli
This talk presents the OrientDB NoSQL Open Source project and its documentgraph capabilities. NoSQL products promise big performance and scalability at the cost of many compromises like tran-21 Abstracts Wed 30 May sactions, aneasy query language and constraints. OrientDB offers a flexible model where it can be used in different ways depending on the use case.
This presentation deals with the OrientDB features and some different use cases where it can be applied.
Relational Database Management Systems were hardly ever used at Google to store logs data. But SQL as a data exploration language has been becoming increasingly popular over the last years. In fact, nowadays the fast majority of data analysis jobs at Google are expressed by SQL queries. In this talk I give a brief history of data storage and exploration techniques deployed at Google and describe the current state of the art. I furthermore explore why SQL became so popular and illustrate my main conclusion: SQL‘s simplicity and conciseness rules over all other (currently available) alternatives.
by Konstantin Osipov
Tarantool/Box is an open-source, transactional database server. Its key property is high efficiency (hundreds of thousands of requests per second on a commodity server) combined with high level of customization, achievable with Lua stored procedures.
Tarantool utilizes lock-free cooperative multi-tasking environment and asynchronous I/O to achieve minimal overhead per request. Each request, which can be either a simple GET/PUT, classical for NoSQL systems, or a custom data access script in Lua, is run in a multi-version transactional environment.
The key advantage of Tarantool/Box, compared to existing open-source NoSQL solutions, is the ability to execute parts of application business logic automatically on the server, with sufficiently high performance. Sufficiently means that you may never need to shard.
Tarantool is written in C and Objective C. The server code and Drivers are available under terms of simplified BSD license. In the talk I’ll demonstrate key features of the server using examples in Lua, Ruby and Erlang.
by Brian Clark
Join Brian Clark, the VP of Product Management at Objectivity, in a discussion of the latest trends in Big Data Analytics, defining Big Data and understanding how to maximize your existing architectures by utilizing NOSQL technologies to improve functionality and provide real-time results. There will be a focus on relationship analytics as well as an introduction to NOSQL data stores, object and graph databases, such as the architecture behind Objectivity/DB and InfiniteGraph.
by Pavlo Baron
If you want to know about the principles and pitfalls of building distributed systems, you can either make your way through a whole pile of pretty prosaic theoretical books or you can just come to this talk and understand some of them real fast by watching a little playfull demostration. It‘s your choice.
by Vincent Delfosse
During the lifetime of a building, a huge quantity of information will be captured by multiple actors: owners, managers, users, architects, electricians, plumbers, etc. As most of this information is not shared between these persons, the same analysis, verifications or measurements have to be done multiple times. SpatioData is a research project aiming at the development of a collaborative platform to support the effective sharing of such diverse data.
In order to reach this ambitious goal, it is important to provide the users with devices suiting their needs (mobile or station) and interfaces adapted to their specific activities. These various client applications will communicate to a centralized database through WebServices exhibiting a common data model. The data model of such an application has to address many problems. Amongst those are:
UML has been chosen for the representation of our data model, as it is a formalized and well-known format in the developers community. But instead of fully developing the data model, then implementing it and providing dedicated services to it, an original approach has been adopted, in the form of UML-oriented WebServices independent of any specific model. This solution has been built on top of a graph database (Neo4J) in a simple but powerful way, providing answers to the given problems above.
This communication will detail the complete system architecture and the process designed on top of it, to make sure thirdparty developers can take full advantage of this platform.