Staying agile in the face of data deluge

A session at Span

As our applications need to process ever more data in ever shorter time, it's difficult to stay sane. The architecture of our applications quickly becomes a monstrosity of different databases, queues and servers held together by string and sellotape. That may work at first, but soon gets ugly. If something goes wrong, it's hard to recover. If features of the application need to change, it's hard to adapt.

Stream processing gives us a route towards building data systems that are scalable, robust, and easy to adapt to changing requirements. In this talk, we will discuss how you can bring sanity to your own application architecture with Apache Samza, an open source framework for distributed stream processing applications.

Apache Samza is used in production at LinkedIn, building upon years of hard-won experience in building large-scale data systems. Even if you're not processing millions of messages per second, like LinkedIn is, you can still pick up useful tips on how to structure your data processing for scale and agility.

About the speaker

This person is speaking at this event.
Martin Kleppmann

Senior Software Engineer, LinkedIn

Martin is a software engineer at LinkedIn, committer on Apache Samza (a distributed stream processing framework), and author at O'Reilly (currently writing a book on designing data-intensive applications). Previously he co-founded and sold two startups, Rapportive and Go Test It. His technical blog is at http://martin.kleppmann.com, and he's @martinkl on Twitter. He is based in Cambridge, UK.

Sign in to add slides, notes or videos to this session

Span

England England, London

28th October 2014

Tell your friends!

When

Date Tue 28th October 2014

Short URL

lanyrd.com/sddbmg

Official event site

london-2014.spanconf.io

View the schedule

Share

See something wrong?

Report an issue with this session