Beyond Ad-hoc Data Science

A session at QCon San Francisco 2015

Today, tooling for ad-hoc data science is fairly well understood. But when you want to create a repeated process such as analytics or prediction systems, things tend to change with time, and how to deal with such change is not always clear. Columns and features are added and removed. New models are developed. Data errors are discovered and corrected. How can we build a data pipeline system to handle these demands? This talk will discuss some of the systems challenges and solutions that arise when building evolving data science products, and we’ll see how they are addressed at Twitter.

About the speaker

This person is speaking at this event.
P. Oscar Boykin

Sr. Staff @Twitter, Author of Scalding

Sign in to add slides, notes or videos to this session

Tell your friends!


Date Mon 16th November 2015

Short URL


View the schedule


See something wrong?

Report an issue with this session