by Simon Rogers
90,000 items on Afghanistan, 291,000 on Iraq - and another 251,000 cables. Managing the Wikileaks release is just one of the huge data journalism projects the Guardian's data team has embarked on. This talk will look at how journalists can make sense of data, get stories out of it and our role in supplying open data to the world.
by Alistair Croll, Toby Segaran, Amber Case and Bradford Cross
The convergence of big, open data, ubicomp, and new interfaces will change the way humans work, play, learn, and love. It's a slow transformation that happens one tweet, one blog, and one game at a time -- but it's also an inexorable road towards the singularity. In this panel discussion, we'll look beyond the bytes and algorithms to think about humanity awash in a sea of information.
by Ed Boyajian
Companies must choose to spend their money and time on the right software initiatives. With exploding volumes of critical data, getting new insight and mastery over business operations demands new investments in BI at multiple levels. Ed will show a proven path for how to avoid exorbitant database software fees and shift that spend to be used in areas like BI where you can realize a stronger ROI.
by Barry Devlin
For more than 20 years now, data warehousing has put manners on unruly enterprise data. Yet, physics tells us that disorder inexorably increases unless we endlessly fight it. As information volumes and types explode into chaos, is it time to declare the warehouse dead? Or we could move from classical to quantum physics and create a new information architecture. It’s time to make some new choices…
by DJ Patil
by Scott Yara
A defining characteristic of modern life is the incredible proliferation of digital information. The Economist estimates that the amount of information created each year is growing at a 60% compounded rate. According to the Harvard Business Review, we humans generated more data last year than in all of previous human history.
by Carol McCall
In 2001, the Institutes of Medicine declared that “between the care we have and the care we could have lies not just a gap, but a chasm,” yet nothing’s really changed. Healthcare remains one of the most richly endowed yet poorly equipped knowledge industries anywhere. Using real world examples, we’ll see how BIG DATA may be just what the doctor ordered, but only if we pick the right problems.
by Marshall Kirkpatrick, Simon Rogers and Jer Thorp
After Kennedy, you couldn't win an election without TV. After Obama, it was social media. But tomorrow's citizen gets their information from visualizations. In this panel, three acclaimed designers show how they apply visualization to big data, making complex, controversial topics easy to understand and explore.
by Pablo Castro
Sharing data on the Web comes with a tough trade-off between minimalism and enabling creative new scenarios. This session will explore Web APIs that focus on exposing data and let clients decide how to use it. We'll share our experiences while designing the Open Data Protocol (odata.org), what we found to be great and terrible ideas and what we hear from folks running OData Web APIs.
by Stephen Sorkin and Narayan Bharadwaj
From customer behaviors & usage statistics to security postures & operational analytics, Splunk's ability to make sense of all types of machine data, structured or unstructured, and mash it up w/ other business data provides complete real-time visibility & operational intelligence. This tutorial demos a new approach for analyzing your organization's petabytes of data to derive real-time insights.
by Tim Guleri, Roger Ehrenberg, Paul Kedrosky and Ping Li
The ability to collect, crunch, act upon, and share huge amounts of data disrupts nearly every industry, tearing down barriers to entry and creating entirely new businesses. This panel of investors will discuss where they see the opportunities in the Big Data industry, and how they think about the value of new ventures in the space.
by Eric Day
The OpenStack project was launched last summer by Rackspace, NASA, and a number of other cloud technology leaders in an effort to build a fully-open cloud computing platform. It is a collection of scalable, standards-based projects currently consisting of OpenStack Compute and OpenStack Object Storage. This session will introduce the projects and describe how they can help manage your data.
by Ben Gimpert, Margaret Francis, Ryan Strynatka and Josh Merchant
Today's web analyst has moved far beyond funnels and visitors. Automated systems decide who gets what content, and language parsing tries to distill sentiment from millions of online interactions. This panel will look at where web analytics is headed, and how new algorithms and approaches are yielding fresh insights into online commerce.
by Alex Howard, Peter Clark and Steve Midgley
Open access to information promises to connect citizens to their representatives, improving government transparency and helping educators transform the classroom. In this real-world panel, practitioners in government and the public sector will give us a glimpse into how data and new interfaces are transforming how we teach and govern.
by Kevin Weil
Most analytics systems rely on large offline computations, which means results come in hours or days behind. Twitter is all about realtime, but with over 160 million users producing over 90 million tweets per day, we need realtime analytics that scaled horizontally. This talk discusses the development of that infrastructure, as well as the products we are beginning to build on top of it.
Birds of a Feather (BoF) sessions provide face to face exposure to those interested in the same projects and concepts. BoFs can be organized for individual projects or broader topics (best practices, open data, standards). BoF topics are entirely up to you. Thursday's Lunchtime BoF sessions will happen on the hotel side of the Hyatt Regency, Mezzanine Level.
Live demonstration of ambient computing using projector-camera pairs to scan the room and place interactive simulations into the space. All surfaces are rendered interactive. We will demonstrate a 3D sandtable for firefighter training and STEM education where the 3D sand becomes and interactive surface.
by Vineet Tyagi
Organizations today possess massive data - in tera- and petabytes - that needs to be effectively collected, stored and processed. Hadoop is a cost effective option that helps manage this big data. To derive real returns from these big data systems, one needs to extract useful insights and business intelligence.
The rise of sensor network data and the expectation for low latency query responses combine to obsolete available databases and storage platforms. We have built a platform for web-scale OLAP and in this talk I will cover how we made our infrastructure capable of real-time update and query performance over hundreds of terabytes of multidimensional data.
Data integration and viz technology have given rise to an appetite for government data–the Gov 2.0 movement. Do government agencies have good data? Sort of: I believe that an understanding of data limitations has gotten short shrift in the drive to develop the next app. I'll discuss why a knowledge of the complexities of government data is crucial to building quality decision-making tools.
by Creve Maples, Ph.D.
We will discuss the impact of the information explosion, the effectiveness of current technological directions, and explore the success that new perception-based, human-computer interfaces provide in analyzing and understanding complex data. Real examples will be used to illustrate that effective man-machine environments are essential in productively dealing with multi-dimensional information.
by Jon Bruner
In a first, Forbes presented all federal campaign contributions by America’s wealthiest people in our September 2010 online edition of the Forbes 400. We combined human effort and homegrown database code to sort through 6 million political donations and find the 20,000 that came from America’s richest people.
by Ben Werther
If you are a leading enterprise or web company, then two things are almost certainly true. Data is the lifeblood of your business. And you face an ever-increasing need to scale your applications and data services.
by Dylan Field, Nolan Goldberg, Brian Fitzpatrick and Jud Valeski
To many people, Big Data means Open Data: social graphs, voting records, weather patterns, and more. But who owns data? Most of our laws were written for atoms, not bits; they're woefully out of date in an information age. When you share data, does it become more or less valuable? If someone adds to your data, is it still yours? This panel will tackle the gray area of data ownership.
Artistic visualizations and infographics tell the stories of rich data in unique, compelling ways and synthesize datasets in ways that allow them to be interpreted, absorbed, and experienced in ways beyond the spreadsheet, pie chart, and bar graph.
by Julie Steele, Ian White, Peter Marney, Moe Khosravy and Dennis Yang
Does information really want to be free? While the Internet is full of open data, there's plenty of data companies are willing to pay handsomely for -- particularly if it's timely and well aggregated. As a result, data marketplaces are a burgeoning business. This panel will look at the market for data, and where it's headed.
by Brian Wilson
New technologies are driving a new era of global collaboration among scientists and researchers. Digital scholarship, the ability to create, collect, publish and collaborate in new digital mediums, is driving the exponential growth of data related to scholarly research. This talk will highlight evolving strategies used to appraise and predict success of institutions and researchers.
by Sunita Shenoy
Ram Peddibhotla, a Director from Intel’s Software and Services Group, will discuss how the future of mobile involves ubiquity across multiple hardware platforms. Specifically, Ram will discuss how open source software will shape the next generation of computing devices, improving compatibility.
by Nicholas Yee and Nic Ducheneaut
Virtual worlds are a goldmine of untapped insights, even for predicting physical behaviors. Not only will we share PARC findings and methods developed to extract key data from online games, but more importantly, we'll discuss how social scientists converted and processed raw behavioral metrics into meaningful psychological variables that can be applied to a broad spectrum of business applications.
by Drew Conway, Christopher Ahlberg, Robert McGrew and Rion Snow
Data doesn't just show us the past—it can help predict the future. Several new firms harvest massive amounts of open data, trying to anticipate everything the right ad placement to the next terrorist attack. In this session, we bring together the founders of these firms to discuss the technology—and ethics—of looking into the future.