by Michael Chui
McKinsey’s influential Big Data report has helped define and explain the opportunity created by the torrent of data flowing daily through business. Michael Chui outlines the big picture of data innovation, challenges and competitive advantage.
How does one take thousands of data domains, and tens of thousands of models and algorithms, and make it so that anyone can get answers to their natural language questions?
Stephen Wolfram, the creator of Wolfram|Alpha and Mathematica, will describe how this works in Wolfram|Alpha and what the paradigm of computational knowledge is now making possible.
When it comes to big data insights, how do you know you’re asking the right questions? Hiring data scientists is a good start – we’re seeing their growth both on LinkedIn and at LinkedIn. But even data scientists are not immune from the myriad of hidden pitfalls that keep your key insights out of sight.
Drawing from a deceptively simple exercise that I’ve used to haze dozens of data scientists on their first day, I will discuss the good, the bad and the ugly lessons we’ve learned about asking the right questions, denominators and being a data skeptic.
by Julie Steele
O’Reilly Media’s Julie Steele showcases the finalists and Best in Show winner from the Strata Vizathlon, a data visualization contest produced in partnership with Juice Analytics. (Note: the contest has already closed).
by Jodee Rich
By using social media metrics, TV networks can now get ratings for their programs in real time.
HPCC Systems from LexisNexis® Risk Solutions offers a proven, data-intensive supercomputing platform designed for the enterprise to solve big data problems. HPCC Systems offers a consistent data-centric programming language, two processing platforms and a single architecture for efficient processing. Customers, such as financial institutions, insurance carriers, insurance companies, law enforcement agencies, federal government and other enterprise-class organizations leverage the HPCC Systems technology through LexisNexis® products and services.
This keynote is sponsored by LexisNexis
People correctly assume that in order to profit from big data, you have to acquire the data. We look at some cases of profiting from exploiting innovative ways of acquiring data.
The time has come for policymakers to begin using innovative technologies to analyze data exhaust, in order to protect communities from multiple slow-onset crises that threaten to reverse hard-won progress in human development.
by Bill Hoffman, Jane Yakowitz and Robert Kirkpatrick
Personal data is a an exploding asset class that is currently not being leveraged to inform public policy decisions or mitigate risk. This panel, sponsored by United Nations Global Pulse, will examine the value of private sector data and consider some of the challenges inherent its use.
Aneesh Chopra, the US Federal Chief Technology Officer, and deputy CTO Chris Vein, in conversation with Tim O’Reilly, founder and CEO, O’Reilly Media.
As they saying goes, “If you’re not paying for a product, then you’re the product.” But the world is a better place when human beings are enlisted as sensors—as distributed mechanisms for mapping, understanding, and connecting the world and the data it generates—and not treated as barcodes, mere bits of data to be read, logged and analyzed.
This is a huge blind-spot in contemporary network service design, abetted by our own inability to correctly price privacy disclosures. There are tools, services—and yes, even analytics—waiting to be invented and productized which will make our services into something better than Skinner boxes that train us to undervalue our data and our privacy.
by Simon Rogers
Data increasingly drives news reporting, and the Guardian has been at the front of this change. Simon Rogers, editor of the Guardian’s award-winning Datablog, will talk about how data is directing its coverage.
by Kristian Hammond
As the world of data expands, the challenge of understanding it expands as well. Narrative Science is addressing this challenge with a software platform that uses data to drive the generation of compelling narratives that tell the stories contained within it. The technology tells the stories that are hidden in the numbers.
by Sean Gourley
Disruptive technology shapes the world, defining political, military, financial, and commercial opportunities and threats. Whether originating in academic research, in National Labs, or in privately held or public companies, these technologies can emerge with explosive impact, creating and destroying value. Yet there are few tools to track these innovations—at a global scale and at a pace that keeps up with the rate of change.
What if corporate strategists could literally draw a map to find growth opportunities? A technique called semantic clustering analysis makes this possible. When applied to technology entities worldwide, this analysis can reveal not only which innovation areas are thick with competition, but also where in the market there are opportunities, or “white spaces,” ripe for innovation. The result is a data-driven visual tool that can be used to drive corporate innovation strategy.
The data age is having a radical effect on investigative journalism. Reporters need to know how to find stories in data and social media. What are the opportunities in data for journalists, and what does the investigative reporter of tomorrow look like?
Big Data can become an unmanageable business burden if you’re not careful. As your company’s analytics initiatives rapidly grow, you’re going to max out your IT budget if you don’t keep the data as compact, compressed, and storage-efficient as possible. Just as critical, your users will find all the information far too massive to wade through if you don’t deliver targeted subsets to their tablets, smartphones, and other devices for speedy consumption. In this session, Forrester senior analyst James Kobielus will help you understand how to keep your company’s data as small and nimble as practical while scaling it out into the petabytes.
The first three of the five startups that were selected as Launch Pad finalists will deliver their pitches.
by Abhishek Mehta, Richie Prager, Allen Weinberg and Roy E. Lowrance
Coming off the worst financial crisis of our generation, an entire industry (and the global economy) is not just rebuilding the very foundations of a robust banking system, but also rethinking the ‘new normal’ for Banking. Data has rapidly emerged as a key lever for this reboot. Hear from some of the leading business practitioners in the industry:
How tectonic shifts driven by Big Data technologies, emerging Business Models, and Macro Socio-Economic conditions is unleashing an unprecedented redesign of the Global Financial System? and Who are the emerging BIg Data players at the forefront of this shift?
by Coco Krumme
What billions of credit card transactions can tell us about the nature of habits, patterns of human movement, and the structure of urban economies.
Digital media publishers sit astride torrents of data about their content, audiences and advertisers. Every day, the world’s two billion internet users visit millions of pages, and are exposed to more than one hundred billion online advertising impressions. Publishers of digital content need to crunch this data to predict who will visit tomorrow, and how to price the ads those visitors see.
In the past year, Metamarkets has developed a Big Data stack for the media markets, comprised of three pieces:
Our platform is in currently in production with three of the largest media publishers in Europe and United States.
by Bill Cook
The most important question to ask about big data is: what’s in it for my organization? Does it create net-new economic value, increase enterprise agility, improve collaboration, or enable new efficiencies? And if so, in which scenarios? What are the implications for people, process and technologies in any given organization?
In this keynote, Bill Cook, President of EMC Greenplum, will provide insight into today’s business opportunity with big data analytics and three key steps to consider as you begin your journey.
This keynote is sponsored by EMC Greenplum
by Mark Bolgiano, John Bottega and Donald F. Donahue
Donald F. Donahue, President and CEO of the Depository Trust and Clearing Corporation (DTCC), and Paul Sforza, CTO of the US Treasury Department, will discuss America’s first public financial services data utility.
This project, led by Richard Berner and others working under the Office of Financial Research (OFR) provisions of the Dodd-Frank regulatory reform legislation, is being incorporated into our country’s existing information infrastructure to provide consistent, quality data to investors, institutions, and regulators.
An interview with Bradley Horowitz, VP Product for Google+
This is not your father’s weather forecast. Businesses across the commercial spectrum can be positively and negatively affected by weather conditions in deep and sometimes unanticipated ways. Whether they are surprise acute weather events (ie., weather black swans) or prolonged patterns that slowly enhance or curtail product demand, it is hard to find an industry that does not have some sort of operational or financial exposure to the atmosphere.
Hear how companies in sectors ranging from banks managing commodity risk to home centers staging seasonal demand driven products are are analyzing weather in different ways to get ahead, and how they can tap into the unexploited possibilities of the treasure trove of government maintained weather data.
The remaining two of the five startups that were selected as Launch Pad finalists will deliver their pitches.
Consumers are generating an immense amount of text-based data via social and other channels. That’s a treasure trove of insight IF we can can unlock intention, emotion and meaning. How well do computers do it today and what are some of the cutting-edge ways to improve accuracy in the near future?
Startups are in it to make money—whether by breaking even, being acquired, or finding some other exit. But for data-driven startups, turning bits into dollars can be a challenge. There are privacy issues, data ownership concerns, and questions about how to monetize the business. This panel of investors and entrepreneurs will look at strategies for generating money in data-driven startups.
Five startups will be selected to deliver their pitches to the Strata Summit audience, to be judged on Wednesday afternoon.
by Kyle Cranmer
Even the biggest commercial datasets can’t rival those produced at CERN, home of the Large Hadron Collider. As scientists research the conditions that created our universe, the volume of data captured is truly staggering.
20th–21st September 2011