Get Lanyrd on your mobile (iPhone, Android and more) - check it out here

DataWeek schedule

Monday 24th September 2012

  • Data Science: People, Algorithms, or Data?

    by Josh Wills, Jeremy Howard, Chris Pouliot and Jacob Spoelstra

    What is the formula for the most effective data science project: the best people, the best algorithms, or the best data? Can we imagine algorithms-as-a-service or predictive analytics that disintermediate the data scientist? Will the data science market get fragmented to the point that I may use separate vendors for sourcing the data, crunching the numbers, and interpreting the results? Do we really have a shortage of data scientists or a shortage of technologies that make data science accessible?

    At 10:00am to 11:00am, Monday 24th September

  • Technologies Behind the Big Data Revolution

    The Big Data Revolution is about startups and corporations extracting incredible value- insights- and user functionality out of increasingly large amounts of data. Why this revolution is happening now in 2012 is partially because of a cultural awareness of the value of data but is in the scope of this panel largely caused by new Hadoop, NoSQL, NewSQL, unstructured and semi-structured data technologies enabling the Big Data Revolution. How have big data tools given corporations, startups, or organizations new power over data that was never before possible? If this trend continues, where will this Big Data Revolution lead us?

    At 10:00am to 11:00am, Monday 24th September

  • The Emerging Cloud Stack

    by Chad Arimura

    What is causing the rise of real-time 3rd party cloud infrastructure? Companies once developed their IT Infrastructure internally- but as they bring their technology stack online they are making greater use of platform-as-a-service, processing-as-a-service, or other API-centric components of the new cloud stack. Is it the need for real-time data or the need for infinite scalability that is defining the new cloud stack? What does future plug-and-play IT infrastructure look like?

    At 10:00am to 11:00am, Monday 24th September

  • Using the cloud and data to evolve from website to customer delight

    by Oren Michels

    Description: The volume of data created by devices and our digital lives is changing the game of reaching customers online. In the post website era, demographics aren’t enough. Customers want, and expect, to be engaged with when and where they need a brand or its services. The customer engagement game today is won with apps that fast-forward a brand beyond the website by tailored combinations of internal customer data, external social data and cloud-based services. A common hurdle to moving beyond the browser is effectively merging current customer data with essential new factors to do more than shrink a website. Too often, marketing and IT both stumble on critical choices of how to inform apps with where or when or why a customer wants and needs a brand: integrating quickly with partners for data, platforms and development.

    In this talk, Oren Michels, co-founder and CEO of Mashery, will discuss how smart brands in retail, travel, media, healthcare, online services and more are using APIs to power combinations of data for apps that grant wishes for customers, not just shrink websites. He will demonstrate use cases and share insight on how APIs accelerate effective integration of traditional databases, realtime data intelligence and other cloud-based tools to curate insight for business and delight for customers.

    At 10:00am to 10:50am, Monday 24th September

    In 111 Minna Gallery

  • Building a Developer Ecosystem

    by Tim Anglade

    Session Description: Winning in the data and cloud space means making developers happy. Whether your API offers a data feed, infrastructure, or cloud services: how do you build a developer ecosystem around your API? Is a "developer-friendly" API one with dozens of SDK's, one where you help develop mashups for thousands of services, or one where your cloud service is simply superior?

    At 11:00am to 12:00pm, Monday 24th September

    In SPUR Urban Center

  • When is Hadoop Your Best Solution?

    by Stefan Groschupf

    Big data means- in the strictest sense- a set of new technologies that help you manage or process more data faster than ever before. With a list of Hadoop users including Facebook, Yahoo, IBM, AOL, Microsoft, and most major tech companies: when is Hadoop realistically the best solution on the market for your big data problem?

    At 11:00am to 12:00pm, Monday 24th September

  • Curing RESTless-ness: Struggles and Strategies for API Security

    by Rob Richards

    Developers may approach API and Web application security the same, but often learn the hard way that API security in real world environments is much more complex. Securing an API for business use involves effective measures to thwart threats, and is rarely without common security missteps unique to device types, operating systems and more. During this presentation Rob Richards, Mashery’s Chief Architect, will discuss how API providers are effectively addressing common issues such as authentication and authorization, data privacy, as well as payload encryption. Rob will also cover examples of advanced API-related attacks and vulnerabilities that are affecting platforms today, and effective methods for detection, debugging and seeking support to expose and remedy potential threats.

    Attendees will learn: API threat zones from the provider to consumers, authentication and authorization, privacy concerns The characteristics of common API attacks, and what makes apps vulnerable to them OAuth and other schemes in use today by some of the biggest API providers Upcoming standards for using signatures and encryption with JSON data structures Effective tips for using common debugging tools, and effectively tapping communities, to accelerate discovery and remediation of vulnerabilities in app code

    At 3:00pm to 3:50pm, Monday 24th September

    In SPUR Urban Center

  • Data Startups - From Investment to Exit Strategy

    by Savitha Srinivasan, Mike Dauber and Jim Fowler

    As we watch the acquisitions of Vertica and Jigsaw as well as the IPO\'s of Linkedin and Splunk, we need to ask: what is the strategy for moving your data company from investment to IPO or acquisition? If companies are raising funds and getting acquired across the data stack: from data storage and backup to the API or application-layer, what are the strategies for getting your data company from investment to IPO or acquisition? We ask the private and institutional venture capitalists, the acquisition partners, and the entrepreneurs who have made the journey what they\'ve learned from their own paths to exit.

    At 3:00pm to 4:00pm, Monday 24th September

  • Building Your First MongoDB Application

    by Sridhar Nanjundeswaran

    This talk will introduce the features of MongoDB by walking through how one can build a simple location-based checkin application using MongoDB. The talk will cover the basics of MongoDB\'s document model, query language, map-reduce framework and deployment architecture.

    At 4:00pm to 5:00pm, Monday 24th September

  • Looking Inside the God Box of the Internet

    by Greg Lindahl

    In presentations, a box in the middle of a slide with everything connected to it is often derisively called a "God Box". On the Internet, though, we really do have a God Box -- search engines are at the center of everyone's navigation and discovery, and search engine crawls contain an incredible amount of structured and unstructured data about anything and everything ever mentioned on the Internet.

    At 4:00pm to 5:00pm, Monday 24th September

  • An Introduction to MongoDB's Tools to Technologies to Efficiently Process Large Quantities of Data

    by Bryan Reinero

    Topics Include: • Map-Reduce Useful for batch processing of data and aggregation operations. It is similar in spirit to Hadoop with all input coming from a collection and output going to a collection. Map and reduce functions are written in JavaScript and execute on the server. • The Aggregation Framework is an alternate to Map-Reduce while providing a powerful means to calculate aggregate values. The aggregation framework can be used to perform SQL-like group-bys, distincts and self-joins. The aggregation framework also provides projection facilities which reshape data, add computed fields, create virtual sub-objects, and to extract sub-fields to results. • Hadoop Integration is a series of plugins for the Hadoop platform to allow connectivity to MongoDB The Hadoop connector allows both reading MongoDB data into Hadoop (for use in MapReduce jobs as well as other components of the Hadoop ecosystem), as well as writing the results of Hadoop jobs out to MongoDB.

    At 5:00pm to 6:00pm, Monday 24th September

  • Technology Hiring Mixer

    The Technology Hiring Mixer connects software engineers, web developers, and data scientists to tech company hiring managers.

    At 6:00pm to 7:00pm, Monday 24th September

Tuesday 25th September 2012

  • Future of the Business Cloud

    by Ash Jhaveri and dimitri sirota

    Bringing a business into the cloud is not just a matter of virtualizing its infrastructure and data: the \"enterprise cloud\" enables new web-based deployment, scalability, and shareablility with outside organizations. How is bringing the enterprise into the cloud a larger opportunity (or problem) than simply virtualizing its current infrastructure? How is the enterprise using the cloud in new ways: from joining an API mashup ecosystem to linking together its own apps or connecting to outside organizations?

    At 10:00am to 11:00am, Tuesday 25th September

  • ROI of Big Data: Turning Big Data Into Dollars

    by Raj De-Datta, John Santaferraro and Michael Warjas

    Amidst all the talk about Big Data technologies crunching terabytes or petabytes of data, what is the real ROI for a company\'s investment of hundreds of thousands if not millions of dollars in its Big Data technologies? Have compliance, intelligence, or simply the requirement of new scales of data incentivized the Big Data transformations- and what is the most direct path for converting Big Data into profits and corporate growth?

    At 11:00am to 12:00pm, Tuesday 25th September

  • The Marketplace of Data-As-A-Service

    by Dhruv Bansal

    Are we witnessing the fragmentation of \"Data-as-a-Service\" companies across every industry vertical, or what types of technologies will emerge to help organize the new marketplace of \"data-as-a-service\"? What is the \'state of the industry\' for data-as-a-service companies, and has the business model of data become popular in mainstream corporate culture yet?

    At 11:00am to 12:00pm, Tuesday 25th September

  • Managing Extremely Big Data in the Cloud

    by Siraj Khaliq

    What are the challenges of hosting, managing, and searching extremely large amounts of structured or unstructured data in a 3rd party cloud? As the enterprise and organizations move increasingly larger amounts of data online, we are witnessing a new set of technologies and business practices to tackle this problem.

    At 3:00pm to 4:00pm, Tuesday 25th September

  • Social Enterprise and Social Analytics

    by Cameran Evans

    Collaboration, productivity, and knowledge management tools are turning the enterprise into a real-time communication platform for recognizing good ideas, failing projects, or highly contributing individuals on a transparent software platform. What is the vision of the Social Enterprise, and how can managers best use data and analytics from social enterprise technology to optimize the effectiveness of their organizations? Do these technologies break down or reinforce departmental or organizational walls? Are we approaching a day when a digital social layer cuts across multiple enterprises on joint resource sharing, or will competitive risk outweigh the net gains?

    At 3:00pm to 4:00pm, Tuesday 25th September

  • The Customer Intelligence Ecosystem

    by Phil Davis

    How do I more effectively sell to customer leads I already have? Sales is just as much a matter of your customer intelligence ecosystem- how your source and integrate customer data- as it is a matter of having a hot-shot sales team. There is a new ecosystem of data providers, integrators, data cleaning and deleting technologies, and customer data platforms for improving the performance of your existing sales database. What new data types are changing the way we sell to consumers and businesses? Are innovations occurring more in email and consumer-level ad retargeting or around sales team data for phone calls and prioritizing meetings and proposals? What types of data do sales people wish they had in order to be more effective?

    At 3:00pm to 4:00pm, Tuesday 25th September

  • ROI For Enterprise Social? You're More Likely to See Bigfoot!

    by Michael Weir

    "In reality we have very little to prove the worth of Enterprise Social. We have some academic studies, we have some anecdotal evidence, a few (very few) published use cases where metrics are involved, and we have a whole lot of “it makes sense, just look at it.” In this panel, we will open up the discussion and try to crack the ROI code for Enterprise Social. Will it be possible for the industry to agree to some metrics or will smart enterprises create their own based on their specific measurable goals and targets? • Will soft metrics like \"\"follows\"\" and \"\"likes\"\" morph into things that can be actually measured and have value attached? • Is it even possible to find areas to measure and improve within the current Enterprise Social Landscape? • Ultimately these tools have to add to the bottom line. Any \"\"real\"\" case studies out there?\"

    At 4:00pm to 5:00pm, Tuesday 25th September

  • The Power of the Human Cloud For Enterprise Labor

    by zack kass

    Crowdsourcing is an effective labor tool that is shifting the traditional working paradigm and challenging the outsourcing industry. By tapping into millions of people around the world, to work on simple tasks that only humans can do, the “human cloud” creates a scalable workforce that can respond in real time. In this discussion we will look at how CrowdFlower is able to provide a platform for enterprises to adopt crowdsourcing as a means to solve various business solutions that are flexible, scalable, and accurate. Key Learning Points: •How crowdsourcing is becoming mainstream technology •How and why crowdsourcing will impact the future of work and labor • Examples of how crowdsourcing can be used •The values of crowdsourcing relative to outsourcing in terms of time, scale, and cost

    At 4:00pm to 5:00pm, Tuesday 25th September

  • SF Beta: Dataweek Edition

    Part social mixer, part interactive demo day, SF Beta is the Bay Area\'s premier startup mixer. This September, it\'s also the kickoff party for DataWeek. Join us for cocktails, hang out with hundreds of data geeks, and meet 12 up-and-coming data startups demoing all night. The fun takes place at the legendary 111 Minna Gallery in the heart of SoMa.

    At 5:00pm to 6:00pm, Tuesday 25th September

Wednesday 26th September 2012

  • Keynote: Gurbaksh Chahal

    by Gurbaksh Chahal

    At 9:00am to 10:00am, Wednesday 26th September

  • Advertising and the Social Graph

    by Yan Qu

    How is social data such as social connections, sharing activity, and shared interests create new advertising opportunities? Some of the internet\'s largest web companies are actively innovating around social data advertising- but what types of social data are most actionable for an advertiser? Is social advertising via open social graph data as effective as closed social graph data? Are there different concerns around personal data privacy and social data privacy?

    At 10:00am to 11:00am, Wednesday 26th September

  • How to Leverage Analytics With Facebook's New Insights & Analysis

    by Evan J. Andrews

    Facebook is changing rapidly, and their analytics software (Insights) is changing along with it. In this panel our social media and analytics experts will teach you how to analyze the activity on your Facebook Page using data from Facebook Insights & Analysis and get the maximum reach and brand lift from your social media presence.

    At 10:00am to 11:00am, Wednesday 26th September

  • The Data-Driven Marketing Campaign

    by Russell Glass

    Digital agencies and brands have a continuously updated array of technology options for managing their client\'s marketing campaigns for building visibility and acquiring customers from SEO / SEM and Social Media to Display Advertising and Retargeting. Does every product in the marketing funnel require its own data ecosystem for targeting ads, tracking marketing effectiveness, and analyzing results? Is the difference between direct advertising data, social media data, and SEO data incredibly great - and how are these marketing technologies converging or sharing information?

    At 12:00pm to 1:00pm, Wednesday 26th September

  • Influence Data and the Pagerank of People

    by Hayes Davis

    A consumer 2.0 world is one where influencers who have the most reach and engagement with their followers get extra deals, discounts, and memberships because of their social influence. Similarly, not all complements or insults to your brand are created equal- brand managers need to prioritize the issues of influencers above those without a social media following. How do the algorithms work beneath this PageRank of People? How far are we from an age when my social influence score affects by job prospects or my price of consumer goods?

    At 3:00pm to 4:00pm, Wednesday 26th September

  • Webscale: Analyzing the Entire Web of Data

    by Andrew Dumont

    Big Data tools have made it exponentially easier to index and analyze the entire internet. What one website sees as back-links, RSS feeds, and meta-tags, Big Data sees as insights into advertising, trends, and topical relevance across the web itself. From SEO and pagerank analysis to news discovery and contextualization, what new business models are possible when a tech startup can use big data tools in order to index and analyze the entire web? What are the challenges to indexing and analyzing the web that push modern Big Data tools to their limit?

    At 3:00pm to 4:00pm, Wednesday 26th September

  • Gamification Analytics

    by Patrick Salyer

    Companies use gamification technologies and practices to increase customer acquisition, engagement, and retention. Beyond the badges, credits, and leader boards, what are the metrics of gamification? Do brands or applications need to track new types of data or utilize new analytic methods to make gamification more fun or sticky? This panel asks the leaders in the gamification space to elucidate the underlying metrics and science of a new gamified web and mobile world.

    At 4:00pm to 5:00pm, Wednesday 26th September

  • Entertainment & Digital Media Data

    by Max Goodberg

    Between viral videos, streaming music, and online gaming, the ability to deliver digital media and entertainment to end users is more than just content creation. The ability to collect, analyze, target, and act on entertainment & media data is the next major paradigm shift in entertainment delivery. What makes the data behind entertainment & media analytics unique in a world focused on big enterprise data? How can leveraging entertainment data influence the entertainment industry itself - will we see the rise of personalized entertainment or perhaps hyper-targeted ads inside of the media?

    At 5:00pm to 6:00pm, Wednesday 26th September

  • Turning Craigslist Into a Firehose For All Developers

    by Greg Kidd

    Craigslist, more than any 1.0 web offering, has flown the luddite flag whether for user or developers. Its low tech UI/UX hasn\'t changed in a decade. At a time when Amazon, eBay, Etsy, and social networks have all built themselves into API based platforms with vibrant developer communities, Craigslist has spent considerable technical and legal resources discouraging 3rd party developers from developing new products and services for the 1.5M new postings that are added to the site each day. But that status quo has changed with 3taps and the treasure trove of facts about CL local exchanges is now available to any and all comers.

    At 5:00pm to 6:00pm, Wednesday 26th September