In this talk I will explain why I spent the past year building Hadoop clusters for several clients in several markets. I will outline what the Apache Hadoop framework is from a technical point of view, and I will explain why it could be as relevant as its popularity suggests, from both a technical and an application perspective. There will be examples of what you can and cannot do with it and when it makes sense to use such a technology. Last but not least, there will be cool things that you can try at home (or at your office, given time), because you should build a Hadoop cluster too! Code will be involved...
Keywords: Hadoop, Big Data, scalable processing, NoSQL, Cloud, Java
Target Audience: Developers and architects who agree that data is an asset which should be used to extract valuable information.
HBase has seen some interesting deploys of late, whether it's Yahoo! hosting a copy of the Bing crawl (on a thousand-node cluster) or Facebook's real-time analytics taking millions of hits a second. In this talk we will review notable cases of HBase in production and give an update on the current state of "Realtime" Hadoop.
Keywords: hbase, hadoop, apache, big data
Target Audience: Folks interested in big data deploys. This is not a deep technical talk.
by Isabel Drost
Storing ever-growing amounts of data is getting cheaper each year. Building great applications requires data-driven application design: starting with clear guidelines on which data to gather, then building new features based on usage data, and finally discovering deficiencies based on implicit user feedback.
Several projects at the ASF make it possible to jump-start your data-driven applications with existing tools for data analysis, search, storage and data mining. This talk shows how to combine the various tools and helps you understand what pigs, elephants and mahouts can do for you.
Keywords: data mining, machine learning, data analysis, hadoop, mahout, lucene
Target Audience: Attendees should be somewhat knowledgeable in Hadoop; they can be beginners in data mining/analytics.