Kafka is a distributed pub-sub system that handles streaming data and provides the ability to load data directly into Apache Hadoop. It provides a highly performant messaging system combined with an simple, extensible API. Kafka is currently in production at LinkedIn and was recently open-sourced. Learn more at http://sna-projects.com/kafka/
Apache (Hadoop|Giraph|Kafka) (Committer|PMC). Focusing on Higher Level Languages On Hadoop (HeLLisH). (Smart|occasional dumb)ass. bio from Twitter
Sign in to add slides, notes or videos to this session