by Jakob Homan
Kafka is a distributed pub-sub system that handles streaming data and provides the ability to load data directly into Apache Hadoop. It provides a highly performant messaging system combined with an simple, extensible API. Kafka is currently in production at LinkedIn and was recently open-sourced. Learn more at http://sna-projects.com/kafka/