Kafka and Hadoop
Getting data from Kafka to Hadoop should be simple, which is why the community has so many options to choose from. Cloudera engineer, Gwen Shapira, reviews some popular solutions: Storm, Spark, Flume and Camus. She goes over the pros and cons of each, recommends use-cases and future development plans as well.
Practical Machine Learning for Engineers
Mining Value from Massive Data Sources
Dr. Shah, team lead at Thomson Reuters R&D, discusses strategies that have (and have not) worked in dealing with massive datasets.