Mining Value from Massive Data Sources
Dr. Shah, team lead at Thomson Reuters R&D, discusses strategies that have (and have not) worked in dealing with massive datasets.
Three Reasons a Data Engineer Should Learn Scala
There has been a lot of debate over Scala lately, including criticisms like this, this, this, and defenses like this and this. Most of the criticisms seem to focus on the language's complexity, performance, and integration with existing tools and libraries, while some praise its elegant syntax, powerful type system, and good fit for domain-specific languages.
How Machine Learning Drives Personalization at TellApart
TellApart Software Engineer Nick Gorski takes us through a technical deep-dive into TellApart's personalization system. He discusses the machine learning data pipeline at TellApart that powers the models, real-time calculations of the expected value of shoppers, and how to translate that value into a bid price for every bid request received (hundreds of thousands per second).