Big Data Pipelines and Use Cases at StumbleUpon

StumbleUpon indexes over 100 million web pages for serendipitous retrieval for over 25 million registered users. Debora Donato (Principal Data Scientist, StumbleUpon) walks through StumbleUpon's big data architecture, data pipelines, mobile optimization efforts, and data mining projects.


This talk is from the SF Data Mining Meetup hosted by Trulia.