Streaming Data Analysis and Online Learning by John Myles White

In this talk, "Streaming Data Analysis and Online Learning," John Myles White of Facebook surveys some basic methods for analyzing data in a streaming manner. He focuses on using stochastic gradient descent (SGD) to fit models to data sets that arrive in small chunks, discussing some basic implementation issues and demonstrating the effectiveness of SGD for problems like linear and logistic regression as well as matrix factorization. He also describes how these methods allow ML systems to adapt to user data in real-time. This talk was recorded at the New York Open Statistical Programming meetup at Knewton.


John is one of the primary developers of Julia, a new language for technical computing. He is currently developing the statistical and machine learning infrastructure for Julia.  In addition, he is one of the residents at Hacker School's Summer 2013 program. John recently finished his Ph.D. at Princeton, where he developed models of human decision-making. During grad school, John co-wrote Machine Learning for Hackers and Bandit Algorithms for Website. Starting in the fall, John will be a research scientist at Facebook.

Get updates of upcoming tech talks and presentations

If you'd like to be notified when we post new tech talks, developer presentations and opensource updates, you can subscribe to our newsletter, or YouTube channel.

Want to hear from more top engineers?

Our weekly email contains the best software development content and interviews with top CTOs. Enter your email address now to stay in the loop.