Tuesday 16th June, 2015
3:30pm to 4:00pm
Recommendation systems have been very successful in helping companies to keep engaging their existing users as well as to attract new users. At MyFitnessPal, our goal is to recommend the most relevant and healthy foods/recipes to our users, and some important implementation considerations are:
1. change in user's preference over time, which means that RecSys pipelines should be updated frequently.
2. 15B+ food entries generated by our 80M+ registered users, which makes it hard to develop a production-ready, well fine-tuned and fully scalable RecSys in a timely manner.
In this talk, I will present how we used Spark to address the scalability, flexibility and easy-to-develop aspect of our RecSys pipeline taking into account the above mentioned implementation considerations. Overall, Spark has been very effective tool letting us focus on more fundamental aspects of modeling while minimizing the extra burden of dealing with very complex parallel processing of data.
Sign in to add slides, notes or videos to this session