Pablo Delgado is a Senior Software Engineer, he works on building infrastructure for machine learning for Personalized Recommendation Algorithms at Netflix. Previously he was working on the recommendation systems stack for personal restaurant recommendations at Opentable. Pablo obtained a degree in Mathematics and Computer Science in University College London, London United Kingdom, where he worked on Graph based Methods for Collaborative Filtering.
At OpenTable, we help diners find the best dining experiences, wherever they travel. One of the key problems for accomplishing this is providing personalized recommendations. We have been leveraging our large corpus of unstructured reviews to build models to improve the accuracy of these recommendations. We will discuss how we use Spark both for the training of our recommenders, and for the natural language processing of the reviews to generate topic models.
This session is an informal meeting about deploying spark in a multi-user / multi-tenant environment.We will discuss the different deploy alternatives: standalone, mesos, yarn, cook, databricks with their pros and cons. If time allows it a, workflow managers can be covered as well, (ie: airbnb/airflow) This is the perfect place to ask questions and share experiences from your company spark deploys.