Large Scale Topic Modeling: Improvements to LDA on Apache Spark - The Databricks Blog