Topic models automatically infer the topics discussed in a collection of documents. These topics can be used to summarize and organize documents, or...
Update August 4th 2016: Since this original post, MongoDB has released a new Databricks-certified connector for Apache Spark. See the updated blog post...
Databricks now includes a new feature called Jobs, enabling support for running production pipelines, consisting of standalone Spark applications. Jobs includes a scheduler...
Today I’m excited to announce the general availability of Apache Spark 1.3! Apache Spark 1.3 introduces the widely anticipated DataFrame API, an evolution...
We’re really excited to announce that Sharethrough has selected Databricks to discover hidden patterns in customer behavior data. Sharethrough builds software for delivering...