Integrating Spark and Solr - Databricks

Integrating Spark and Solr

Download Slides

As more organizations seek to leverage Spark for big data analytics and machine learning, the need for seamless integration between Spark and Solr emerges. In this presentation, Timothy Potter covers how to populate Solr from a Spark streaming job as well as how to expose the results of any Solr query as an RDD. Attendees will come away with a solid understanding of common use cases, access to open source code, and performance metrics to help them develop their own large-scale search and discovery solution with Spark and Solr.

Learn more:

  • Solr As A SparkSQL DataSource
  • Using Spark-Solr at Scale: Productionizing Spark for Search with Apache Solr
  • How to Integrate Spark MLlib and Apache Solr to Build Real-Time Entity Type Recognition System for Better Query Understanding


  • « back