Semantic Search: Fast Results from Large, Non-Native Language Corpora - Databricks

The Semantic Engine is a custom search engine that can be deployed on top of large, non-native language corpora. It goes beyond keyword search and does NOT require translation. Making the search effective depends on large, on-the-fly calculations, which required development on a distributed platform capable of processing large volumes of unstructured data.
Hear how the low barrier to entry provided by Apache Spark allowed the Novetta Solutions team to focus on the hard analytical challenges presented by their data, without having to spend much time grappling with the inherent difficulties normally associated with distributed computing.

Session hashtag: #SFds18

About Rob Lantz

Rob Lantz is Director of Predictive Analytics at Novetta, overseeing multiple big data analytics and predictive modeling projects. He joined Novetta in 2013 after working as an Operations Analyst and Consultant for various customers, a period in which he deployed multiple times to the Afghan theater. Prior to that, Rob served for 10 years in the US Marines, during which time he was deployed as part of Operation Iraqi Freedom. Rob is a proud graduate of the US Naval Academy and the Naval Postgraduate School. He is a Certified Analytics Professional (CAP), a Professional Statistician (PStat), and an AWS Certified Developer.