Databricks Runtime 3.0 Beta Delivers Cloud Optimized Apache Spark
May 24, 2017 by Reynold Xin in Product
A major value Databricks provides is the automatic provisioning, configuration, and tuning of clusters of machines that process data. Running on these machines...
Persistent Clusters: Simplifying Cluster Management for Analytics
May 19, 2017 by Evan Ye, Haogang Chen, Henry Davidge and Prakash Chockalingam in Company Blog
Today we are excited to announce persistent clusters for analytics in Databricks. With persistent clusters, users no longer need to go through the...
Detecting Abuse at Scale: Locality Sensitive Hashing at Uber Engineering
May 9, 2017 by Yun Ni, Kelvin Chu and Joseph Bradley in Solutions
This is a cross blog post effort between Databricks and Uber Engineering. Yun Ni is a software engineer on Uber’s Machine Learning Platform...
Query Watchdog: Handling Disruptive Queries in Spark SQL
April 17, 2017 by Alicja Luszczak, Srinath Shankar and Bill Chambers in Product
Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...
Delivering a Personalized Shopping Experience with Apache Spark on Databricks
March 31, 2017 by Brett Bevers, Engineering Manager, Data Engineering at Dollar Shave Club in Product
This is a guest blog from our friends at Dollar Shave Club. Dollar Shave Club (DSC) is a men's lifestyle brand and e-commerce...
The Tenth Spark Summit with a Terrific Agenda for All
March 30, 2017 by Jules Damji in Announcements
The number 10 is often used as a measuring yardstick to denote achievement, attainment or accomplishment: the 10th anniversary; a perfect score of...
Analyse One Year of Radio Station Songs Aired with Apache Spark, Spark SQL, Spotify, and Databricks
March 27, 2017 by Paul Leclercq in Solutions
How Apache Spark on Databricks is Taming the Wild West of Wi-Fi
February 27, 2017 by Tomasz Magdanski in Company Blog
iPass is the world’s largest Wi-Fi provider, yet we don’t own a single hotspot. You can think of us as the Uber of...
Anonymizing Datasets at Scale Leveraging Databricks Interoperability
February 13, 2017 by Don Hillborn in Product
A key challenge for data-driven companies across a wide range of industries is how to leverage the benefits of analytics at scale when...
Announcing the Spark Live 2017 World Tour
January 31, 2017 by Wayne Chan in Company Blog
Due to the enthusiasm and positive feedback from last year’s Spark Live tour, we will be hitting the road again in 2017 to...