How to Avoid Drowning in GDPR Data Subject Requests in a Data Lake
With GDPR enforcement rapidly approaching (May 25, 2018), many companies are still trying to figure out how to comply. A big pain point,…
With GDPR enforcement rapidly approaching (May 25, 2018), many companies are still trying to figure out how to comply. A big pain point,…
Structured Streaming in Apache Spark 2.0 decoupled micro-batch processing from its high-level APIs for a couple of reasons. First, it made developer’s experience…
Combining the best of data warehouses, data lakes and streaming For an in-depth look and demo, join the webinar. Today we are proud…
Today we are happy to announce the availability of Apache Spark 2.2.0 on Databricks as part of the Databricks Runtime 3.0. This release…
We started building Structured Streaming in Apache Spark one year ago as a new, simpler way to develop continuous applications. Not only does…
This is the fifth post in a multi-part series about how you can perform complex streaming analytics using Apache Spark. At Databricks, we’ve…
This is the third post in a multi-part series about how you can perform complex streaming analytics using Apache Spark. In this blog,…
In part 1 of this series on Structured Streaming blog posts, we demonstrated how easy it is to write an end-to-end streaming ETL…
We are well into the Big Data era, with organizations collecting massive amounts of data on a continual basis. Yet, the value of…
Apache Spark 2.0 adds the first version of a new higher-level API, Structured Streaming, for building continuous applications. The main goal is to…