Skip to main content
<
Page 180
>

What AWS Per-Second Billing Means for Big Data Processing

November 6, 2017 by Prakash Chockalingam in
Databricks, the Unified Analytics Platform, has always been a cloud-first platform. We believe in the scalability and elasticity of the cloud so that...

Spark Summit EU 2017 Recap and Reflections

November 5, 2017 by Jules Damji in
“Dublin is now a truly cosmopolitan capital, with an influx of people, energy, and ideas infusing the ever-beguiling, multi-layered city with fresh flavors...

Access Control for Databricks Jobs

Secure your production workloads end-to-end with Databricks’ comprehensive access control system Databricks offers role-based access control for clusters and workspace to secure infrastructure...

Continuous Integration & Continuous Delivery with Databricks

Continuous integration and continuous delivery (CI/CD) is a practice that enables an organization to rapidly iterate on software changes while maintaining stability, performance...

Introducing Pandas UDF for PySpark

October 30, 2017 by Li Jin in
NOTE: Spark 3.0 introduced a new pandas UDF. You can find more details in the following blog post: New Pandas UDFs and Python...

Databricks Delta: A Unified Data Management System for Real-time Big Data

Combining the best of data warehouses, data lakes and streaming For an in-depth look and demo, join the webinar . Today we are...

Introducing the Natural Language Processing Library for Apache Spark

October 19, 2017 by David Talby in
This is a community blog and effort from the engineering team at John Snow Labs, explaining their contribution to an open-source Apache Spark...

Using Databricks to Democratize Big Data and Machine Learning at McGraw-Hill Education

October 18, 2017 by Matthew Hogan in
This is a guest post from Matt Hogan, Sr. Director of Engineering, Analytics and Reporting at McGraw-Hill Education. McGraw-Hill Education is a 129-year-old...

3 Things CISO’s expect from Tech Companies in a Cloudy World

October 17, 2017 by David Cook in
Adding new software to an enterprise is a difficult process. In the past, choosing new software only required budget approval before it could...

Arbitrary Stateful Processing in Apache Spark’s Structured Streaming

October 17, 2017 by Bill Chambers and Jules Damji in
This is the seventh post in a multi-part series about how you can perform complex streaming analytics using Apache Spark and Structured Streaming...