Skip to main content
Page 1

Introducing Flint: A time-series library for Apache Spark

September 11, 2018 by Li Jin and Kevin Rasmussen in
This is a joint guest community blog by Li Jin at Two Sigma and Kevin Rasmussen at Databricks; they share how to use...

Introducing Pandas UDF for PySpark

October 30, 2017 by Li Jin in
NOTE: Spark 3.0 introduced a new pandas UDF. You can find more details in the following blog post: New Pandas UDFs and Python...