Introducing Flint: A time-series library for Apache SparkSeptember 11, 2018 by Li Jin and Kevin Rasmussen in Company Blog This is a joint guest community blog by Li Jin at Two Sigma and Kevin Rasmussen at Databricks; they share how to use...
Introducing Pandas UDF for PySparkOctober 30, 2017 by Li Jin in Solutions NOTE: Spark 3.0 introduced a new pandas UDF. You can find more details in the following blog post: New Pandas UDFs and Python...