Apache Spark ❤️ Apache DataSketches: New Sketch-Based Approximate Distinct CountingSeptember 21, 2023 by Daniel Tenedorio, Menelaos Karavelas and Ryan Berti in Engineering Blog Introduction In this blog post, we'll explore a set of advanced SQL functions available within Apache Spark that leverage the HyperLogLog algorithm, enabling...