Predicting Geographic Population using Genome Variants and K-Means - The Databricks Blog