Session

Highways and Hexagons: Processing Large Geospatial Datasets With H3

Overview

Experience: In Person
Type: Breakout
Track: Data Engineering and Streaming
Industry: Public Sector
Technologies: Apache Spark, Databricks SQL, Databricks Workflows
Skill Level: Intermediate
Duration: 40 min

Matching GPS locations to roads and local government areas (LGAs) involves handling large datasets and a number of geospatial operations. In this deep dive, we will outline the challenges of developing scalable solutions for these tasks.

We will discuss our multi-step approach: first using H3 indexing to isolate points that have a single candidate match, then applying other geospatial computation techniques to accurately match points that have multiple candidates.

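The two-stage idea can be illustrated with a minimal sketch. It is not the presenters' implementation: a plain square grid stands in for H3 cells so the example runs without extra libraries, and `CELL_SIZE`, `cell_of`, `build_index`, and `split_matches` are hypothetical names chosen for illustration. Points whose cell contains exactly one candidate are resolved immediately; the rest are set aside for more precise geometric matching.

```python
from collections import defaultdict
from math import floor

# Hypothetical cell size in degrees; stands in for an H3 resolution.
CELL_SIZE = 0.01


def cell_of(lat, lon):
    """Return a square-grid cell id (an assumed stand-in for an H3 cell index)."""
    return (floor(lat / CELL_SIZE), floor(lon / CELL_SIZE))


def build_index(candidates):
    """Map each cell id to the set of candidate names (roads or LGAs) covering it.

    candidates: dict of name -> iterable of cell ids.
    """
    index = defaultdict(set)
    for name, cells in candidates.items():
        for cell in cells:
            index[cell].add(name)
    return index


def split_matches(points, index):
    """Separate points into resolved, ambiguous, and unmatched groups.

    points: dict of point id -> (lat, lon).
    """
    resolved, ambiguous, unmatched = {}, {}, []
    for pid, (lat, lon) in points.items():
        names = index.get(cell_of(lat, lon), set())
        if len(names) == 1:
            # Single candidate in the cell: accept it without further geometry.
            resolved[pid] = next(iter(names))
        elif names:
            # Several candidates: defer to a precise geometric comparison.
            ambiguous[pid] = sorted(names)
        else:
            unmatched.append(pid)
    return resolved, ambiguous, unmatched
```

Only the `ambiguous` group would need the more expensive geometric operations, which is the motivation for filtering on the cell index first.
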
From a technical perspective, the talk will showcase broadcasting and partitioning techniques and their effect on autoscaling, memory usage, and effective data parallelization.

This session is for anyone interested in geospatial data, Spark performance optimization, and the real-world challenges of large-scale data engineering.

This session will be co-presented by Prad Dias (Austroads Senior Implementation Manager) and Petr Andreev (Mantel Group Senior Data Engineer).

Session Speakers

Petr Andreev

Senior Data Engineer
Mantel Group

Ahangama (Prad) Dias

Austroads Ltd