Kaoula Ghribi

Cloud/Data Engineer, SNCF

Kaoula is a Data/Cloud Engineer who has worked at the French National Railway Company (SNCF) since 2017. She holds an engineering degree in computer science and is a certified Spark developer and GCP Associate Cloud Engineer. After engineering school, she joined CEA (French Alternative Energies and Atomic Energy Commission) as an R&D Engineer and spent a few years applying distributed constraint optimization to the prediction and optimization of household electricity consumption. In 2017 she decided to specialize in big data processing and joined SNCF, where she works on leveraging big data technologies to improve passenger security and train punctuality.

Past sessions

Summit Europe 2020 Building a Streaming Data Pipeline for Trains Delays Processing

November 18, 2020 04:00 PM PT

A major cause of dissatisfaction among passengers is the irregularity of train schedules.

SNCF (French National Railway Company) has distributed a network of beacons over its 32,000 km of train tracks, triggering a flow of events each time a train passes. In this talk, we will present how we built a real-time processing pipeline on this data to monitor traffic and map the propagation of train delays.

During the presentation, we will demonstrate how to build an end-to-end solution, from ingestion to exposure.

The presentation will take place as follows:
-Data Pipeline: how we set up a data transformation pipeline using Spark 3 and Delta with Azure Databricks, and how Delta Lake makes dynamically updated data reliable.
-Exposure: how we expose our output in the way best suited to each consumer, whether Power BI or a REST API.
-Production-ready: finally, we will demonstrate how we have structured our development process to make it reliable and aligned with SNCF best practices.

Speakers: Alexandre Bergere and Kaoula Ghribi