This article represents a collaborative effort between Plotly, Ballard Power Systems, and Databricks.
Fleets of buses worldwide run on hydrogen fuel cells made by Ballard Power Systems. To lead in the zero-emission energy space, Ballard has combined the power of Databricks and Plotly Dash to build an RDU (Remote Diagnostics Unit) Visualization Platform capable of processing more than 50 terabytes of data and delivering actionable information globally to subject matter experts, fleet managers and C-level business leaders.
Facing rapidly growing data volumes, Ballard’s analytics team ran into performance and scalability constraints in its legacy architecture, which was built on classical data warehouses and BI reporting. These constraints limited data viewing and analytics to a single week of data. Furthermore, the required reports took weeks to curate and deliver, hampering long-term trend analysis and timely strategic decisions.
Addressing these challenges via standard data warehousing and BI solutions was not viable due to the complexity, scale and granularity of Ballard’s streaming data collection, pipeline, and analytics requirements. Ballard required a solution that could process complex semi-structured data in real time, perform ML training and inference, and visualize the results in a live data application. Further complications came from the data itself: incoming records were not ordered in time (quite significantly so), which stream-based processing normally expects, and there were significant lags in data arrival as well as missing chunks of data.
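To make the out-of-order and late-arrival problem concrete, here is a minimal, library-free Python sketch of event-time windowing with a lateness allowance. It is an illustration only, not Ballard’s implementation: the class name `EventTimeAggregator`, the 60-second window, the 120-second lateness allowance, and the sample readings are all hypothetical. In Spark Structured Streaming, the analogous mechanism is a watermark declared with `withWatermark`.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class EventTimeAggregator:
    """Averages readings per fixed event-time window, tolerating late data.

    Hypothetical illustration; names and parameters are assumptions.
    """
    window_s: int            # window length in seconds
    allowed_lateness_s: int  # how far behind the newest event a record may lag
    max_event_time: float = float("-inf")
    open_windows: dict = field(default_factory=lambda: defaultdict(list))
    closed: dict = field(default_factory=dict)  # window start -> mean value
    dropped: int = 0                            # records that arrived too late

    def add(self, event_time: float, value: float) -> None:
        # The watermark trails the newest event time seen so far.
        self.max_event_time = max(self.max_event_time, event_time)
        watermark = self.max_event_time - self.allowed_lateness_s

        # Assign the record to a window by its event time, not arrival order.
        start = int(event_time // self.window_s) * self.window_s
        if start + self.window_s <= watermark:
            # The window was already finalized, so the record is too late.
            self.dropped += 1
        else:
            self.open_windows[start].append(value)

        # Finalize every open window that ends at or before the watermark.
        ready = sorted(w for w in self.open_windows
                       if w + self.window_s <= watermark)
        for w in ready:
            values = self.open_windows.pop(w)
            self.closed[w] = sum(values) / len(values)


# Readings as (event_time_seconds, value), deliberately out of order.
agg = EventTimeAggregator(window_s=60, allowed_lateness_s=120)
for t, v in [(10, 1.0), (70, 3.0), (20, 3.0), (200, 5.0),
             (30, 9.0), (100, 5.0), (400, 1.0)]:
    agg.add(t, v)

print(agg.closed)   # finalized window averages, keyed by window start
print(agg.dropped)  # count of records discarded as too late
```

The lateness allowance is the key design trade-off: a larger value admits more late records but delays when a window’s result can be considered final, and missing chunks of data simply leave the affected windows absent from the output.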
The solution had to address all of these complexities while remaining accessible to a diverse set of internal and external stakeholders globally.
"We have a complex, growing data pipeline with currently over 40 billion data points from our expanding customer base of fuel cell modules. We specifically chose the combination of Databricks and Plotly Dash because we required a platform capable of delivering sophisticated, interactive, at-scale streaming analytics globally - at both a technical and business level - to a wide range of internal and external stakeholders in a timely manner." - Andreas Putz, Analytics Team Lead, Ballard Power Systems
Ballard’s analytics team developed the RDU Visualization Platform by integrating Databricks and Plotly Dash in an Azure cloud environment to deliver interactive analytics products for multiple stakeholders.
The RDU Visualization Platform continues to evolve, and Ballard welcomes sharing best practices with others advancing similar objectives with these technologies. Planned enhancements to the Ballard system include adding spatial analytics, point-in-time formal reporting, further structured streaming optimizations, and other ML development initiatives.
About Ballard Power Systems
Ballard Power Systems’ (NASDAQ: BLDP; TSX: BLDP) vision is to deliver fuel cell power for a sustainable planet. Ballard zero-emission PEM fuel cells are enabling the electrification of mobility, including buses, commercial trucks, trains, marine vessels, and stationary power. To learn more about Ballard, please visit www.ballard.com.
About Plotly
Plotly is a software company whose mission is to enable every company, around the world, to build data apps. Our product, Dash Enterprise, is a platform of best-in-class development tools to quickly and easily visualize data in Python from virtually any data source. With customers across the Fortune 500, Plotly is a category-defining leader in enabling data-driven decisions from advanced analytics, machine learning, and artificial intelligence. For more information, visit www.plotly.com.
About Databricks
Databricks is the Data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, and over 50% of the Fortune 500 — rely on the Databricks Lakehouse Platform to unify their data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe. Founded by the original creators of Apache Spark™, Delta Lake and MLflow, Databricks is on a mission to help data teams solve the world’s toughest problems. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.