Regeneron is a leading biotechnology company using the power of science to bring new medicines to patients in need.
More than 95% of all experimental medicines that are currently in the drug development pipeline are expected to fail. To improve these efforts, the Regeneron Genetics Center built one of the most comprehensive genetics databases by pairing the sequenced exomes and electronic health records of more than 400,000 people. However, they faced numerous challenges analyzing this massive set of data:
Databricks provides Regeneron with a unified analytics platform running on Amazon Web Services that simplifies operations and accelerates drug discovery through improved data science productivity. This is empowering them to analyze the data in new ways that were previously impossible.
“The Databricks Unified Analytics Platform is enabling everyone in our integrated drug development process – from physician-scientists to computational biologists – to easily access, analyze, and extract insights from all of our data”
-Jeffrey Reid, PhD, Head of Genome Informatics at Regeneron
With Databricks we have been able to reduce the time it takes to ETL massive amounts of data from weeks to just a few hours to greatly accelerate drug discovery.