Introducing GlowGR: An industrial-scale, ultra-fast and sensitive method for genetic association studiesJune 25, 2020 by Leland Barnard, Henry Davidge, Karen Feng, Joelle Mbatchou, Boris Boutkov, Kiavash Kianfar, Dr. Lukas Habegger, Jonathan Marchini, Dr. Jeff Reid, Evan Maxwell and Frank Austin Nothaft in Product Today, we announce that we are making a new whole genome regression method available to the open source bioinformatics community as part of...
Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic AnalysisOctober 18, 2019 by Frank Austin Nothaft, Karen Feng, Henry Davidge, Ion Stoica, Dr. Jeff Reid, Dr. Lukas Habegger, Evan Maxwell, Leland Barnard and Kiavash Kianfar in Announcements The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the...
Parallelizing SAIGE Across Hundreds of CoresOctober 2, 2019 by Karen Feng, Henry Davidge and Frank Austin Nothaft in Engineering Blog As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark™. There are many ways...
Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQLJune 19, 2019 by Frank Austin Nothaft and Karen Feng in Solutions Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. This is the second...
A Summer of Personal and Professional Growth at DatabricksSeptember 5, 2017 by Karen Feng in Company Blog This summer, I worked at Databricks as a software engineering intern on the Growth team. By introducing two new features, user groups and...