Skip to main content
Page 1

Introducing GlowGR: An industrial-scale, ultra-fast and sensitive method for genetic association studies

Today, we announce that we are making a new whole genome regression method available to the open source bioinformatics community as part of...

Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic Analysis

The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the...

Parallelizing SAIGE Across Hundreds of Cores

As population genetics datasets grow exponentially, it is becoming impractical to work with genetic data without leveraging Apache Spark™. There are many ways...

Accurately Building Genomic Cohorts at Scale with Delta Lake and Spark SQL

June 19, 2019 by Frank Austin Nothaft and Karen Feng in
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. This is the second...

A Summer of Personal and Professional Growth at Databricks

September 5, 2017 by Karen Feng in
This summer, I worked at Databricks as a software engineering intern on the Growth team. By introducing two new features, user groups and...