Skip to main content

Solution Accelerator

Disease Profiling With TCGA

Pre-built code, sample data and step-by-step instructions ready to go in a Databricks Notebook

Disease Profiling With TCGA
Understanding cancer’s secrets with data

Understanding cancer’s secrets with data

The Cancer Genome Atlas (TCGA) aims to demystify the complex molecular aspects of cancer through advanced genome analysis and extensive genome sequencing. Its goal is to enhance cancer outcomes by investigating how genomic changes affect cancer diagnosis, treatment and prevention.

With this Solution Accelerator, which uses open access TCGA clinical and transcriptomics data spanning 33 cancer types, organizations can:

  • Unify transcriptomics with associated clinical data to construct gene expression profiles
  • Publish the resulting tables to Unity Catalog for downstream analysis
  • Analyze and interact with the data in natural language
  • Explore expression profiles for each sample and inspect clusters of samples
Download notebook