Interpretation of SNPs data is a non-trivial task: The analysis of the whole exome and/or whole genome data processing and later on interpretation is a challenging process in which Apache spark usage significantly speeds up the end-to-end analysis from FASTQ to annotated vcf file. In this talk we’ll share how doc.ai implements Apache spark technology for bioinformatics purposes.
Kartik Thakore, Head of Data Engineering at doc.ai, has worked in the medical data space since 2011. His work brings artificial intelligence and statistical solutions to the medical health care landscape. He led his own startup, AIMED Stat Inc, and then subsequently joined other startups, BetterDoctor and Human API, before joining doc.ai in 2018. Kartik also shares his data science expertise by advising several other startups in the Bay Area, London and Toronto on machine learning, data engineering and architectural directions.