Investigator
Assistant Professor · The University of Texas MD Anderson Cancer Center, Epidemiology
Isoform-level analyses of 6 cancers uncover extensive genetic risk mechanisms undetected at the gene-level
Abstract Background Integrating genome-wide association study (GWAS) and transcriptomic datasets can identify mediators for genetic risk of cancer. Traditional methods often are insufficient as they rely on total gene expression measures and overlook alternative splicing, which generates different transcript-isoforms with potentially distinct effects. Methods We integrate multi-tissue isoform expression data from the Genotype Tissue-Expression Project with GWAS summary statistics (all N > ~20,000 cases) to identify isoform- and gene-level associations with six cancers (breast, endometrial, colorectal, lung, ovarian, prostate) and six related cancer subtype classifications (N = 12 total). Results Directly modeling isoforms using transcriptome-wide association studies (isoTWAS) significantly improves discovery of genetic associations compared to gene-level approaches, identifying 164% more significant associations (6163 vs. 2336) with isoTWAS-prioritized genes enriched 4-fold for evolutionarily-constrained genes. isoTWAS tags transcriptomic associations at 52% more independent GWAS loci across the six cancers. Isoform expression mediates an estimated 63% greater proportion of cancer risk SNP heritability compared to gene expression. We highlight several isoTWAS associations that demonstrate GWAS colocalization at the isoform level but not at the gene level, including CLPTM1L (lung cancer), LAMC1 (colorectal), and BABAM1 (breast). Conclusion These results underscore the importance of modeling isoforms to maximize discovery of genetic risk mechanisms for cancers.
Assistant Professor
The University of Texas MD Anderson Cancer Center · Epidemiology
Postdoctoral Fellow
University of California Los Angeles · Pathology and Laboratory Medicine