Integrating multi-omics data to identify tissue-specific DNA methylation biomarkers for cancer risk

Yaohua Yang & Jirong Long et al. · 2024-07-18

Abstract

The relationship between tissue-specific DNA methylation and cancer risk remains inadequately elucidated. Leveraging resources from the Genotype-Tissue Expression consortium, here we develop genetic models to predict DNA methylation at CpG sites across the genome for seven tissues and apply these models to genome-wide association study data of corresponding cancers, namely breast, colorectal, renal cell, lung, ovarian, prostate, and testicular germ cell cancers. At Bonferroni-corrected P < 0.05, we identify 4248 CpGs that are significantly associated with cancer risk, of which 95.4% (4052) are specific to a particular cancer type. Notably, 92 CpGs within 55 putative novel loci retain significant associations with cancer risk after conditioning on proximal signals identified by genome-wide association studies. Integrative multi-omics analyses reveal 854 CpG-gene-cancer trios, suggesting that DNA methylation at 309 distinct CpGs might influence cancer risk through regulating the expression of 205 unique cis-genes. These findings substantially advance our understanding of the interplay between genetics, epigenetics, and gene expression in cancer etiology.
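The significance criterion used above (Bonferroni-corrected P < 0.05) can be illustrated with a minimal sketch. The function name and the example p-values are hypothetical; the abstract does not state how many CpGs were tested per cancer or how the authors implemented the correction, so this shows only the general technique. The last line rechecks the cancer-type-specific share from the two counts reported in the abstract.

```python
def bonferroni_significant(p_values, alpha=0.05):
    """Return indices of tests whose Bonferroni-adjusted p-value is below alpha.

    The adjusted p-value is the raw p-value multiplied by the number of tests,
    capped at 1.0.
    """
    n_tests = len(p_values)
    return [i for i, p in enumerate(p_values) if min(p * n_tests, 1.0) < alpha]


# Hypothetical p-values for five CpG-cancer association tests (not study data).
example_p = [1e-9, 0.004, 0.03, 0.2, 0.011]
significant = bonferroni_significant(example_p)
print(significant)  # indices of tests that survive correction

# Share of cancer-type-specific CpGs, using the counts given in the abstract.
specific_share = round(100 * 4052 / 4248, 1)
print(specific_share)
```

With five tests, only raw p-values below 0.05 / 5 = 0.01 survive, which is why the test with p = 0.011 is not flagged here.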

Funding
DNA Methylation Markers, Genes and Breast Cancer Risk

Pathology

Identification of Genes and DNA Methylation Markers for Lung Cancer Risk by Integrating Multi-omics Data

Dissecting roles of microbiome-host interactions in colorectal neoplasia etiology using multi-omics data

U.S. Department of Health & Human Services | National Institutes of Health

NCI NIH HHS

R01 CA247987

NCI NIH HHS

P30 CA008748

NCI NIH HHS

R01 CA249863

NCI NIH HHS

R00 CA248822
