Investigator

Jesus Gonzalez-Bosquet

University of Iowa

JGJesus Gonzalez-Bo…
Papers(12)
scRNA-seq Can Identif…A Natural Language Pr…Endometrial carcinosa…Training, Validating,…Integration of Genomi…Using Genomic Variati…Identification of Ova…Methylation Signature…Interval debulking su…Identification of Nov…Identification of Nov…Disparity of ovarian …
Collaborators(10)
Eric DevorVincent WagnerAndreea NewtsonMario M LeitaoMark E. ShermanMartin KöbelMartin WidschwendterMatthew Stephen BlockMichael AnglesioNadia Traficante
Institutions(8)
University Of IowaMemorial Sloan-Ketter…Mayo ClinicUniversity of CalgaryLeopold-Franzens-Univ…Mayo Clinic in Roches…University of British…Peter MacCallum Cance…

Papers

scRNA-seq Can Identify Different Cell Populations in Ovarian Cancer Bulk RNA-seq Experiments

High-grade serous ovarian cancer (HGSC) is a heterogeneous disease. RNA sequencing (RNAseq) of bulk solid tissue is of limited use in these populations due to heterogeneity. Single-cell RNA-seq (scRNA-seq) allows for the identification of diverse genetic compositions of heterogeneous cell populations. New computational methodologies are now available that use scRNAseq results to estimate cell type proportions in bulk RNAseq data. We performed bulk RNA-seq gene expression analysis on 112 HGSC specimens and 12 benign fallopian tube (FT) controls. We identified several publicly available scRNAseq datasets for use as annotation and reference datasets. Deconvolution was performed with MUlti-Subject SIngle Cell Deconvolution (MuSiC) to estimate cell type proportions in the bulk RNA-seq data. Datasets from the Cancer Genome Atlas (TCGA). HGSC repositories were also evaluated. Clinical variables and percentages of cell types were compared for differences in clinical outcomes and treatment results. Pathway enrichment analysis was also performed. Different annotations for referenced scRNA-seq datasets used for deconvolution of bulk RNA-seq data revealed different cellular proportions that were significantly associated with clinical outcomes; for example, higher proportions of macrophages were associated with a better response to primary chemotherapy. Our deconvolution study of bulk RNAseq HGSC samples identified cell populations within the tumor that may be associated with some of the observed clinical outcomes.

A Natural Language Processing Method Identifies an Association Between Bacterial Communities in the Upper Genital Tract and Ovarian Cancer

Bacterial communities within the female upper genital tract may influence the risk of ovarian cancer. In this retrospective cohort pilot study, we aim to detect different communities of bacteria between ovarian cancer and normal controls using topic modeling, a natural language processing tool. RNA was extracted and analyzed using the VITCOMIC2 pipeline. Topic modeling assessed differences in bacterial communities. Idatuning identified an optimal latent topic number and Latent Dirichlet Allocation (LDA) assessed topic differences between high-grade serous ovarian cancer (HGSOC) and controls. Results were validated using The Cancer Genome Atlas (TCGA) HGSOC dataset. A total of 801 unique taxa were identified, with 13 bacteria significantly differing between HGSOC and normal controls. LDA modeling revealed a latent topic associated with HGSOC samples, containing bacteria Escherichia/Shigella and Corynebacterineae. Pathway analysis using KEGG databases suggest differences in several biologic pathways including oocyte meiosis, aldosterone-regulated sodium reabsorption, gastric acid secretion, and long-term potentiation. These findings support the hypothesis that bacterial communities in the upper female genital tract may influence the development of HGSOC by altering the local environment, with potential functional implications between HGSOC and normal controls. However, further validation is required to confirms these associations and determine mechanistic relevance.

Endometrial carcinosarcoma without myoinvasion

Uterine carcinosarcoma without myoinvasion, limited to the endometrial lining/polyp or with no residual uterine disease at the time of hysterectomy, is extremely uncommon, with unknown oncologic outcomes. Thus, this study aimed to evaluate the long-term outcomes of patients with carcinosarcoma without myoinvasion. Patients with International Federation of Gynecology and Obstetrics 2009 stage IA carcinosarcoma without myoinvasion who underwent surgery from December 1998 to January 2023 were identified from 11 centers worldwide. Patients were classified by tumor status (limited to the endometrium, limited to polyp, no residual disease in the hysterectomy specimen) and by type of adjuvant therapy (chemotherapy vs no chemotherapy). Survival analysis follow-up was limited to the first 5 years after surgery. Of 97 patients included, 28 (28.9%) had disease confined to a polyp, 55 (56.7%) to the endometrium, and 14 (14.4%) had no residual disease in the hysterectomy specimen. Patients received observation only (n=16, 16.5%), vaginal brachytherapy alone (n=14, 14.4%), external beam radiation therapy ± vaginal brachytherapy (n=5, 5.2%), chemotherapy ± vaginal brachytherapy (n=51, 52.6%), and chemotherapy and external beam radiation therapy ± vaginal brachytherapy (n=7, 7.2%), whereas adjuvant therapy was unknown in 4 patients (4.1%). A total of 29 patients (29.9%) recurred, mostly with a distant pattern of relapse. The 5-year recurrence-free survival was 63.5% (95% CI 53.4% to 75.4%) and the overall survival was 72.0% (95% CI 62.6% to 82.9%). The median follow-up for patients without recurrence was 56.9 months (interquartile range; 21.8-72.9). No significant differences were observed in recurrence-free survival and overall survival based on status of the tumor (p=.99 and p=.43, respectively). The difference in recurrence-free survival and overall survival was not statistically significant based on the receipt of chemotherapy (p=.08 and p=.07, respectively). Patients with carcinosarcoma without myoinvasion have a poor prognosis, with a high recurrence rate with distant pattern. The use of chemotherapy did not achieve statistical significance but may be limited by our small series.

Training, Validating, and Testing Machine Learning Prediction Models for Endometrial Cancer Recurrence

PURPOSE Endometrial cancer (EC) is the most common gynecologic cancer in the United States with rising incidence and mortality. Despite optimal treatment, 15%-20% of all patients will recur. To better select patients for adjuvant therapy, it is important to accurately predict patients at risk for recurrence. Our objective was to train, validate, and test models of EC recurrence using lasso regression and other machine learning (ML) and deep learning (DL) analytics in a large, comprehensive data set. METHODS Data from patients with EC were downloaded from the Oncology Research Information Exchange Network database and stratified into low risk, The International Federation of Gynecology and Obstetrics (FIGO) grade 1 and 2, stage I (N = 329); high risk, or FIGO grade 3 or stages II, III, IV (N = 324); and nonendometrioid histology (N = 239) groups. Clinical, pathologic, genomic, and genetic data were used for the analysis. Genomic data included microRNA, long noncoding RNA, isoforms, and pseudogene expressions. Genetic variation included single-nucleotide variation (SNV) and copy-number variation (CNV). In the discovery phase, we selected variables informative for recurrence ( P < .05), using univariate analyses of variance. Then, we trained, validated, and tested multivariate models using selected variables and lasso regression, MATLAB (ML), and TensorFlow (DL). RESULTS Recurrence clinic models for low-risk, high-risk, and high-risk nonendometrioid histology had AUCs of 56%, 70%, and 65%, respectively. For training, we selected models with AUC >80%: five for the low-risk group, 20 models for the high-risk group, and 20 for the nonendometrioid group. The two best low-risk models included clinical data and CNVs. For the high-risk group, three of the five best-performing models included pseudogene expression. For the nonendometrioid group, pseudogene expression and SNV were overrepresented in the best models. CONCLUSION Prediction models of EC recurrence built with ML and DL analytics had better performance than models with clinical and pathologic data alone. Prospective validation is required to determine clinical utility.

Integration of Genomic and Clinical Retrospective Data to Predict Endometrioid Endometrial Cancer Recurrence

Endometrial cancer (EC) incidence and mortality continues to rise. Molecular profiling of EC promises improvement of risk assessment and treatment selection. However, we still lack robust and accurate models to predict those at risk of failing treatment. The objective of this pilot study is to create models with clinical and genomic data that will discriminate patients with EC at risk of disease recurrence. We performed a pilot, retrospective, case–control study evaluating patients with EC, endometrioid type: 7 with recurrence of disease (cases), and 55 without (controls). RNA was extracted from frozen specimens and sequenced (RNAseq). Genomic features from RNAseq included transcriptome expression, genomic, and structural variation. Feature selection for variable reduction was performed with univariate ANOVA with cross-validation. Selected variables, informative for EC recurrence, were introduced in multivariate lasso regression models. Validation of models was performed in machine-learning platforms (ML) and independent datasets (TCGA). The best performing prediction models (out of >170) contained the same lncRNA features (AUC of 0.9, and 95% CI: 0.75, 1.0). Models were validated with excellent performance in ML platforms and good performance in an independent dataset. Prediction models of EC recurrence containing lncRNA features have better performance than models with clinical data alone.

Identification of Ovarian High-Grade Serous Carcinoma with Mitochondrial Gene Variation

Women diagnosed with advanced-stage ovarian cancer have a much worse survival rate than women diagnosed with early-stage ovarian cancer, but the early detection of this disease remains a clinical challenge. Some recent reports indicate that genetic variations could be useful for the early detection of several malignancies. In this pilot observational retrospective study, we aimed to assess whether mitochondrial DNA (mtDNA) variations could discriminate the most frequent type of ovarian cancer, high-grade serous carcinoma (HGSC), from normal tissue. We identified mtDNA variations from 20 whole-exome sequenced (WES) HGSC samples and 14 controls (normal tubes) using the best practices of genome sequencing. We built prediction models of cancer with these variants, with good performance measured by the area under the curve (AUC) of 0.88 (CI: 0.74–1.00). The variants included in the best model were correlated with gene expression to assess the potentially affected processes. These analyses were validated with the Cancer Genome Atlas (TCGA) dataset, (including over 420 samples), with a fair performance in AUC terms (0.63–0.71). In summary, we identified a set of mtDNA variations that can discriminate HGSC with good performance. Specifically, variations in the MT-CYB gene increased the risk for HGSC by over 30%, and MT-CYB expression was significantly decreased in HGSC patients. Robust models of ovarian cancer detection with mtDNA variations could be applied to liquid biopsy technology, like those which have been applied to other cancers, with a special focus on the early detection of this lethal disease.

Methylation Signature Implicated in Immuno-Suppressive Activities in Tubo-Ovarian High-Grade Serous Carcinoma

Abstract Background: Better understanding of prognostic factors in tubo-ovarian high-grade serous carcinoma (HGSC) is critical, as diagnosis confers an aggressive disease course. Variation in tumor DNA methylation shows promise predicting outcome, yet prior studies were largely platform-specific and unable to evaluate multiple molecular features. Methods: We analyzed genome-wide DNA methylation in 1,040 frozen HGSC, including 325 previously reported upon, seeking a multi-platform quantitative methylation signature that we evaluated in relation to clinical features, tumor characteristics, time to recurrence/death, extent of CD8+ tumor-infiltrating lymphocytes (TIL), gene expression molecular subtypes, and gene expression of the ATP-binding cassette transporter TAP1. Results: Methylation signature was associated with shorter time to recurrence, independent of clinical factors (N = 715 new set, hazard ratio (HR), 1.65; 95% confidence interval (CI), 1.10–2.46; P = 0.015; N = 325 published set HR, 2.87; 95% CI, 2.17–3.81; P = 2.2 × 10−13) and remained prognostic after adjustment for gene expression molecular subtype and TAP1 expression (N = 599; HR, 2.22; 95% CI, 1.66–2.95; P = 4.1 × 10−8). Methylation signature was inversely related to CD8+ TIL levels (P = 2.4 × 10−7) and TAP1 expression (P = 0.0011) and was associated with gene expression molecular subtype (P = 5.9 × 10−4) in covariate-adjusted analysis. Conclusions: Multi-center analysis identified a novel quantitative tumor methylation signature of HGSC applicable to numerous commercially available platforms indicative of shorter time to recurrence/death, adjusting for other factors. Along with immune cell composition analysis, these results suggest a role for DNA methylation in the immunosuppressive microenvironment. Impact: This work aids in identification of targetable epigenome processes and stratification of patients for whom tailored treatment may be most beneficial.

Interval debulking surgery is not worth the wait: a National Cancer Database study comparing primary cytoreductive surgery versus neoadjuvant chemotherapy

In previous studies, neoadjuvant chemotherapy followed by interval debulking surgery was not inferior to primary cytoreductive surgery as initial treatment for advanced epithelial ovarian cancer. Our study aimed to compare surgical and survival outcomes between the two treatments in a large national database. Data were extracted from the National Cancer Database from January 2004 to December 2015. Patients with FIGO (International Federation of Gynecologists and Obstetricians) stage III-IV epithelial ovarian cancer and known sequence of treatment were included: primary cytoreductive (surgery=26 717 and neoadjuvant chemotherapy=9885). Tubal and primary peritoneal cancer diagnostic codes were not included. Residual disease after treatment was defined based on recorded data: R0 defined as microscopic or no residual disease; R1 defined as macroscopic residual disease. Multivariate Cox proportional HR was used for survival analysis. Multivariate logistic regression analysis was utilized to compare mortality between groups. Outcomes were adjusted for significant covariates. Validation was performed using propensity score matching of significant covariates. A total of 36 602 patients were included in the analysis. Patients who underwent primary cytoreductive surgery had better survival than those treated with neoadjuvant chemotherapy followed by interval surgery, after adjusting for age, co-morbidities, stage, and residual disease (p<0.001). Primary cytoreductive surgery patients with R0 disease had best median survival (62.6 months, 95% CI 60.5-64.5). Neoadjuvant chemotherapy patients with R1 disease had worst median survival (29.5 months, 95% CI 28.4-31.9). There were small survival differences between primary cytoreductive surgery with R1 (38.9 months) and neoadjuvant chemotherapy with R0 (41.8 months) (HR 0.93, 95% CI 0.87 to 1.0), after adjusting for age, co-morbidities, grade, histology, and stage. Neoadjuvant chemotherapy had 3.5 times higher 30-day mortality after surgery than primary cytoreductive surgery (95% CI 2.46 to 5.64). The 90-day mortality was higher for neoadjuvant chemotherapy in multivariate analysis (HR 1.31, 95% CI 1.06 to 1.61) but similar to primary cytoreductive surgery after excluding high-risk patients. Most patients with advanced epithelial ovarian cancer may benefit from primary cytoreductive surgery. Patients treated with neoadjuvant chemotherapy should be those with co-morbidities unfit for surgery.

Identification of Novel lncRNAs in Ovarian Cancer and Their Impact on Overall Survival

Long non-coding RNA’s (lncRNA) are RNA sequences that do not encode proteins and are greater than 200 nucleotides in length. They regulate complex cellular mechanisms and have been associated with prognosis in various types of cancer. We aimed to identify lncRNA sequences that are associated with high grade serous ovarian cancer (HGSC) and assess their impact on overall survival. RNA was extracted from 112 HGSC patients and 12 normal fallopian tube samples from our Biobank tissue repository. RNA was sequenced and the Ultrafast and Comprehensive lncRNA detection and quantification pipeline (UClncR) was used for the identification of lncRNA sequences. Univariate logistic and multivariate lasso regression analyses identified lncRNA that was associated with HGSC. Univariate and multivariate Cox proportional hazard ratios were used to evaluate independent predictors of survival. 1943 of 16,325 investigated lncRNA’s were differentially expressed in HGSC as compared to controls (p &lt; 0.001). Nine of these demonstrated association with cancer after multivariate lasso regression. Our multivariate analysis of survival identified four lncRNA’s associated with survival in HGSC. Three out of these four were found to be independently significant after accounting for all clinical covariates. Lastly, seven lncRNAs were independently associated with initial response to chemotherapy; four portended a worse response, while three were associated with improved response. More research is needed, but there is potential for these lncRNAs to be used as biomarkers of HGSC or predictors of treatment outcome in the future.

Disparity of ovarian cancer survival between urban and rural settings

To determine if there is a difference in overall survival of patients with epithelial ovarian cancer in rural, urban, and metropolitan settings in the United States. We performed a retrospective cohort study using 2004-2016 National Cancer Database (NCDB) data including high and low grade, stage I-IV disease. Bivariate analyses used Student's t-test for continuous variables and χ A total of 111 627 patients were included with a mean age of 62.5 years for metroolitan (range 18-90), 64.0 years for rural (range 19-90) and 63.2 years for urban areas (range 18-90). Of all patients included, 94 290 were in a metropolitan area (counties >1 million population or 50 000-999 999), 15 386 were in an urban area (population of 10 000-49 999), and 1951 were in a rural area (non-metropolitan/non-core population). Univariate Cox proportional hazards models showed clinically significant differences in survival in patients from metropolitan, urban, and rural areas. Multivariate Cox proportional hazards models showed a clinically significant increase in HRs for patients in rural settings (HR 1.17; 95% CI 1.06 to 1.29). Increasing age and stage, non-insured status, non-white race, and comorbidity were also significant for poorer survival. Patients with ovarian cancer who live in rural settings with small populations and greater distance to tertiary care centers have poorer survival. These differences hold after controlling for stage, age, and other significant risk factors related to poorer outcomes. To improve clinical outcomes, we need further studies to identify which of these factors are actionable.

94Works
12Papers
47Collaborators
Ovarian NeoplasmsEndometrial NeoplasmsNeoplasm Recurrence, LocalCarcinoma, Ovarian EpithelialNeoplasm GradingPrognosisBiomarkers, Tumor

Positions

Researcher

University of Iowa

Links & IDs
0000-0002-2079-4528

Scopus: 6602637763