Investigator

T. Martín-Noguerol

Unknown Institution

TMT. Martín-Noguerol
Papers(2)
Natural language proc…Towards Automated FIG…
Collaborators(2)
A. LunaP. López-Úbeda
Institutions(1)
Unknown Institution

Papers

Natural language processing evaluation of trends in cervical cancer incidence in radiology reports: A ten-year survey

Cervical cancer commonly associated with human papillomavirus (HPV) infection, remains the fourth most common cancer in women globally. This study aims to develop and evaluate a Natural Language Processing (NLP) system to identify and analyze cervical cancer incidence trends from 2013 to 2023 at our institution, focusing on age-specific variations and evaluating the possible impact of HPV vaccination. This retrospective cohort study, we analyzed unstructured radiology reports collected between 2013 and 2023, comprising 433,207 studies involving 250,181 women who underwent CT, MRI, or ultrasound scans of the abdominopelvic region. A rule-based NLP system was developed to extract references to cervical cancer from these reports and validated against a set of 200 manually annotated cases reviewed by an experienced radiologist. The NLP system demonstrated excellent performance, achieving an accuracy of over 99.5 %. This high reliability enabled its application in a large-scale population study. Results show that the women under 30 maintain a consistently low cervical cancer incidence, likely reflecting early HPV vaccination impact. The 30-40 cohorts declined until 2020, followed by a slight increase, while the 40-60 groups exhibited an overall downward trend with fluctuations, suggesting long-term vaccine effects. Incidence in patients over 60 also declined, though with greater variability, possibly due to other risk factors. The developed NLP system effectively identified cervical cancer cases from unstructured radiology reports, facilitating an accurate analysis of the impact of HPV vaccination on cervical cancer prevalence and imaging study requirements. This approach demonstrates the potential of AI and NLP tools in enhancing data accuracy and efficiency in medical epidemiology research. NLP-based approaches can significantly improve the collection and analysis of epidemiological data on cervical cancer, supporting the development of more targeted and personalized prevention strategies-particularly in populations with heterogeneous HPV vaccination coverage.

Towards Automated FIGO Staging in Radiology: The Role of LLMs in Cervical and Endometrial Cancer

Staging gynecological malignancies is a complex process, and radiologists should be familiar with the evolution of FIGO staging criteria. Large Language Models (LLMs) offer potential to support radiologists by automating classification tasks from free-text MRI reports. We conducted a retrospective study using two curated datasets of pelvic MRI reports from patients with cervical (n = 261, FIGO 2018) and endometrial cancer (n = 555, FIGO 2023). A general-purpose LLM (Cohere Command-A) was evaluated under three prompting strategies (zero-shot, guided, and chain-of-thought [CoT]), using exact stage accuracy, an ordinal FIGO distance metric, and the rate of severe errors. The Cohere Command-A model was chosen for its long-context reasoning, instruction-following capabilities, reproducible fixed version, and secure handling of sensitive clinical data. While alternative LLMs (eg, GPT-4o, Gemini, Llama-3, DeepSeek) could offer complementary insights, access, resources, and compliance constraints limited broader comparisons. For cervical cancer, CoT prompting achieved the highest accuracy (80.5%) and the lowest FIGO distance, with 23 severe misclassifications (≥2-stage deviation), outperforming guided and zero-shot prompting. For endometrial cancer, all strategies performed appropriately, with CoT again yielding the best results (accuracy, 90.6%) and the lowest number of severe misclassifications (37 cases), compared with guided and zero-shot prompting. In a small subset of cases with no agreement between any prompting strategy and the reference label, manual review showed that only a minority presented potentially suboptimal annotations, suggesting that CoT-based predictions may also help flag doubtful reports. The LLMs used demonstrated strong performance in automatically assigning FIGO stages for cervical and endometrial cancers from MRI reports. Their integration could reduce workload and improve consistency in staging. Further validation is needed before clinical implementation.

2Papers
2Collaborators