Investigator

Andrey Bychkov

Director of Digital Pathology · Kameda Medical Center, Pathology

Research Interests

Andrey Bychkov
Papers (3): Commercially Availabl…; Evaluation of general…; Basaloid Squamous Cel…
Collaborators (7): A.V. Asaturova; Jijgee Munkhdelger; Kris Lami; Sadakatsu Ikeda; Sompon Apornvirat; Thiyaphat Laohawetwan…; Yosep Chong
Institutions (6): Kameda Medical Center; National Medical Rese…; Nagasaki University; Tokyo Medical and Den…; Thammasat University; Uijeongbu Saint Mary'…

Papers

Evaluation of general-purpose large language models as diagnostic support tools in cervical cytology

The application of general-purpose large language models (LLMs) in cytopathology remains largely unexplored. This study aims to evaluate the accuracy and consistency of a custom version of ChatGPT-4 (GPT), ChatGPT o3, and Gemini 2.5 Pro as diagnostic support tools for cervical cytology. A total of 200 Papanicolaou-stained cervical cytology images were acquired at 40× magnification, each measuring 384 × 384 pixels. These images consisted of 100 cases classified as negative for intraepithelial lesion or malignancy (NILM) and 100 cases across various abnormal categories: 20 low-grade squamous intraepithelial lesion (LSIL), 20 high-grade squamous intraepithelial lesion (HSIL), 20 squamous cell carcinoma (SCC), 20 adenocarcinoma in situ (AIS), and 20 adenocarcinoma (ADC). Diagnostic accuracy and consistency were evaluated by submitting each image to the custom GPT, ChatGPT o3, and Gemini 2.5 Pro 5 to 10 times. When distinguishing normal from abnormal cytology, LLMs showed mean sensitivity between 85.4% and 100%, and specificity between 67.2% and 92.7%. ChatGPT o3 was more accurate in identifying NILM (mean 89.2% vs. 67.2%) but less accurate in detecting LSIL (34% vs. 85%), HSIL (6% vs. 63%), and ADC (28% vs. 91%). Chain-of-thought prompting and submitting multiple images of the same diagnosis to ChatGPT o3 and Gemini 2.5 Pro did not significantly improve accuracy. Both models also performed poorly in identifying cervicovaginal infections. ChatGPT o3 and Gemini 2.5 Pro demonstrated complementary strengths in cervical cytology. Due to their low accuracy and inconsistency in abnormal cytology, general-purpose LLMs are not recommended as diagnostic support tools in cervical cytology.

136 Works
3 Papers
7 Collaborators
Thyroid Neoplasms; Diagnosis, Differential; Lung Neoplasms; Cytodiagnosis; Biomarkers, Tumor; Carcinoma, Papillary; Thyroid Cancer, Papillary

Positions

2018–

Director of Digital Pathology

Kameda Medical Center · Pathology

2014–

Researcher

Chulalongkorn University Faculty of Medicine · Pathology

2003–

Assistant Professor

Smolensk State Medical Academy · Pathology

Education

2013

Ph.D.

Nagasaki Daigaku

2002

M.D.

Smolensk State Medical Academy

Country

JP

Keywords
thyroid, thyroid cancer, pathology