Investigator

Christopher Jackson

Pennsylvania State University

CJChristopher Jacks…
Papers(1)
HiCervix: An Extensiv…
Collaborators(10)
De CaiJie ChenJunhan ZhaoJunzhou HuangKanran WangMin FengSen YangWei YuanXiyue WangYuan Xue
Institutions(8)
Pennsylvania State Un…Sichuan UniversityThe Fourth Affiliated…University of ChicagoThe University Of Tex…Chongqing Cancer Hosp…Ant GroupThe Ohio State Univer…

Papers

HiCervix: An Extensive Hierarchical Dataset and Benchmark for Cervical Cytology Classification

Cervical cytology is a critical screening strategy for early detection of pre-cancerous and cancerous cervical lesions. The challenge lies in accurately classifying various cervical cytology cell types. Existing automated cervical cytology methods are primarily trained on databases covering a narrow range of coarse-grained cell types, which fail to provide a comprehensive and detailed performance analysis that accurately represents real-world cytopathology conditions. To overcome these limitations, we introduce HiCervix, the most extensive, multi-center cervical cytology dataset currently available to the public. HiCervix includes 40,229 cervical cells from 4,496 whole slide images, categorized into 29 annotated classes. These classes are organized within a three-level hierarchical tree to capture fine-grained subtype information. To exploit the semantic correlation inherent in this hierarchical tree, we propose HierSwin, a hierarchical vision transformer-based classification network. HierSwin serves as a benchmark for detailed feature learning in both coarse-level and fine-level cervical cancer classification tasks. In our comprehensive experiments, HierSwin demonstrated remarkable performance, achieving 92.08% accuracy for coarse-level classification and 82.93% accuracy averaged across all three levels. When compared to board-certified cytopathologists, HierSwin achieved high classification performance (0.8293 versus 0.7359 averaged accuracy), highlighting its potential for clinical applications. This newly released HiCervix dataset, along with our benchmark HierSwin method, is poised to make a substantial impact on the advancement of deep learning algorithms for rapid cervical cancer screening and greatly improve cancer prevention and patient outcomes in real-world clinical settings.

1Papers
10Collaborators