Evaluating generative AI models for explainable pathological feature extraction in lung adenocarcinoma: grading assessment and prognostic model construction

Bibliographic Details
Title: Evaluating generative AI models for explainable pathological feature extraction in lung adenocarcinoma: grading assessment and prognostic model construction
Authors: Junyi Shen, Anqi Lin, Ting Wei, Jian Zhang, Peng Luo
Source: The Lancet Regional Health. Western Pacific, Vol 55, Iss , Pp 101352- (2025)
Publisher Information: Elsevier, 2025.
Publication Year: 2025
Collection: LCC:Public aspects of medicine
Subject Terms: Public aspects of medicine, RA1-1270
More Details: Background: With the widespread application of generative AI (GenAI) models, it is crucial to systematically evaluate their performance in lung adenocarcinoma histopathological assessment. This study aimed to evaluate and compare the performance of three GenAI models with visual capabilities (GPT-4o, Claude-3.5-Sonnet, and Gemini-1.5-Pro) in lung adenocarcinoma histological pattern recognition and grading, and to explore the construction of prognostic prediction models based on GenAI feature extraction. Methods: This retrospective study extracted 310 diagnostic slides from the TCGA-LUAD database for model evaluation. An additional 87 diagnostic pathology slides from local lung adenocarcinoma surgical patients were used for external validation of the prognostic model. Primary outcomes were GenAI grading accuracy and stability, measured by the area under the receiver operating characteristic curve (AUC) and intraclass correlation coefficient (ICC), respectively. Secondary outcomes included the construction and assessment of machine learning-based prognostic prediction models, utilizing features extracted by GenAI, with model performance evaluated using the Concordance index (C-index). Findings: Claude-3.5-Sonnet demonstrated the best overall performance, combining high grading accuracy (average AUC = 0.82) with moderate stability (ICC = 0.59) The optimal machine learning-based prognostic model, constructed using features extracted by Claude-3.5-Sonnet and incorporating clinical variables, showed good performance in both internal and external validation, with an average C-index of 0.72. Meta-analysis demonstrated that this prognostic model effectively stratified patients into risk groups, with the high-risk group showing significantly worse outcomes (Hazard ratio = 6.44, 95% confidence interval = 3.42-12.14). Interpretation: This study demonstrates the potential application value of GenAI models in lung adenocarcinoma histopathological assessment. Claude-3.5-Sonnet demonstrated the highest grading accuracy, and the machine learning-based prognostic model that utilized its feature extraction showed good predictive capabilities. These findings provide new research directions for AI-assisted pathological diagnosis and prognostic prediction, with the potential to improve the management of lung adenocarcinoma patients.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2666-6065
Relation: http://www.sciencedirect.com/science/article/pii/S2666606524003468; https://doaj.org/toc/2666-6065
DOI: 10.1016/j.lanwpc.2024.101352
Access URL: https://doaj.org/article/5da10e828ef94e0981c0369b7be51586
Accession Number: edsdoj.5da10e828ef94e0981c0369b7be51586
Database: Directory of Open Access Journals
More Details
ISSN:26666065
DOI:10.1016/j.lanwpc.2024.101352
Published in:The Lancet Regional Health. Western Pacific
Language:English