Development and external validation of a machine learning-based prediction model for the cancer-related fatigue diagnostic screening in adult cancer patients: a cross-sectional study in China.

Bibliographic Details
Title: Development and external validation of a machine learning-based prediction model for the cancer-related fatigue diagnostic screening in adult cancer patients: a cross-sectional study in China.
Authors: Du, Lin, Du, Junjia, Yang, Min, Xu, Qinqin, Huang, Jiannan, Tan, Wenxi, Xu, Tianxin, Wang, Lisheng, Nie, Wenbo, Zhao, Lijing
Source: Supportive Care in Cancer; Feb2023, Vol. 31 Issue 2, p1-11, 11p
Abstract: Purpose: Cancer-related fatigue (CRF) is the most common symptom in cancer patients and may interfere with patients’ daily activities and decrease survival rate. However, the etiology of CRF has not been identified. Diagnosing CRF is challenging. Thus, our study aimed to develop a CRF prediction model in cancer patients, using data that healthcare professionals routinely obtained from electronic health records (EHRs) based on the 3P model and externally validate this model in an independent dataset collected from another hospital. Methods: Between April 2022 and September 2022, a cross-sectional study was conducted on adult cancer patients at two first-class tertiary hospitals in China. Data that healthcare professionals routinely obtained from electronic health records (EHRs) based on the 3P model were collected. The outcome measure was according to ICD-10 diagnostic criteria for CRF. Data from one hospital (n = 305) were used for model development and internal validation. An independent data set from another hospital (n = 260) was utilized for external validation. logistic regression, random forest (RF), Naive Bayes (NB), and extreme gradient boosting (XGBoost) were constructed and compared. The model performance was evaluated in terms of both discrimination and calibration. Results: The prevalence of CRF in the two centers was 57.9% and 56.1%, respectively. The Random Forest model achieved the highest AUC of 0.86 among the four types of classifiers in the internal validation. The AUC of RF and NB were above 0.7 in the external validation, suggesting that the models also have an acceptable generalization ability. Conclusions: The incidence of CRF remains high and deserves more attention. The fatigue prediction model based on the 3P theory can accurately predict the risk of CRF. Nonlinear algorithms such as Random Forest and Naive Bayes are more suitable for diagnosing and evaluating symptoms. [ABSTRACT FROM AUTHOR]
Copyright of Supportive Care in Cancer is the property of Springer Nature and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Complementary Index
More Details
ISSN:09414355
DOI:10.1007/s00520-022-07570-w
Published in:Supportive Care in Cancer
Language:English