Predicting total healthcare demand using machine learning: separate and combined analysis of predisposing, enabling, and need factors

Bibliographic Details
Title: Predicting total healthcare demand using machine learning: separate and combined analysis of predisposing, enabling, and need factors
Authors: Fatih Orhan, Mehmet Nurullah Kurutkan
Source: BMC Health Services Research, Vol 25, Iss 1, Pp 1-27 (2025)
Publisher Information: BMC, 2025.
Publication Year: 2025
Collection: LCC:Public aspects of medicine
Subject Terms: Healthcare demand, Andersen behavioral model, Machine learning, Feature selection, Health services utilization, Predictive modeling, Public aspects of medicine, RA1-1270
More Details: Abstract Objective Predicting healthcare demand is essential for effective resource allocation and planning. This study applies Andersen’s Behavioral Model of Health Services Use, focusing on predisposing, enabling, and need factors, using data from the 2022 Turkey Health Survey by TUIK. Machine learning methods provide a powerful approach to analyze these factors and their combined impact on healthcare utilization, offering valuable insights for health policy. Methods Seven different machine learning models—Decision Tree, Random Forest, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Logistic Regression, XGBoost, and Gradient Boosting—were utilized. Feature selection was conducted to identify the most significant factors influencing healthcare demand. The models were evaluated for accuracy and generalization ability using performance metrics such as recall, precision, F1 score, and ROC AUC. Results The study identified key features affecting healthcare demand. For predisposing factors, gender, educational level, and age group were significant. Enabling factors included treatment costs, community interest, and payment difficulties. Need factors were influenced by smoking status, chronic diseases, and overall health status. The models demonstrated high recall (approximately 0.90) and strong F1 scores (ranging from 0.87 to 0.88), indicating a balanced performance between precision and recall. Among the models, Gradient Boosting, XGBoost, and Logistic Regression consistently outperformed others, achieving the highest predictive accuracy. Random Forest and SVM also performed well, showing robust classification capability. Conclusions The findings highlight the effectiveness of machine learning methods in predicting healthcare demand, providing valuable insights for health policy and resource allocation. Gradient Boosting, XGBoost, and Logistic Regression emerged as the most reliable models, demonstrating superior generalization and classification performance. Understanding the separate and combined effects of predisposing, enabling, and need factors on healthcare demand can contribute to more efficient and data-driven healthcare planning, facilitating strategic decision-making in resource allocation and service delivery.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 1472-6963
Relation: https://doaj.org/toc/1472-6963
DOI: 10.1186/s12913-025-12502-5
Access URL: https://doaj.org/article/2091f26dfcc4403baae032387a433900
Accession Number: edsdoj.2091f26dfcc4403baae032387a433900
Database: Directory of Open Access Journals
Full text is not displayed to guests.
More Details
ISSN:14726963
DOI:10.1186/s12913-025-12502-5
Published in:BMC Health Services Research
Language:English