Toward reliable diabetes prediction: Innovations in data engineering and machine learning applications

Bibliographic Details
Title: Toward reliable diabetes prediction: Innovations in data engineering and machine learning applications
Authors: Md. Alamin Talukder, Md. Manowarul Islam, Md Ashraf Uddin, Mohsin Kazi, Majdi Khalid, Arnisha Akhter, Mohammad Ali Moni
Source: Digital Health, Vol 10 (2024)
Publisher Information: SAGE Publishing, 2024.
Publication Year: 2024
Collection: LCC:Computer applications to medicine. Medical informatics
Subject Terms: Computer applications to medicine. Medical informatics, R858-859.7
More Details: Objective Diabetes is a metabolic disorder that causes the risk of stroke, heart disease, kidney failure, and other long-term complications because diabetes generates excess sugar in the blood. Machine learning (ML) models can aid in diagnosing diabetes at the primary stage. So, we need an efficient ML model to diagnose diabetes accurately. Methods In this paper, an effective data preprocessing pipeline has been implemented to process the data and random oversampling to balance the data, handling the imbalance distributions of the observational data more sophisticatedly. We used four different diabetes datasets to conduct our experiments. Several ML algorithms were used to determine the best models to predict diabetes faultlessly. Results The performance analysis demonstrates that among all ML algorithms, random forest surpasses the current works with an accuracy rate of 86% and 98.48% for Dataset 1 and Dataset 2; extreme gradient boosting and decision tree surpass with an accuracy rate of 99.27% and 100% for Dataset 3 and Dataset 4, respectively. Our proposal can increase accuracy by 12.15% compared to the model without preprocessing. Conclusions This excellent research finding indicates that the proposed models might be employed to produce more accurate diabetes predictions to supplement current preventative interventions to reduce the incidence of diabetes and its associated costs.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2055-2076
20552076
Relation: https://doaj.org/toc/2055-2076
DOI: 10.1177/20552076241271867
Access URL: https://doaj.org/article/9ba2419f5e0e44d5b127c778f8894fa8
Accession Number: edsdoj.9ba2419f5e0e44d5b127c778f8894fa8
Database: Directory of Open Access Journals
More Details
ISSN:20552076
DOI:10.1177/20552076241271867
Published in:Digital Health
Language:English