Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Development and Evaluation of Machine Learning Models for Diabetes Risk Prediction Using BRFSS 2023 Data
0
Zitationen
2
Autoren
2025
Jahr
Abstract
Diabetes remains a major global health burden, with its rising prevalence demanding scalable approaches to early risk detection. Traditional clinical screening is resource-intensive, whereas survey-based modelling offers a complementary tool for population-level surveillance. In this study, we develop and evaluate machine learning classifiers on the 2023 U.S. Behavioural Risk Factor Surveillance System (BRFSS) to predict self-reported diabetes and prediabetes. Following a comprehensive computational comparison, three models—elastic net logistic regression, gradient boosting, and XGBoost—were selected for hyperparameter optimisation, probability calibration, and recall-oriented threshold tuning. Given that diabetes risk prediction prioritises minimising false negatives (recall) over false positives (precision), we emphasised PR-AUC as the primary metric, with AUROC as a secondary measure. XGBoost and gradient boosting achieved the strongest overall performance (PR-AUC ≈ 0.48, AUROC ≈ 0.83), and emerged as reliable models, balancing discrimination, calibration, and sensitivity for practical screening applications while elastic net maximised sensitivity at the cost of precision.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.450 Zit.
UCI Machine Learning Repository
2007 · 24.319 Zit.
An introduction to ROC analysis
2005 · 21.017 Zit.
Prediction of Coronary Heart Disease Using Risk Factor Categories
1998 · 9.605 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.193 Zit.