Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Abstract TH847: Explainable Machine Learning–Identified Protective Social, Behavioral, and Clinical Factors for Cardiovascular Diseases
0
Zitationen
24
Autoren
2026
Jahr
Abstract
Background: Cardiovascular diseases (CVDs) remain the leading cause of death globally, yet traditional risk tools underweight protective social and behavioral contexts that shape cardiovascular resilience. Explainable machine learning (ML) offers a path to quantify such protective factors while maintaining transparency for population health deployment. Hypothesis: We hypothesize that explainable ML applied to a nationally representative survey can accurately distinguish adults without CVDs and identify a reproducible profile of protective social, behavioral, and clinical factors associated with lower disease prevalence. Methods: We analyzed adults from the 2021 Behavioral Risk Factor Surveillance System (n=116,608); 11 candidate predictors spanned demographics/socioeconomics (age, sex, race/ethnicity, income, insurance), behaviors (smoking, alcohol use, fruit/vegetable intake), and clinical/mental health (diabetes, depressive disorder). Baselines included logistic regression, random forests, support vector machines, and XGBoost that underwent Optuna hyperparameter tuning with nested cross-validation. Discrimination (area under the receiver-operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC)), accuracy, and predictive values, were reported; interpretability used SHAP to quantify direction and magnitude of feature effects. Results: CVD prevalence rose with age (1.3% at 18–24 years to 24.6% at ≥65 years) and was higher in men than women (17.2% vs 12.2%); prevalence was greater in Black non-Hispanic than Hispanic adults (16.7% vs 9.6%) and in lower- versus higher-income groups (23.6% for <$15,000 vs 7.5% for ≥$200,000). On held-out testing, the best XGBoost model achieved AUROC of 0.76 and AUPRC of 0.95, with high negative predictive value (90.49%) and substantially low positive predictive value (32.36%); cross-validation confirmed robustness (mean AUROC=0.76, high AUPRC=0.95) and competitive overall accuracy of 76.98±0.45 across five folds. SHAP analyses consistently highlighted protective profiles characterized by younger age, higher income, health insurance coverage, and absence of diabetes and depressive disorder; dietary variables contributed modestly. Conclusions: Our explainable ML model identified protective profiles against CVDs, while achieving reliable performance. These results support prevention strategies that improve access to care, reduce metabolic risk, and motivate prospective validation.
Ähnliche Arbeiten
Effect of potentially modifiable risk factors associated with myocardial infarction in 52 countries (the INTERHEART study): case-control study
2004 · 11.894 Zit.
Global Burden of Cardiovascular Diseases and Risk Factors, 1990–2019
2020 · 10.742 Zit.
Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas, 9th edition
2019 · 10.333 Zit.
IDF Diabetes Atlas: Global, regional and country-level diabetes prevalence estimates for 2021 and projections for 2045
2021 · 9.447 Zit.
Heart Disease and Stroke Statistics—2020 Update: A Report From the American Heart Association
2020 · 9.301 Zit.
Autoren
Institutionen
- Taipei Medical University(TW)
- Chung Shan Medical University(TW)
- North Carolina Agricultural and Technical State University(US)
- Harvard University(US)
- The University of Texas Health Science Center at Houston(US)
- Huntington University(US)
- Trinity Health(US)
- Methodist Hospital(US)
- University of Houston - Downtown(US)
- Tân Tạo University(VN)
- Christ Hospital(US)
- Carnegie Mellon University(US)
- Washington University in St. Louis(US)
- University of Massachusetts Lowell(US)