Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Coronary Heart Disease Prediction: A Comparative Study of Machine Learning Algorithms
31
Zitationen
5
Autoren
2024
Jahr
Abstract
Efforts to enhance the precision of heart disease detection methods are crucial in reducing the expensive healthcare expenses associated with the diagnostic processes.Extracting patterns from medical data can unlock associations to improve heart disease diagnosis techniques.This study aims to construct an efficient machine learning model to act as a reliable component of the medical decision support system.Seven different machine learning models were investigated including Logistic Regression, Support Vector Classifier, K-Nearest Neighbor (KNN), Random Forest, Decision Tree, Naï ve Bayes, and Gradient Boosting Classifier, which are comprehensively explored for heart disease classification.Hyperparameter optimization for these algorithms involves three techniques: Grid Search, Random Search, and Bayes Search.The assessment of each model's performance incorporates measuring specificity, sensitivity, and F1-scores, leveraging the dataset with 12 attributes and 1189 observations from three medical clinics (Cleveland, Statlog, Hungary).Feature selection methods, including the wrapper method, embedded method Chi-Sqaured, and variance analysis, are deployed to identify highly correlated features, ultimately reducing the data's dimensionality to 7 features.The evaluation process employs 10-fold crossvalidation, demonstrating that the Random Forest Model achieves the highest average accuracy at 92.85%, surpassing the previously reported 86.9%.Additionally, 10-fold crossvalidation ensures the models' reliability and resilience to data imbalance.Ensemble-based methods reaffirm the Random Forest's superior performance in diagnosing heart diseases, boasting an accuracy of 94.96%.In sum, this developed model exhibits reliability in heart disease classification and presents a promising solution for medical applications, to effectively mitigate diagnostic costs and time constraints.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.450 Zit.
UCI Machine Learning Repository
2007 · 24.319 Zit.
An introduction to ROC analysis
2005 · 20.968 Zit.
Prediction of Coronary Heart Disease Using Risk Factor Categories
1998 · 9.604 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.186 Zit.