Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
An Ensemble Hard Voting Model for Cardiovascular Disease Prediction
26
Zitationen
2
Autoren
2020
Jahr
Abstract
With the evolution of trending technologies, health informatics has played a vital role in making our day-to-day lives more comfortable. The availability of enough medical data and computational tools has made medical informatics possible to take a long step towards the next level of Healthcare Industry 4.0. Information engineering or emerging technologies can be applied to identify chronic diseases like heart failure to lessen the mortality rate. Machine Learning (ML) based approaches are gaining popularity for predicting these diseases in the 4 <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">th</sup> generation healthcare industry. In this paper, several risk factors, e.g., age, sex, total cholesterol level, number of cigarettes smoked per day, glucose level, and systolic blood pressure, have been considered input features for causing heart disease next ten years. The Hard Voting (HV) classifier has been formed with Logistic Regression (LogReg), Random Forest (RF), Multilayer Perceptron (MLP), and Gaussian Naïve Bayes (GNB) classifiers. RobustScaler was applied to scale the input attributes' values, and the dataset was balanced using Random Undersampling. The HV classifier is the satisfactory performance provider with 88.42% test accuracy along with precision, recall, F1, and Area Under Curve (AUC) scores of 1, 0.043, 0.082, and 0.73 correspondingly. The results have also been compared using some other parameters, e.g., the Receiver Operating Characteristics (ROC) curves, learning curves, precision-recall curve, confusion matrix, Logarithmic Loss (Log Loss), Brier Score Loss (BSL), Mathews Correlation Coefficient (MCC), Mean Absolute Error (MAE), and Mean Squared Error (MSE) to bolster the claim.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.449 Zit.
UCI Machine Learning Repository
2007 · 24.319 Zit.
An introduction to ROC analysis
2005 · 20.940 Zit.
Prediction of Coronary Heart Disease Using Risk Factor Categories
1998 · 9.604 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.181 Zit.