Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Explainability enhanced liver disease diagnosis technique using tree selection and stacking ensemble-based random forest model
33
Zitationen
5
Autoren
2025
Jahr
Abstract
Liver disease (LD) significantly impacts global health, requiring accurate diagnostic methods. This study aims to develop an automated system for LD prediction using machine learning (ML) and explainable artificial intelligence (XAI), enhancing diagnostic precision and interpretability. This research systematically analyzes two distinct datasets encompassing liver health indicators. A combination of preprocessing techniques, including feature optimization methods such as Forward Feature Selection (FFS), Backward Feature Selection (BFS), and Recursive Feature Elimination (RFE), is applied to enhance data quality. After that, ML models, namely Support Vector Machines (SVM), Naive Bayes (NB), Random Forest (RF), K-nearest neighbors (KNN), Decision Trees (DT), and a novel Tree Selection and Stacking Ensemble-based RF (TSRF), are assessed in the dataset to diagnose LD. Finally, the ultimate model is selected based on incorporating cross-validation and evaluation through performance metrics like accuracy, precision, specificity, etc., and efficient XAI methods express the ultimate model's interoperability. The analysis reveals TSRF as the most effective model, achieving a peak accuracy of 99.92 % on Dataset-1 without feature optimization and 88.88 % on Dataset-2 with RFE optimization. XAI techniques, including SHAP and LIME plots, highlight key features influencing model predictions, providing insights into the reasoning behind classification outcomes. The findings highlight TSRF's potential in improving LD diagnosis, using XAI to enhance transparency and trust in ML models. Despite high accuracy and interpretability, limitations such as dataset bias and lack of clinical validation remain. Future work focuses on integrating advanced XAI, diversifying datasets, and applying the approach in clinical settings for reliable diagnostics. • Performance comparison of different ML models for the prediction of LD using multiple datasets. • Analysis of the effect of different feature optimization techniques for ML-based LD diagnosis. • Developing a novel hybrid ML model namely TSRF for diagnosis of LD. • Exploring the reasoning behind the model's decision through XAI.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.450 Zit.
UCI Machine Learning Repository
2007 · 24.319 Zit.
An introduction to ROC analysis
2005 · 20.964 Zit.
Prediction of Coronary Heart Disease Using Risk Factor Categories
1998 · 9.604 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.185 Zit.