Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A Machine Learning Approach for Predicting Patient's Length of Hospital Stay with Random Forest Regression
15
Zitationen
2
Autoren
2022
Jahr
Abstract
Predicting the length of stay (LOS) of patients can be incredibly useful for an efficient hospital resource management. Numerous hospitals worldwide are confronted with limited resources to accommodate hospitalized patients. By assuring optimized resource utilization, LOS forecasting can benefit all the stakeholders with improved treatment planning and expenditure estimation. However, traditional hospital management systems fail to estimate the duration of patients' stay at an early stage, resulting in various negative consequences. Therefore, this study proposes a machine learning approach for analyzing patient data and building a reliable prediction model incorporating Random Forest Regression Model to estimate the length of a patient's stay in the hospital. The data utilized in this study is a hospital discharge dataset containing records with various types of patient information. Two types of feature selection methods (PCA and Chi-square) and interquartile range based outlier elimination strategy have also been employed in the model for efficient prediction. To validate the performance of the proposed method including the impact of feature prioritization and outlier elimination, the dataset has been applied to ten different regression models as well as deep learning techniques after requisite data pre-processing. Furthermore, a comparison of multiple prediction models was conducted using various performance metrics in a variety of cases to determine the best performing regression model; in which the Random Forest Regression model outoerformed other models.
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.445 Zit.
UCI Machine Learning Repository
2007 · 24.290 Zit.
An introduction to ROC analysis
2005 · 20.594 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.100 Zit.
A method of comparing the areas under receiver operating characteristic curves derived from the same cases.
1983 · 7.061 Zit.