Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Healthcare Data Warehouse Implementation with Machine Learning Regression Analysis: A Comparative Study of Random Forest, AdaBoost, and XGBoost for Regulatory Compliance Prediction
0
Zitationen
1
Autoren
2024
Jahr
Abstract
Healthcare organizations are increasingly relying on data warehouses to centralize and manage electronic health record (EHR) data for operational, clinical, and research purposes. These repositories integrate patient care information, administrative records, and financial data, while maintaining strict compliance with regulatory requirements and protecting health information privacy. Modern data warehouses serve as decision support systems, facilitating business analytics, quality improvement initiatives, and strategic planning across healthcare organizations. The implementation of research data warehouses (RDWs) has made it possible to effectively reuse EHR data for scientific investigations supported by specialized IT infrastructure and governance structures. Advanced machine learning techniques, including Random Forest, Ada boost, and XG Boost regression algorithms, are used to analyse complex healthcare datasets and extract predictive insights. These ensemble methods improve accuracy while reducing over fitting risks in predictive modelling applications. The evolution towards cloud-based repositories requires comprehensive data governance strategies that include security, integrity, and regulatory compliance to ensure consistent health analytics capabilities. Keywords: Data warehousing, Electronic health records (EHR), Health analytics, and Research data warehousing (RDW), Machine learning regression, Data governance, Regulatory compliance, Group methods, Clinical decision support and Predictive modelling
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.445 Zit.
UCI Machine Learning Repository
2007 · 24.290 Zit.
An introduction to ROC analysis
2005 · 20.610 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.105 Zit.
A method of comparing the areas under receiver operating characteristic curves derived from the same cases.
1983 · 7.062 Zit.