Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
AI-Enabled High-Performance Data Analytics Framework: Using Adaboost Regression For Healthcare&Automobile Domains
0
Zitationen
1
Autoren
2023
Jahr
Abstract
The exponential growth of data in recent years has necessitated a paradigm shift from traditional data processing methods to high-performance data engineering solutions. This study presents an integrated framework that combines artificial intelligence with modern data engineering tools to deal with the difficulties associated with large-scale data analytics, especially in healthcare domains. Using the MIMIC-IV medical dataset with machine learning algorithms including Apache Spark, Delta Lake, and Ada boost Regression, we demonstrate an AI-driven method that automates data cleaning, schema matching, and workflow optimization. The framework addresses critical limitations in conventional ETL processes that struggle with the volume and velocity of healthcare data from electronic health records, sensors, and patient monitors. By eliminating costly data duplication and integrating analytics directly into relevant data warehouses, this approach significantly improves predictive healthcare capabilities, hospital resource management, and real-time diagnostics, while reducing the 70% of project time traditionally spent on data preparation tasks. Keywords: High Performance Data Analytics (HPDA), Artificial Intelligence, Health Data Engineering, Machine Learning, Apache Spark, Ada boost Regression, ETL (Extract, Transform, and Load)
Ähnliche Arbeiten
Biostatistical Analysis
1996 · 35.445 Zit.
UCI Machine Learning Repository
2007 · 24.290 Zit.
An introduction to ROC analysis
2005 · 20.602 Zit.
The use of the area under the ROC curve in the evaluation of machine learning algorithms
1997 · 7.103 Zit.
A method of comparing the areas under receiver operating characteristic curves derived from the same cases.
1983 · 7.061 Zit.