OpenAlex · Aktualisierung stündlich · Letzte Aktualisierung: 14.03.2026, 00:48

Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Unsupervised Characterization of Temporal Dataset Shifts as an Early Indicator of AI Performance Variations: Evaluation Study Using the Medical Information Mart for Intensive Care-IV Dataset (Preprint)

2025·0 ZitationenOpen Access
Volltext beim Verlag öffnen

0

Zitationen

5

Autoren

2025

Jahr

Abstract

<sec> <title>BACKGROUND</title> Reusing long-term data from electronic health records is essential for training reliable and effective health artificial intelligence (AI). However, intrinsic changes in health data distributions over time—known as dataset shifts, which include concept, covariate, and prior shifts—can compromise model performance, leading to model obsolescence and inaccurate decisions. </sec> <sec> <title>OBJECTIVE</title> In this study, we investigate whether unsupervised, model-agnostic characterization of temporal dataset shifts using data distribution analyses through Information Geometric Temporal (IGT) projections is an early indicator of potential AI performance variations before model development. </sec> <sec> <title>METHODS</title> Using the real-world Medical Information Mart for Intensive Care-IV (MIMIC-IV) electronic health record database, encompassing data from over 40,000 patients from 2008 to 2019, we characterized its inherent dataset shift patterns through an unsupervised approach using IGT projections and data temporal heatmaps. We trained and evaluated annually a set of random forests and gradient boosting models to predict in-hospital mortality. To assess the impact of shifts on model performance, we checked the association between the temporal clusters found in both IGT projections and the intertime embedding of model performances using the Fisher exact test. </sec> <sec> <title>RESULTS</title> Our results demonstrate a significant relationship between the unsupervised temporal shift patterns, specifically covariate and concept shifts, identified using the IGT projection method and the performance of the random forest and gradient boosting models (&lt;i&gt;P&lt;/i&gt;&amp;lt;.05). We identified 2 primary temporal clusters that correspond to the periods before and after &lt;i&gt;ICD-10&lt;/i&gt; (&lt;i&gt;International Statistical Classification of Diseases, Tenth Revision&lt;/i&gt;) implementation. The transition from &lt;i&gt;ICD-9&lt;/i&gt; (&lt;i&gt;International Classification of Diseases, Ninth Revision&lt;/i&gt;) to &lt;i&gt;ICD-10&lt;/i&gt; was a major source of dataset shift, associated with a performance degradation. </sec> <sec> <title>CONCLUSIONS</title> Unsupervised, model-agnostic characterization of temporal shifts via IGT projections can serve as a proactive monitoring tool to anticipate performance shifts in clinical AI models. By incorporating early shift detection into the development pipeline, we can enhance decision-making during the training and maintenance of these models. This approach paves the way for more robust, trustworthy, and self-adapting AI systems in health care. </sec>

Ähnliche Arbeiten

Autoren

Themen

Machine Learning in HealthcareElectronic Health Records SystemsArtificial Intelligence in Healthcare and Education
Volltext beim Verlag öffnen