Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series

2016·39 Zitationen·arXiv (Cornell University)Open Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2016

Jahr

Abstract

We demonstrate a simple strategy to cope with missing data in sequential inputs, addressing the task of multilabel classification of diagnoses given clinical time series. Collected from the intensive care unit (ICU) of a major urban medical center, our data consists of multivariate time series of observations. The data is irregularly sampled, leading to missingness patterns in re-sampled sequences. In this work, we show the remarkable ability of RNNs to make effective use of binary indicators to directly model missing data, improving AUC and F1 significantly. However, while RNNs can learn arbitrary functions of the missing data and observations, linear models can only learn substitution values. For linear models and MLPs, we show an alternative strategy to capture this signal. Additionally, we evaluate LSTMs, MLPs, and linear models trained on missingness patterns only, showing that for several diseases, what tests are run can be more predictive than the results themselves.

Autoren

Themen

Machine Learning in HealthcareTime Series Analysis and ForecastingArtificial Intelligence in Healthcare

Volltext beim Verlag öffnen

Directly Modeling Missing Data in Sequences with RNNs: Improved Classification of Clinical Time Series

Abstract

Ähnliche Arbeiten

Autoren

Themen