Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Generating Electronic Health Records with Multiple Data Types and\n Constraints
30
Zitationen
4
Autoren
2020
Jahr
Abstract
Sharing electronic health records (EHRs) on a large scale may lead to privacy\nintrusions. Recent research has shown that risks may be mitigated by simulating\nEHRs through generative adversarial network (GAN) frameworks. Yet the methods\ndeveloped to date are limited because they 1) focus on generating data of a\nsingle type (e.g., diagnosis codes), neglecting other data types (e.g.,\ndemographics, procedures or vital signs) and 2) do not represent constraints\nbetween features. In this paper, we introduce a method to simulate EHRs\ncomposed of multiple data types by 1) refining the GAN model, 2) accounting for\nfeature constraints, and 3) incorporating key utility measures for such\ngeneration tasks. Our analysis with over $770,000$ EHRs from Vanderbilt\nUniversity Medical Center demonstrates that the new model achieves higher\nperformance in terms of retaining basic statistics, cross-feature correlations,\nlatent structural properties, feature constraints and associated patterns from\nreal data, without sacrificing privacy.\n
Ähnliche Arbeiten
"Why Should I Trust You?"
2016 · 14.615 Zit.
Coding Algorithms for Defining Comorbidities in ICD-9-CM and ICD-10 Administrative Data
2005 · 10.529 Zit.
A Comprehensive Survey on Graph Neural Networks
2020 · 8.883 Zit.
Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
2019 · 8.451 Zit.
High-performance medicine: the convergence of human and artificial intelligence
2018 · 7.948 Zit.