Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Data Curation Challenges for Artificial Intelligence
4
Zitationen
5
Autoren
2021
Jahr
Abstract
Deep learning algorithms have brought on a paradigm shift to automated medical image analysis approaches including segmentation. While state-of-the-art models can achieve near human-like performance on many tasks, these same algorithms can be remarkably brittle and lack the ability to generalize across datasets and institutions. A key component to training robust algorithms and evaluating generalizability is the curation of large quantities of heterogeneous data from diverse sources. In this chapter, the key challenges of data curation are discussed, with focus on the complexity of medical data, patient privacy protection, data quality issues, and data annotation. Solutions to the aforementioned challenges are also detailed. Methods for protecting patient privacy include automated anonymization and distributed deep learning techniques. Algorithms can be utilized to detect and correct for data quality issues. Natural language processing, crowdsourcing, and weakly supervised learning can be implemented to decrease annotation burden. Lastly, machine learning competitions can be an effective framework for constructing large, high-quality, multi-institutional datasets.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 13.500 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.129 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 11.731 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.101 Zit.
Radiomics: Images Are More than Pictures, They Are Data
2015 · 7.981 Zit.