Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Metrics That Matter: A Practical Survey on Synthetic Data Evaluation
0
Zitationen
7
Autoren
2026
Jahr
Abstract
Assessing the quality of synthetic data (SD) is vital to determine whether it can provide a viable alternative to real data. A wide variety of metrics exist to examine the three archetypal dimensions of SD evaluation: realism (fidelity), task-specific usefulness (utility), and remaining disclosure risk (privacy). Current work in SD generation often relies on the ad-hoc selection of evaluation metrics without a clear justification, while the suitability of metrics strongly depend on the dataset and other contextual factors. This paper surveys the field of SD evaluation, provides guidance regarding metric selection based on four key questions pertaining to the task, goal, data type, and domain of SD, and provides general practical recommendations on SD evaluation. Finally, experiments on an illustrative dataset of electronic health records show how researchers can bring our insights and recommendations for SD evaluation into practice. By doing so, we aim to support researchers and practitioners seeking to generate and evaluate SD.
Ähnliche Arbeiten
The REDCap consortium: Building an international community of software platform partners
2019 · 22.622 Zit.
The FAIR Guiding Principles for scientific data management and stewardship
2016 · 16.786 Zit.
Bayesian Data Analysis
1995 · 13.688 Zit.
k-ANONYMITY: A MODEL FOR PROTECTING PRIVACY
2002 · 8.389 Zit.
Business Intelligence and Analytics: From Big Data to Big Impact
2012 · 5.887 Zit.