Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Differentially-Private Federated Learning with Non-IID Data for Surgical Risk Prediction
3
Zitationen
7
Autoren
2024
Jahr
Abstract
Federated learning (FL) has emerged as a promising solution to deal with the privacy concerns which often limit access to data for training machine learning (ML) models. It encompasses the exchange of model parameters between data owners and a central model aggregation instance without the need to share sensitive data directly. Although FL does not require sharing sensitive information, it may not guarantee complete privacy, as model parameters can potentially reveal sensitive details about the underlying training data. To enhance privacy, FL approaches can additionally integrate differential privacy (DP), which entails the careful addition of noise to the model. In this paper, we investigate the relationship between FL, DP, and highly non-independent and identically distributed (non-IID) data. We present a specific real-life healthcare application of predicting patient mortality and revision surgery after visceral operations. Our contributions encompass assessing the performance of FL with and without DP in the presence of highly non-IID data, analysing the fairness for individual training participants and investigating the efficacy of personalisation to alleviate the privacy-utility trade-off introduced by DP. Our findings indicate that the addition of DP into FL given non-IID datasets disparately favours participants with larger datasets, while the model has limited utility for those with less data. The addition of a subsequent model fine-tuning step can remedy this disparity and enables all participants to reach a model performance comparable to the centralised training scenario.
Ähnliche Arbeiten
k-ANONYMITY: A MODEL FOR PROTECTING PRIVACY
2002 · 8.395 Zit.
Calibrating Noise to Sensitivity in Private Data Analysis
2006 · 6.872 Zit.
Deep Learning with Differential Privacy
2016 · 5.595 Zit.
Communication-Efficient Learning of Deep Networks from Decentralized\n Data
2016 · 5.591 Zit.
Large-Scale Machine Learning with Stochastic Gradient Descent
2010 · 5.564 Zit.