This is an overview page with metadata for this research paper. The full article is available from the publisher.
Subgroup Invariant Perturbation for Unbiased Pre-Trained Model Prediction
10
Citations
4
Authors
2021
Year
Abstract
Modern deep learning systems have achieved unparalleled success, and several applications have benefited significantly from these technological advancements. However, these systems have also shown vulnerabilities with strong implications for their fairness and trustworthiness. Among these vulnerabilities, bias has been an <i>Achilles' heel problem</i>. Many applications, such as face recognition and language translation, have shown high levels of bias towards particular demographic sub-groups. Unbalanced representation of these sub-groups in the training data is one of the primary reasons for biased behavior. To address this important challenge, we propose a two-fold contribution. First, we propose a bias estimation metric termed <i>Precise Subgroup Equivalence</i> (PSE) to jointly measure the bias in model prediction and the overall model performance. Second, we propose a novel bias mitigation algorithm which is inspired by adversarial perturbation and uses the PSE metric. The mitigation algorithm learns a single uniform perturbation termed <i>Subgroup Invariant Perturbation</i>, which is added to the input dataset to generate a transformed dataset. The transformed dataset, when given as input to the pre-trained model, reduces the bias in model prediction. Multiple experiments performed on four publicly available face datasets showcase the effectiveness of the proposed algorithm for race and gender prediction.
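The core idea in the abstract, learning one perturbation that is shared by every input while the pre-trained model stays frozen, can be illustrated with a minimal sketch. This is not the authors' implementation. The PSE metric is not defined on this page, so the sketch substitutes plain cross-entropy as the objective, and a small fixed logistic model stands in for the pre-trained network. All names (`make_toy_data`, `learn_perturbation`, `mean_loss`) and all numeric values are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(w, b, x, delta):
    # Frozen model applied to the transformed input x + delta.
    # w and b are never updated; only delta is learned.
    z = sum(wi * (xi + di) for wi, xi, di in zip(w, x, delta)) + b
    return sigmoid(z)


def mean_loss(w, b, data, delta):
    # Assumption: cross-entropy stands in for the paper's PSE-based objective.
    eps = 1e-12
    total = 0.0
    for x, y in data:
        p = predict_proba(w, b, x, delta)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return total / len(data)


def learn_perturbation(w, b, data, steps=200, lr=0.5):
    # Gradient descent on a single perturbation shared by all samples.
    delta = [0.0] * len(w)
    for _ in range(steps):
        grad = [0.0] * len(w)
        for x, y in data:
            p = predict_proba(w, b, x, delta)
            # For a logistic model, d(loss)/d(delta_j) = (p - y) * w_j.
            for j, wj in enumerate(w):
                grad[j] += (p - y) * wj
        delta = [d - lr * g / len(data) for d, g in zip(delta, grad)]
    return delta


def make_toy_data(seed=0, n=200):
    # Hypothetical 2-D data whose labels follow the sign of 2*x0 - x1.
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
        data.append((x, 1 if 2 * x[0] - x[1] > 0 else 0))
    return data


if __name__ == "__main__":
    # The frozen model has a deliberately skewed bias term (b = -3.0).
    w, b = [2.0, -1.0], -3.0
    data = make_toy_data()
    delta = learn_perturbation(w, b, data)
    print("loss without perturbation:", mean_loss(w, b, data, [0.0, 0.0]))
    print("loss with perturbation:   ", mean_loss(w, b, data, delta))
```

In this toy setting the learned perturbation only compensates for the skewed bias term of the frozen model. The method described in the abstract instead optimizes against the PSE metric across demographic sub-groups, which this sketch does not attempt to reproduce.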
Similar works
Rethinking the Inception Architecture for Computer Vision
2016 · 30,321 citations
MobileNetV2: Inverted Residuals and Linear Bottlenecks
2018 · 24,392 citations
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
2020 · 21,296 citations
CBAM: Convolutional Block Attention Module
2018 · 21,270 citations
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
2015 · 18,489 citations