Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Misleading Failures of Partial-input Baselines

2019·30 ZitationenOpen Access

Volltext beim Verlag öffnen

Zitationen

Autoren

2019

Jahr

Abstract

Recent work establishes dataset difficulty and removes annotation artifacts via partial-input baselines (e.g., hypothesis-only model for SNLI or question-only model for VQA). A successful partial-input baseline indicates that the dataset is cheatable. But the converse is not necessarily true: failures of partial-input baselines do not mean the dataset is free of artifacts. We first design artificial datasets to illustrate how the trivial patterns that are only visible in the full input can evade any partial-input baseline. Next, we identify such artifacts in the SNLI dataset—a hypothesis-only model augmented with trivial patterns in the premise can solve 15% of previously-thought “hard” examples. Our work provides a caveat for the use and creation of partial-input baselines for datasets.

Autoren

Institutionen

Themen

Topic ModelingExplainable Artificial Intelligence (XAI)Machine Learning in Healthcare

Volltext beim Verlag öffnen

Misleading Failures of Partial-input Baselines

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen