Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
TURL: Table Understanding through Representation Learning
43
Zitationen
5
Autoren
2020
Jahr
Abstract
Relational tables on the Web store a vast amount of knowledge. Owing to the wealth of such tables, there has been tremendous progress on a variety of tasks in the area of table understanding. However, existing work generally relies on heavily-engineered task-specific features and model architectures. In this paper, we present TURL, a novel framework that introduces the pre-training/fine-tuning paradigm to relational Web tables. During pre-training, our framework learns deep contextualized representations on relational tables in an unsupervised manner. Its universal model design with pre-trained representations can be applied to a wide range of tasks with minimal task-specific fine-tuning. Specifically, we propose a structure-aware Transformer encoder to model the row-column structure of relational tables, and present a new Masked Entity Recovery (MER) objective for pre-training to capture the semantics and knowledge in large-scale unlabeled data. We systematically evaluate TURL with a benchmark consisting of 6 different tasks for table understanding (e.g., relation extraction, cell filling). We show that TURL generalizes well to all tasks and substantially outperforms existing methods in almost all instances.
Ähnliche Arbeiten
The REDCap consortium: Building an international community of software platform partners
2019 · 23.537 Zit.
The FAIR Guiding Principles for scientific data management and stewardship
2016 · 17.411 Zit.
Bayesian Data Analysis
1995 · 13.754 Zit.
k-ANONYMITY: A MODEL FOR PROTECTING PRIVACY
2002 · 8.455 Zit.
Business Intelligence and Analytics: From Big Data to Big Impact
2012 · 5.989 Zit.