This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
The Cityscapes Dataset for Semantic Urban Scene Understanding
Citations: 11,611
Authors: 9
Year: 2016
Abstract
Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling. Cityscapes is comprised of a large, diverse set of stereo video sequences recorded in streets from 50 different cities. 5000 of these images have high quality pixel-level annotations; 20 000 additional images have coarse annotations to enable methods that leverage large volumes of weakly-labeled data. Crucially, our effort exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity. Our accompanying empirical study provides an in-depth analysis of the dataset characteristics, as well as a performance evaluation of several state-of-the-art approaches based on our benchmark.
Related Works
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2016 · 52,734 citations
Histograms of Oriented Gradients for Human Detection
2005 · 31,637 citations
Fast R-CNN
2015 · 27,346 citations
Focal Loss for Dense Object Detection
2017 · 24,642 citations
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
2017 · 9,892 citations