Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks
2.481
Zitationen
6
Autoren
2019
Jahr
Abstract
Siamese network based trackers formulate tracking as convolutional feature cross-correlation between target template and searching region. However, Siamese trackers still have accuracy gap compared with state-of-the-art algorithms and they cannot take advantage of feature from deep networks, such as ResNet-50 or deeper. In this work we prove the core reason comes from the lack of strict translation invariance. By comprehensive theoretical analysis and experimental validations, we break this restriction through a simple yet effective spatial aware sampling strategy and successfully train a ResNet-driven Siamese tracker with significant performance gain. Moreover, we propose a new model architecture to perform depth-wise and layer-wise aggregations, which not only further improves the accuracy but also reduces the model size. We conduct extensive ablation studies to demonstrate the effectiveness of the proposed tracker, which obtains currently the best results on four large tracking benchmarks, including OTB2015, VOT2018, UAV123, and LaSOT. Our model will be released to facilitate further studies based on this problem.
Ähnliche Arbeiten
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2016 · 53.375 Zit.
Histograms of Oriented Gradients for Human Detection
2005 · 31.750 Zit.
Fast R-CNN
2015 · 27.604 Zit.
Focal Loss for Dense Object Detection
2017 · 25.063 Zit.
The Cityscapes Dataset for Semantic Urban Scene Understanding
2016 · 11.736 Zit.