Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

2022·234 Zitationen·2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Volltext beim Verlag öffnen

234

Zitationen

Autoren

2022

Jahr

Abstract

In this paper, we present TransMVSNet, based on our exploration of feature matching in multi-view stereo (MVS). We analogize MVS back to its nature of a feature matching task and therefore propose a powerful Feature Matching Transformer (FMT) to leverage intra- (self-) and inter-(cross-) attention to aggregate long-range context information within and across images. To facilitate a better adaptation of the FMT, we leverage an Adaptive Receptive Field (ARF) module to ensure a smooth transit in scopes of features and bridge different stages with a feature pathway to pass transformed features and gradients across different scales. In addition, we apply pair-wise feature correlation to measure similarity between features, and adopt ambiguity-reducing focal loss to strengthen the supervision. To the best of our knowledge, TransMVSNet is the first attempt to leverage Transformer into the task of MVS. As a result, our method achieves state-of-the-art performance on DTU dataset, Tanks and Temples benchmark, and BlendedMVS dataset. Code is available at https://github.com/MegviiRobot/TransMVSNet.

Autoren

Institutionen

Themen

Advanced Vision and ImagingAdvanced Image Processing TechniquesMedical Image Segmentation Techniques

Volltext beim Verlag öffnen

TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen