This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Vivim: A Video Vision Mamba for Ultrasound Video Segmentation
Citations: 22
Authors: 6
Year: 2025
Abstract
Ultrasound video segmentation has gained increasing attention in clinical practice due to the redundant dynamic references in video frames. However, traditional convolutional neural networks have a limited receptive field, and transformer-based networks are unsatisfactory at constructing long-term dependencies because of their computational complexity. This bottleneck poses a significant challenge when processing longer sequences in medical video analysis tasks on available devices with limited memory. Recently, state space models (SSMs), popularized by Mamba, have exhibited linear complexity and impressive results in efficient long-sequence modeling, and have advanced deep neural networks by significantly expanding the receptive field on many vision tasks. Unfortunately, vanilla SSMs fail to simultaneously capture causal temporal cues and preserve non-causal spatial information. To this end, this paper presents a Video Vision Mamba-based framework, dubbed Vivim, for ultrasound video segmentation tasks. Vivim can effectively compress the long-term spatiotemporal representation into sequences at varying scales with the designed Temporal Mamba Block. The paper also introduces an improved boundary-aware affine constraint across frames to enhance the discriminative ability of Vivim on ambiguous lesions. Extensive experiments on thyroid segmentation in ultrasound videos, breast lesion segmentation in ultrasound videos, and polyp segmentation in colonoscopy videos demonstrate the effectiveness and efficiency of Vivim, which outperforms existing methods. The code and dataset are available at: https://github.com/scott-yjyang/Vivim.
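To illustrate the linear complexity and causal behavior that the abstract attributes to state space models, here is a minimal sketch of a generic diagonal discrete SSM recurrence. It is not Mamba's selective scan and not the Temporal Mamba Block from the paper; the function name `ssm_scan` and the parameters `a`, `b`, `c` are hypothetical, chosen only for this example.

```python
from typing import List, Sequence


def ssm_scan(
    x: Sequence[float],
    a: Sequence[float],
    b: Sequence[float],
    c: Sequence[float],
) -> List[float]:
    """Run a diagonal discrete state space model over a 1-D sequence.

    Recurrence (elementwise over the state dimensions):
        h_t = a * h_{t-1} + b * x_t
        y_t = sum(c * h_t)

    All parameters are illustrative assumptions, not values from the paper.
    """
    state = [0.0] * len(a)
    outputs: List[float] = []
    for x_t in x:  # single pass, so the cost grows linearly with len(x)
        state = [a_i * h_i + b_i * x_t for a_i, h_i, b_i in zip(a, state, b)]
        outputs.append(sum(c_i * h_i for c_i, h_i in zip(c, state)))
    return outputs


# An impulse decays geometrically through a one-dimensional state.
print(ssm_scan([1.0, 0.0, 0.0], a=[0.5], b=[1.0], c=[1.0]))  # [1.0, 0.5, 0.25]
```

Because each output depends only on the current and earlier inputs, a scan of this kind is causal. That suits the temporal axis of a video, but it also suggests why a plain scan may struggle with spatial context inside a frame, where no position naturally precedes another.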
Similar works
A Computational Approach to Edge Detection
1986 · 28,716 citations
Textural Features for Image Classification
1973 · 22,220 citations
Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain
2002 · 16,574 citations
Normalized cuts and image segmentation
2000 · 15,553 citations
Nonlinear total variation based noise removal algorithms
1992 · 15,414 citations
Authors
Institutions
- Hong Kong University of Science and Technology (HK)
- University of Hong Kong (HK)
- South China University of Technology (CN)
- Chinese University of Hong Kong (HK)
- Agency for Science, Technology and Research (SG)
- Institute of High Performance Computing (SG)
- Guangdong Academy of Medical Sciences (CN)
- Southern Medical University (CN)