Symmetry Encoder-Decoder Network with Attention Mechanism for Fast Video Object Segmentation

Mingyue Guo; Dejun Zhang; Jun Sun; Yiqi Wu

doi:10.3390/sym11081006

Symmetry Encoder-Decoder Network with Attention Mechanism for Fast Video Object Segmentation

Symmetry ◽

10.3390/sym11081006 ◽

2019 ◽

Vol 11 (8) ◽

pp. 1006 ◽

Cited By ~ 2

Author(s):

Mingyue Guo ◽

Dejun Zhang ◽

Jun Sun ◽

Yiqi Wu

Keyword(s):

Object Segmentation ◽

Target Object ◽

General Purpose ◽

Attention Mechanism ◽

Video Object Segmentation ◽

Video Object ◽

Feature Maps ◽

Multi Scale ◽

Object Mask ◽

Fine Tune

Semi-supervised video object segmentation (VOS) has obtained significant progress in recent years. The general purpose of VOS methods is to segment objects in video sequences provided with a single annotation in the first frame. However, many of the recent successful methods heavily fine-tune the object mask in the first frame, which decreases their efficiency. In this work, to address this issue, we propose a symmetry encoder-decoder network with the attention mechanism for video object segmentation (SAVOS) requiring only one forward pass to segment the target object in a video. Specifically, the encoder generates a low-resolution mask with smoothed boundaries, while the decoder further refines the details of the segmentation mask and integrates lower level features progressively. Besides, to obtain accurate segmentation results, we sequentially apply the attention module on multi-scale feature maps for refinement. We conduct several experiments on three challenging datasets (i.e., DAVIS 2016, DAVIS 2017, and SegTrack v2) to show that SAVOS achieves competitive performance against the state-of-the-art.

Download Full-text

U2-ONet: A Two-Level Nested Octave U-Structure Network with a Multi-Scale Attention Mechanism for Moving Object Segmentation

Remote Sensing ◽

10.3390/rs13010060 ◽

2020 ◽

Vol 13 (1) ◽

pp. 60

Author(s):

Chenjie Wang ◽

Chengyuan Li ◽

Jun Liu ◽

Bin Luo ◽

Xin Su ◽

...

Keyword(s):

Moving Objects ◽

Object Segmentation ◽

Contextual Information ◽

Attention Mechanism ◽

Moving Object ◽

Feature Maps ◽

Moving Object Segmentation ◽

Practical Applications ◽

Multi Scale ◽

Spatial Redundancy

Most scenes in practical applications are dynamic scenes containing moving objects, so accurately segmenting moving objects is crucial for many computer vision applications. In order to efficiently segment all the moving objects in the scene, regardless of whether the object has a predefined semantic label, we propose a two-level nested octave U-structure network with a multi-scale attention mechanism, called U2-ONet. U2-ONet takes two RGB frames, the optical flow between these frames, and the instance segmentation of the frames as inputs. Each stage of U2-ONet is filled with the newly designed octave residual U-block (ORSU block) to enhance the ability to obtain more contextual information at different scales while reducing the spatial redundancy of the feature maps. In order to efficiently train the multi-scale deep network, we introduce a hierarchical training supervision strategy that calculates the loss at each level while adding knowledge-matching loss to keep the optimization consistent. The experimental results show that the proposed U2-ONet method can achieve a state-of-the-art performance in several general moving object segmentation datasets.

Download Full-text

Video object segmentation by Multi-Scale Pyramidal Multi-Dimensional LSTM with generated depth context

2016 IEEE International Conference on Image Processing (ICIP) ◽

10.1109/icip.2016.7532363 ◽

2016 ◽

Author(s):

Qiurui Wang ◽

Chun Yuan

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object ◽

Multi Scale

Download Full-text

Joint Attention Mechanism for Unsupervised Video Object Segmentation

10.1007/978-3-030-88004-0_13 ◽

2021 ◽

pp. 154-165

Author(s):

Rui Yao ◽

Xin Xu ◽

Yong Zhou ◽

Jiaqi Zhao ◽

Liang Fang

Keyword(s):

Joint Attention ◽

Object Segmentation ◽

Attention Mechanism ◽

Video Object Segmentation ◽

Video Object

Download Full-text

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

IEEE Transactions on Pattern Analysis and Machine Intelligence ◽

10.1109/tpami.2021.3081597 ◽

2021 ◽

pp. 1-1

Author(s):

Zongxin Yang ◽

Yunchao Wei ◽

Yi Yang

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object ◽

Multi Scale ◽

Collaborative Video

Download Full-text

Video Object Segmentation Based on Location and RoIAlign in Weight Modulated Multi-Scale Network

2020 International Conference on Internet of Things and Intelligent Applications (ITIA) ◽

10.1109/itia50152.2020.9312358 ◽

2020 ◽

Author(s):

Wenqing Luo ◽

Yongzhao Zhan ◽

Qianling Wu

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object ◽

Multi Scale ◽

Scale Network

Download Full-text

Video Object Segmentation Research Based on Features Joint Modeling

Chinese Journal of Computers ◽

10.3724/sp.j.1016.2013.02356 ◽

2014 ◽

Vol 36 (11) ◽

pp. 2356-2363

Author(s):

Zong-Min LI ◽

Xu-Chao GONG ◽

Yu-Jie LIU

Keyword(s):

Object Segmentation ◽

Joint Modeling ◽

Video Object Segmentation ◽

Video Object

Download Full-text

Weakly supervised video object segmentation initialized with referring expression

Neurocomputing ◽

10.1016/j.neucom.2020.06.129 ◽

2020 ◽

Author(s):

XiaoQing Bu ◽

YuKuan Sun ◽

JianMing Wang ◽

KunLiang Liu ◽

JiaYu Liang ◽

...

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object ◽

Weakly Supervised

Download Full-text

A temporal attention based appearance model for video object segmentation

Applied Intelligence ◽

10.1007/s10489-021-02547-4 ◽

2021 ◽

Author(s):

Hui Wang ◽

Weibin Liu ◽

Weiwei Xing

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Appearance Model ◽

Video Object ◽

Temporal Attention

Download Full-text

Hierarchical Embedding Guided Network for Video Object Segmentation

10.1109/icip42928.2021.9506091 ◽

2021 ◽

Author(s):

Chin-Hsuan Shih ◽

Wen-Jiin Tsai

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object

Download Full-text

Mask Selection and Propagation for Unsupervised Video Object Segmentation

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv48630.2021.00172 ◽

2021 ◽

Author(s):

Shubhika Garg ◽

Vidit Goel

Keyword(s):

Object Segmentation ◽

Video Object Segmentation ◽

Video Object

Download Full-text