A magnocellular contribution to conscious object perception via temporal object segmentation

S. C. Goodhew; H. L. Boal; M. Edwards

doi:10.1167/14.10.1334

Motion-Attentive Transition for Zero-Shot Video Object Segmentation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i07.7008 ◽

2020 ◽

Vol 34 (07) ◽

pp. 13066-13073 ◽

Cited By ~ 11

Author(s):

Tianfei Zhou ◽

Shunzhou Wang ◽

Yi Zhou ◽

Yazhou Yao ◽

Jianwu Li ◽

...

Keyword(s):

Object Representation ◽

Object Segmentation ◽

Object Motion ◽

Video Object Segmentation ◽

Video Object ◽

The Arts ◽

Temporal Object ◽

Multi Level ◽

Transition Network ◽

Spatio Temporal

In this paper, we present a novel Motion-Attentive Transition Network (MATNet) for zero-shot video object segmentation, which provides a new way of leveraging motion information to reinforce spatio-temporal object representation. An asymmetric attention block, called Motion-Attentive Transition (MAT), is designed within a two-stream encoder, which transforms appearance features into motion-attentive representations at each convolutional stage. In this way, the encoder becomes deeply interleaved, allowing for closely hierarchical interactions between object motion and appearance. This is superior to the typical two-stream architecture, which treats motion and appearance separately in each stream and often suffers from overfitting to appearance information. Additionally, a bridge network is proposed to obtain a compact, discriminative and scale-sensitive representation for multi-level encoder features, which is further fed into a decoder to achieve segmentation results. Extensive experiments on three challenging public benchmarks (i.e., DAVIS-16, FBMS and Youtube-Objects) show that our model achieves compelling performance against the state-of-the-arts. Code is available at: https://github.com/tfzhou/MATNet.

Download Full-text

Automatic video background replacement using shape-based probabilistic spatio-temporal object segmentation

2007 6th International Conference on Information, Communications & Signal Processing ◽

10.1109/icics.2007.4449725 ◽

2007 ◽

Cited By ~ 1

Author(s):

Rakib Ahmed ◽

Gour C. Karmakar ◽

Laurence S. Dooley

Keyword(s):

Object Segmentation ◽

Temporal Object ◽

Spatio Temporal

Download Full-text

Is all sparing created equal? Comparing lag-1 sparing and extended sparing in temporal object perception.

Journal of Experimental Psychology Human Perception & Performance ◽

10.1037/a0023508 ◽

2011 ◽

Vol 37 (5) ◽

pp. 1527-1541 ◽

Cited By ~ 3

Author(s):

Troy A. W. Visser ◽

Jeneva L. Ohan

Keyword(s):

Object Perception ◽

Temporal Object ◽

Lag 1 Sparing

Download Full-text

A magnocellular contribution to conscious perception via temporal object segmentation.

Journal of Experimental Psychology Human Perception & Performance ◽

10.1037/a0035769 ◽

2014 ◽

Vol 40 (3) ◽

pp. 948-959 ◽

Cited By ~ 5

Author(s):

Stephanie C. Goodhew ◽

Hannah L. Boal ◽

Mark Edwards

Keyword(s):

Object Segmentation ◽

Conscious Perception ◽

Temporal Object

Download Full-text

Weakly Supervised Few-shot Object Segmentation using Co-Attention with Visual and Semantic Embeddings

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/120 ◽

2020 ◽

Author(s):

Mennatullah Siam ◽

Naren Doraiswamy ◽

Boris N. Oreshkin ◽

Hengshuai Yao ◽

Martin Jagersand

Keyword(s):

Semantic Processing ◽

Object Segmentation ◽

State Of The Art ◽

Video Data ◽

Modal Interaction ◽

Shot Segmentation ◽

Bounding Box ◽

Segmentation Methods ◽

Temporal Object ◽

Weakly Supervised

Significant progress has been made recently in developing few-shot object segmentation methods. Learning is shown to be successful in few-shot segmentation settings, using pixel-level, scribbles and bounding box supervision. This paper takes another approach, i.e., only requiring image-level label for few-shot object segmentation. We propose a novel multi-modal interaction module for few-shot object segmentation that utilizes a co-attention mechanism using both visual and word embedding. Our model using image-level labels achieves 4.8% improvement over previously proposed image-level few-shot object segmentation. It also outperforms state-of-the-art methods that use weak bounding box supervision on PASCAL-5^i. Our results show that few-shot segmentation benefits from utilizing word embeddings, and that we are able to perform few-shot segmentation using stacked joint visual semantic processing with weak image-level labels. We further propose a novel setup, Temporal Object Segmentation for Few-shot Learning (TOSFL) for videos. TOSFL can be used on a variety of public video data such as Youtube-VOS, as demonstrated in both instance-level and category-level TOSFL experiments.

Download Full-text