Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

Mustansar Fiaz; Arif Mahmood; Soon Ki Jung

doi:10.3390/s20144021

Learning Soft Mask Based Feature Fusion with Channel and Spatial Attention for Robust Visual Object Tracking

Sensors ◽

10.3390/s20144021 ◽

2020 ◽

Vol 20 (14) ◽

pp. 4021 ◽

Cited By ~ 2

Author(s):

Mustansar Fiaz ◽

Arif Mahmood ◽

Soon Ki Jung

Keyword(s):

Object Tracking ◽

Spatial Attention ◽

Feature Fusion ◽

State Of The Art ◽

Feature Representation ◽

Visual Object ◽

Target Feature ◽

Visual Object Tracking ◽

Low Level ◽

Benchmark Datasets

We propose to improve the visual object tracking by introducing a soft mask based low-level feature fusion technique. The proposed technique is further strengthened by integrating channel and spatial attention mechanisms. The proposed approach is integrated within a Siamese framework to demonstrate its effectiveness for visual object tracking. The proposed soft mask is used to give more importance to the target regions as compared to the other regions to enable effective target feature representation and to increase discriminative power. The low-level feature fusion improves the tracker robustness against distractors. The channel attention is used to identify more discriminative channels for better target representation. The spatial attention complements the soft mask based approach to better localize the target objects in challenging tracking scenarios. We evaluated our proposed approach over five publicly available benchmark datasets and performed extensive comparisons with 39 state-of-the-art tracking algorithms. The proposed tracker demonstrates excellent performance compared to the existing state-of-the-art trackers.

Download Full-text

Visual Object Tracking in RGB-D Data via Genetic Feature Learning

Complexity ◽

10.1155/2019/4539410 ◽

2019 ◽

Vol 2019 ◽

pp. 1-8

Author(s):

Ming-xin Jiang ◽

Xian-xian Luo ◽

Tao Hai ◽

Hai-yan Wang ◽

Song Yang ◽

...

Keyword(s):

Object Tracking ◽

Feature Learning ◽

Feature Representation ◽

Visual Object ◽

Visual Object Tracking ◽

Fitness Evaluation ◽

Candidate Solution ◽

Genetic Feature ◽

Computer Vision Applications ◽

Crossover And Mutation

Visual object tracking is a fundamental component in many computer vision applications. Extracting robust features of object is one of the most important steps in tracking. As trackers, only formulated on RGB data, are usually affected by occlusions, appearance, or illumination variations, we propose a novel RGB-D tracking method based on genetic feature learning in this paper. Our approach addresses feature learning as an optimization problem. As owning the advantage of parallel computing, genetic algorithm (GA) has fast speed of convergence and excellent global optimization performance. At the same time, unlike handcrafted feature and deep learning methods, GA can be employed to solve the problem of feature representation without prior knowledge, and it has no use for a large number of parameters to be learned. The candidate solution in RGB or depth modality is represented as an encoding of an image in GA, and genetic feature is learned through population initialization, fitness evaluation, selection, crossover, and mutation. The proposed RGB-D tracker is evaluated on popular benchmark dataset, and experimental results indicate that our method achieves higher accuracy and faster tracking speed.

Download Full-text

Visual object tracking based on adaptive multi-feature fusion in complex scenarios

Eleventh International Conference on Digital Image Processing (ICDIP 2019) ◽

10.1117/12.2540112 ◽

2019 ◽

Author(s):

WANG HENGJUN

Keyword(s):

Object Tracking ◽

Feature Fusion ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

The State-of-the-Art in Handling Occlusions for Visual Object Tracking

IEICE Transactions on Information and Systems ◽

10.1587/transinf.2014edr0002 ◽

2015 ◽

Vol E98.D (7) ◽

pp. 1260-1274 ◽

Cited By ~ 10

Author(s):

Kourosh MESHGI ◽

Shin ISHII

Keyword(s):

Object Tracking ◽

State Of The Art ◽

The State ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Low-Rank Multi-Channel Features for Robust Visual Object Tracking

Symmetry ◽

10.3390/sym11091155 ◽

2019 ◽

Vol 11 (9) ◽

pp. 1155 ◽

Cited By ~ 3

Author(s):

Fawad ◽

Muhammad Jamil Khan ◽

MuhibUr Rahman ◽

Yasar Amin ◽

Hannu Tenhunen

Keyword(s):

Computational Complexity ◽

Object Tracking ◽

State Of The Art ◽

Color Naming ◽

Circulant Matrix ◽

Low Rank ◽

Support Vector ◽

Visual Object ◽

Visual Object Tracking ◽

Kernel Correlation

Kernel correlation filters (KCF) demonstrate significant potential in visual object tracking by employing robust descriptors. Proper selection of color and texture features can provide robustness against appearance variations. However, the use of multiple descriptors would lead to a considerable feature dimension. In this paper, we propose a novel low-rank descriptor, that provides better precision and success rate in comparison to state-of-the-art trackers. We accomplished this by concatenating the magnitude component of the Overlapped Multi-oriented Tri-scale Local Binary Pattern (OMTLBP), Robustness-Driven Hybrid Descriptor (RDHD), Histogram of Oriented Gradients (HoG), and Color Naming (CN) features. We reduced the rank of our proposed multi-channel feature to diminish the computational complexity. We formulated the Support Vector Machine (SVM) model by utilizing the circulant matrix of our proposed feature vector in the kernel correlation filter. The use of discrete Fourier transform in the iterative learning of SVM reduced the computational complexity of our proposed visual tracking algorithm. Extensive experimental results on Visual Tracker Benchmark dataset show better accuracy in comparison to other state-of-the-art trackers.

Download Full-text

Visual Object Tracking Based on Mutual Learning between Cohort Multiscale Feature-Fusion Networks with Weighted Loss

IEEE Transactions on Circuits and Systems for Video Technology ◽

10.1109/tcsvt.2020.2994744 ◽

2020 ◽

pp. 1-1

Author(s):

Jiaojiao Fang ◽

Guizhong Liu

Keyword(s):

Object Tracking ◽

Feature Fusion ◽

Visual Object ◽

Visual Object Tracking ◽

Mutual Learning

Download Full-text

Visual perception based adaptive feature fusion for visual object tracking

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) ◽

10.1109/smc.2017.8122800 ◽

2017 ◽

Author(s):

Evan Krieger ◽

Vijayan K. Asari

Keyword(s):

Visual Perception ◽

Object Tracking ◽

Feature Fusion ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Visual Object Tracking Robust to Illumination Variation Based on Hyperline Clustering

Information ◽

10.3390/info10010026 ◽

2019 ◽

Vol 10 (1) ◽

pp. 26 ◽

Cited By ~ 2

Author(s):

Senquan Yang ◽

Yuan Xie ◽

Pu Li ◽

Haoxiang Wen ◽

Huan Luo ◽

...

Keyword(s):

State Of The Art ◽

Shape Deformation ◽

Visual Object ◽

Excellent Performance ◽

Illumination Variation ◽

Visual Object Tracking ◽

Discriminant Model ◽

Lower Accuracy ◽

Benchmark Datasets ◽

Online Tracking

Color histogram-based trackers have obtained excellent performance against many challenging situations. However, since the appearance of color is sensitive to illumination, they tend to achieve lower accuracy when illumination is severely variant throughout a sequence. To overcome this limitation, we propose a novel hyperline clustering based discriminant model, an illumination invariant model that is able to distinguish the object from its surrounding background. Furthermore, we exploit this model and propose an anchor based scale estimation to cope with shape deformation and scale variation. Numerous experiments on recent online tracking benchmark datasets demonstrate that our approach achieve favorable performance compared with several state-of-the-art tracking algorithms. In particular, our approach achieves higher accuracy than comparative methods in the illumination variant and shape deformation challenging situations.

Download Full-text

Siamese High-Level Feature Refine Network for Visual Object Tracking

Electronics ◽

10.3390/electronics9111918 ◽

2020 ◽

Vol 9 (11) ◽

pp. 1918 ◽

Cited By ~ 1

Author(s):

Md. Maklachur Rahman ◽

Md Rishad Ahmed ◽

Lamyanba Laishram ◽

Seock Ho Kim ◽

Soon Ki Jung

Keyword(s):

Visual Tracking ◽

Feature Representation ◽

Visual Object ◽

Target Feature ◽

Discriminative Ability ◽

Visual Object Tracking ◽

Discrimination Ability ◽

Proposed Model ◽

Real Time Tracking ◽

High Level

Siamese network-based trackers are broadly applied to solve visual tracking problems due to its balanced performance in terms of speed and accuracy. Tracking desired objects in challenging scenarios is still one of the fundamental concerns during visual tracking. This research paper proposes a feature refined end-to-end tracking framework with real-time tracking speed and considerable performance. The feature refine network has been incorporated to enhance the target feature representation power, utilizing high-level semantic information. Besides, it allows the network to capture the salient information to locate the target and learns to represent the target feature in a more generalized way advancing the overall tracking performance, particularly in the challenging sequences. But, only the feature refine module is unable to handle such challenges because of its less discriminative ability. To overcome this difficulty, we employ an attention module inside the feature refine network that strengths the tracker discrimination ability between the target and background. Furthermore, we conduct extensive experiments to ensure the proposed tracker’s effectiveness using several popular tracking benchmarks, demonstrating that our proposed model achieves state-of-the-art performance over other trackers.

Download Full-text

Distractor-Aware Deep Regression for Visual Tracking

Sensors ◽

10.3390/s19020387 ◽

2019 ◽

Vol 19 (2) ◽

pp. 387 ◽

Cited By ~ 1

Author(s):

Ming Du ◽

Yan Ding ◽

Xiuyun Meng ◽

Hua-Liang Wei ◽

Yifan Zhao

Keyword(s):

Object Tracking ◽

Visual Tracking ◽

Test Data ◽

Loss Function ◽

State Of The Art ◽

Target Object ◽

Visual Object ◽

Visual Object Tracking ◽

Training Samples ◽

Better Than

In recent years, regression trackers have drawn increasing attention in the visual-object tracking community due to their favorable performance and easy implementation. The tracker algorithms directly learn mapping from dense samples around the target object to Gaussian-like soft labels. However, in many real applications, when applied to test data, the extreme imbalanced distribution of training samples usually hinders the robustness and accuracy of regression trackers. In this paper, we propose a novel effective distractor-aware loss function to balance this issue by highlighting the significant domain and by severely penalizing the pure background. In addition, we introduce a full differentiable hierarchy-normalized concatenation connection to exploit abstractions across multiple convolutional layers. Extensive experiments were conducted on five challenging benchmark-tracking datasets, that is, OTB-13, OTB-15, TC-128, UAV-123, and VOT17. The experimental results are promising and show that the proposed tracker performs much better than nearly all the compared state-of-the-art approaches.

Download Full-text

Robust Visual Object Tracking using Context-Based Spatial Variation via Multi-Feature Fusion

Information Sciences ◽

10.1016/j.ins.2021.06.084 ◽

2021 ◽

Author(s):

Dinesh Elayaperumal ◽

Young Hoon Joo

Keyword(s):

Spatial Variation ◽

Object Tracking ◽

Feature Fusion ◽

Visual Object ◽

Visual Object Tracking

Download Full-text