A Robust Visual Tracking Algorithm Based on Spatial-Temporal Context Hierarchical Response Fusion

Wancheng Zhang; Yanmin Luo; Zhi Chen; Yongzhao Du; Daxin Zhu; Peizhong Liu

doi:10.3390/a12010008

A Robust Visual Tracking Algorithm Based on Spatial-Temporal Context Hierarchical Response Fusion

Algorithms ◽

10.3390/a12010008 ◽

2018 ◽

Vol 12 (1) ◽

pp. 8 ◽

Cited By ~ 2

Author(s):

Wancheng Zhang ◽

Yanmin Luo ◽

Zhi Chen ◽

Yongzhao Du ◽

Daxin Zhu ◽

...

Keyword(s):

Visual Tracking ◽

Correlation Filter ◽

Temporal Context ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Illumination Changes ◽

Model Update ◽

Benchmark Datasets ◽

Hierarchical Features

Discriminative correlation filters (DCFs) have been shown to perform superiorly in visual object tracking. However, visual tracking is still challenging when the target objects undergo complex scenarios such as occlusion, deformation, scale changes and illumination changes. In this paper, we utilize the hierarchical features of convolutional neural networks (CNNs) and learn a spatial-temporal context correlation filter on convolutional layers. Then, the translation is estimated by fusing the response score of the filters on the three convolutional layers. In terms of scale estimation, we learn a discriminative correlation filter to estimate scale from the best confidence results. Furthermore, we proposed a re-detection activation discrimination method to improve the robustness of visual tracking in the case of tracking failure and an adaptive model update method to reduce tracking drift caused by noisy updates. We evaluate the proposed tracker with DCFs and deep features on OTB benchmark datasets. The tracking results demonstrated that the proposed algorithm is superior to several state-of-the-art DCF methods in terms of accuracy and robustness.

Download Full-text

Visual Tracking Based on Complementary Learners with Distractor Handling

Mathematical Problems in Engineering ◽

10.1155/2017/5295601 ◽

2017 ◽

Vol 2017 ◽

pp. 1-13 ◽

Cited By ~ 2

Author(s):

Suryo Adhi Wibowo ◽

Hansoo Lee ◽

Eun Kyeong Kim ◽

Sungshin Kim

Keyword(s):

Visual Tracking ◽

Object Representation ◽

Target Location ◽

Target Object ◽

Tracking Algorithm ◽

Color Histogram ◽

Correlation Filter ◽

Visual Object ◽

Visual Object Tracking ◽

Benchmark Datasets

The representation of the object is an important factor in building a robust visual object tracking algorithm. To resolve this problem, complementary learners that use color histogram- and correlation filter-based representation to represent the target object can be used since they each have advantages that can be exploited to compensate the other’s drawback in visual tracking. Further, a tracking algorithm can fail because of the distractor, even when complementary learners have been implemented for the target object representation. In this study, we show that, in order to handle the distractor, first the distractor must be detected by learning the responses from the color-histogram- and correlation-filter-based representation. Then, to determine the target location, we can decide whether the responses from each representation should be merged or only the response from the correlation filter should be used. This decision depends on the result obtained from the distractor detection process. Experiments were performed on the widely used VOT2014 and VOT2015 benchmark datasets. It was verified that our proposed method performs favorably as compared with several state-of-the-art visual tracking algorithms.

Download Full-text

Parallel Correlation Filters for Real-Time Visual Tracking

Sensors ◽

10.3390/s19102362 ◽

2019 ◽

Vol 19 (10) ◽

pp. 2362 ◽

Cited By ~ 5

Author(s):

Yijin Yang ◽

Yihong Zhang ◽

Demin Li ◽

Zhijie Wang

Keyword(s):

Object Tracking ◽

Real Time ◽

Research Field ◽

Tracking Performance ◽

Correlation Filter ◽

Visual Object ◽

Correlation Filters ◽

Illumination Variation ◽

Visual Object Tracking ◽

Appearance Changes

Correlation filter-based methods have recently performed remarkably well in terms of accuracy and speed in the visual object tracking research field. However, most existing correlation filter-based methods are not robust to significant appearance changes in the target, especially when the target undergoes deformation, illumination variation, and rotation. In this paper, a novel parallel correlation filters (PCF) framework is proposed for real-time visual object tracking. Firstly, the proposed method constructs two parallel correlation filters, one for tracking the appearance changes in the target, and the other for tracking the translation of the target. Secondly, through weighted merging the response maps of these two parallel correlation filters, the proposed method accurately locates the center position of the target. Finally, in the training stage, a new reasonable distribution of the correlation output is proposed to replace the original Gaussian distribution to train more accurate correlation filters, which can prevent the model from drifting to achieve excellent tracking performance. The extensive qualitative and quantitative experiments on the common object tracking benchmarks OTB-2013 and OTB-2015 have demonstrated that the proposed PCF tracker outperforms most of the state-of-the-art trackers and achieves a high real-time tracking performance.

Download Full-text

CFNN: Correlation Filter Neural Network for Visual Object Tracking

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/309 ◽

2017 ◽

Cited By ~ 2

Author(s):

Yang Li ◽

Zhan Xu ◽

Jianke Zhu

Keyword(s):

Neural Network ◽

Visual Tracking ◽

Network Architecture ◽

Back Propagation ◽

Correlation Filter ◽

Visual Object ◽

Neural Network Architecture ◽

Visual Object Tracking ◽

Single Target ◽

Wide Range

Albeit convolutional neural network (CNN) has shown promising capacity in many computer vision tasks, applying it to visual tracking is yet far from solved. Existing methods either employ a large external dataset to undertake exhaustive pre-training or suffer from less satisfactory results in terms of accuracy and robustness. To track single target in a wide range of videos, we present a novel Correlation Filter Neural Network architecture, as well as a complete visual tracking pipeline, The proposed approach is a special case of CNN, whose initialization does not need any pre-training on the external dataset. The initialization of network enjoys the merits of cyclic sampling to achieve the appealing discriminative capability, while the network updating scheme adopts advantages from back-propagation in order to capture new appearance variations. The tracking pipeline integrates both aspects well by making them complementary to each other. We validate our tracker on OTB-2013 benchmark. The proposed tracker obtains the promising results compared to most of existing representative trackers.

Download Full-text

Motion-Aware Correlation Filters for Online Visual Tracking

Sensors ◽

10.3390/s18113937 ◽

2018 ◽

Vol 18 (11) ◽

pp. 3937 ◽

Cited By ~ 5

Author(s):

Yihong Zhang ◽

Yijin Yang ◽

Wuneng Zhou ◽

Lifeng Shi ◽

Demin Li

Keyword(s):

Object Tracking ◽

Visual Tracking ◽

Model Updating ◽

Visual Object ◽

Adaptive Model ◽

Correlation Filters ◽

Current Frame ◽

Visual Object Tracking ◽

Fast Motion ◽

Better Than

The discriminative correlation filters-based methods struggle deal with the problem of fast motion and heavy occlusion, the problem can severely degrade the performance of trackers, ultimately leading to tracking failures. In this paper, a novel Motion-Aware Correlation Filters (MACF) framework is proposed for online visual object tracking, where a motion-aware strategy based on joint instantaneous motion estimation Kalman filters is integrated into the Discriminative Correlation Filters (DCFs). The proposed motion-aware strategy is used to predict the possible region and scale of the target in the current frame by utilizing the previous estimated 3D motion information. Obviously, this strategy can prevent model drift caused by fast motion. On the base of the predicted region and scale, the MACF detects the position and scale of the target by using the DCFs-based method in the current frame. Furthermore, an adaptive model updating strategy is proposed to address the problem of corrupted models caused by occlusions, where the learning rate is determined by the confidence of the response map. The extensive experiments on popular Object Tracking Benchmark OTB-100, OTB-50 and unmanned aerial vehicles (UAV) video have demonstrated that the proposed MACF tracker performs better than most of the state-of-the-art trackers and achieves a high real-time performance. In addition, the proposed approach can be integrated easily and flexibly into other visual tracking algorithms.

Download Full-text

Robust Scale Adaptive Visual Tracking with Correlation Filters

Applied Sciences ◽

10.3390/app8112037 ◽

2018 ◽

Vol 8 (11) ◽

pp. 2037 ◽

Cited By ~ 1

Author(s):

Chunbao Li ◽

Bo Yang

Keyword(s):

Visual Tracking ◽

State Of The Art ◽

Estimation Method ◽

Color Naming ◽

Target Object ◽

Correlation Filter ◽

Correlation Filters ◽

Object Proposals ◽

Benchmark Datasets ◽

Candidate Object

Visual tracking is a challenging task in computer vision due to various appearance changes of the target object. In recent years, correlation filter plays an important role in visual tracking and many state-of-the-art correlation filter based trackers are proposed in the literature. However, these trackers still have certain limitations. Most of existing trackers cannot well deal with scale variation, and they may easily drift to the background in the case of occlusion. To overcome the above problems, we propose a Correlation Filters based Scale Adaptive (CFSA) visual tracker. In the tracker, a modified EdgeBoxes generator, is proposed to generate high-quality candidate object proposals for tracking. The pool of generated candidate object proposals is adopted to estimate the position of the target object using a kernelized correlation filter based tracker with HOG and color naming features. In order to deal with changes in target scale, a scale estimation method is proposed by combining the water flow driven MBD (minimum barrier distance) algorithm with the estimated position. Furthermore, an online updating schema is adopted to reduce the interference of the surrounding background. Experimental results on two large benchmark datasets demonstrate that the CFSA tracker achieves favorable performance compared with the state-of-the-art trackers.

Download Full-text

Adaptive Channel Selection for Robust Visual Object Tracking with Discriminative Correlation Filters

International Journal of Computer Vision ◽

10.1007/s11263-021-01435-1 ◽

2021 ◽

Author(s):

Tianyang Xu ◽

Zhenhua Feng ◽

Xiao-Jun Wu ◽

Josef Kittler

Keyword(s):

Object Tracking ◽

Augmented Lagrangian Method ◽

Channel Selection ◽

Image Feature ◽

Superior Performance ◽

Appearance Model ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Feature Representations

AbstractDiscriminative Correlation Filters (DCF) have been shown to achieve impressive performance in visual object tracking. However, existing DCF-based trackers rely heavily on learning regularised appearance models from invariant image feature representations. To further improve the performance of DCF in accuracy and provide a parsimonious model from the attribute perspective, we propose to gauge the relevance of multi-channel features for the purpose of channel selection. This is achieved by assessing the information conveyed by the features of each channel as a group, using an adaptive group elastic net inducing independent sparsity and temporal smoothness on the DCF solution. The robustness and stability of the learned appearance model are significantly enhanced by the proposed method as the process of channel selection performs implicit spatial regularisation. We use the augmented Lagrangian method to optimise the discriminative filters efficiently. The experimental results obtained on a number of well-known benchmarking datasets demonstrate the effectiveness and stability of the proposed method. A superior performance over the state-of-the-art trackers is achieved using less than $$10\%$$ 10 % deep feature channels.

Download Full-text

Aberrance suppressed spatio-temporal correlation filters for visual object tracking

Pattern Recognition ◽

10.1016/j.patcog.2021.107922 ◽

2021 ◽

Vol 115 ◽

pp. 107922

Author(s):

Dinesh Elayaperumal ◽

Young Hoon Joo

Keyword(s):

Object Tracking ◽

Temporal Correlation ◽

Visual Object ◽

Correlation Filters ◽

Visual Object Tracking ◽

Spatio Temporal

Download Full-text

Spatio-Temporal Context, Correlation Filter and Measurement Estimation Collaboration Based Visual Object Tracking

Sensors ◽

10.3390/s21082841 ◽

2021 ◽

Vol 21 (8) ◽

pp. 2841

Author(s):

Khizer Mehmood ◽

Abdul Jalil ◽

Ahmad Ali ◽

Baber Khan ◽

Maria Murad ◽

...

Keyword(s):

Kalman Filter ◽

Object Tracking ◽

Environmental Changes ◽

Correlation Filter ◽

Temporal Context ◽

Visual Object ◽

Target Model ◽

Change Of Scale ◽

Tracking Model ◽

Spatio Temporal

Despite eminent progress in recent years, various challenges associated with object tracking algorithms such as scale variations, partial or full occlusions, background clutters, illumination variations are still required to be resolved with improved estimation for real-time applications. This paper proposes a robust and fast algorithm for object tracking based on spatio-temporal context (STC). A pyramid representation-based scale correlation filter is incorporated to overcome the STC’s inability on the rapid change of scale of target. It learns appearance induced by variations in the target scale sampled at a different set of scales. During occlusion, most correlation filter trackers start drifting due to the wrong update of samples. To prevent the target model from drift, an occlusion detection and handling mechanism are incorporated. Occlusion is detected from the peak correlation score of the response map. It continuously predicts target location during occlusion and passes it to the STC tracking model. After the successful detection of occlusion, an extended Kalman filter is used for occlusion handling. This decreases the chance of tracking failure as the Kalman filter continuously updates itself and the tracking model. Further improvement to the model is provided by fusion with average peak to correlation energy (APCE) criteria, which automatically update the target model to deal with environmental changes. Extensive calculations on the benchmark datasets indicate the efficacy of the proposed tracking method with state of the art in terms of performance analysis.

Download Full-text

Multipath Based Correlation Filter for Visual Object Tracking

Lecture Notes in Computer Science - Pattern Recognition and Machine Intelligence ◽

10.1007/978-3-030-34872-4_54 ◽

2019 ◽

pp. 490-498

Author(s):

Himadri Sekhar Bhunia ◽

Alok Kanti Deb ◽

Jayanta Mukhopadhyay

Keyword(s):

Object Tracking ◽

Correlation Filter ◽

Visual Object ◽

Visual Object Tracking

Download Full-text

Learning spatio-temporal context via hierarchical features for visual tracking

Signal Processing Image Communication ◽

10.1016/j.image.2018.04.010 ◽

2018 ◽

Vol 66 ◽

pp. 50-65 ◽

Cited By ~ 1

Author(s):

Yi Cao ◽

Hongbing Ji ◽

Wenbo Zhang ◽

Fei Xue

Keyword(s):

Visual Tracking ◽

Temporal Context ◽

Spatio Temporal ◽

Hierarchical Features

Download Full-text