ACSiamRPN: Adaptive Context Sampling for Visual Object Tracking

Electronics ◽  
2020 ◽  
Vol 9 (9) ◽  
pp. 1528
Author(s):  
Xiaofei Qin ◽  
Yipeng Zhang ◽  
Hang Chang ◽  
Hao Lu ◽  
Xuedian Zhang

In visual object tracking, the Siamese tracker based on the region proposal network (SiamRPN) has achieved promising results in both speed and accuracy. However, it does not consider the relationships and differences between the long-range context information of various objects. In this paper, we add a global context block (GC block), which is lightweight and can effectively model long-range dependencies, to the Siamese network part of SiamRPN so that the tracker can better understand the tracking scene. We also propose a novel convolution module, the cropping-inside selective kernel block (CiSK block), based on selective kernel convolution (SK convolution, a module proposed in selective kernel networks), and use it in the region proposal network (RPN) part of SiamRPN, where it adaptively adjusts the size of the receptive field for different types of objects. The CiSK block makes two improvements to SK convolution. First, in the fusion step of SK convolution, we use both global average pooling (GAP) and global maximum pooling (GMP) to enhance global information embedding. Second, after the selection step of SK convolution, we crop out the outermost pixels of the features to reduce the impact of padding operations. Experimental results show that on the OTB100 benchmark, we achieve an accuracy of 0.857 and a success rate of 0.643. On the VOT2016 and VOT2019 benchmarks, we achieve expected average overlap (EAO) scores of 0.394 and 0.240, respectively.
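The two CiSK modifications described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The abstract does not say how GAP and GMP are combined, so summing them is an assumption here. The learned fully connected layers of SK convolution are replaced by hypothetical fixed `branch_scales`, and the function names (`fuse_descriptor`, `cisk_select`) and the `crop` parameter are illustrative only.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a short list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_descriptor(branches, c):
    """Fusion step: GAP + GMP of channel c of the element-wise branch sum.

    Summing the two pooled values is an assumption, not taken from the paper.
    """
    height, width = len(branches[0][c]), len(branches[0][c][0])
    summed = [sum(b[c][i][j] for b in branches)
              for i in range(height) for j in range(width)]
    return sum(summed) / len(summed) + max(summed)


def cisk_select(branches, branch_scales, crop=1):
    """Select step with soft attention across branches, then crop the border.

    branches      -- list of feature maps shaped [C][H][W], one per kernel size
    branch_scales -- hypothetical stand-in for the learned per-branch FC layers
    crop          -- number of outermost pixels removed on each side
    """
    out = []
    for c in range(len(branches[0])):
        s = fuse_descriptor(branches, c)
        # One logit per branch; softmax makes the branch weights sum to 1.
        weights = softmax([scale * s for scale in branch_scales])
        height, width = len(branches[0][c]), len(branches[0][c][0])
        # Weighted mix of the branches, skipping the outermost `crop` pixels.
        mixed = [[sum(w * b[c][i][j] for w, b in zip(weights, branches))
                  for j in range(crop, width - crop)]
                 for i in range(crop, height - crop)]
        out.append(mixed)
    return out


# Two single-channel 4x4 branches standing in for two kernel sizes.
small = [[[float(i * 4 + j) for j in range(4)] for i in range(4)]]
large = [[[1.0] * 4 for _ in range(4)]]
result = cisk_select([small, large], branch_scales=[0.1, -0.1])
print(len(result), len(result[0]), len(result[0][0]))
```

With `crop=1`, each 4x4 channel shrinks to 2x2, which mirrors the idea of discarding border values that are most affected by padding.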

2020 ◽  
Vol 400 ◽  
pp. 53-72
Author(s):  
Jun Wang ◽  
Weibin Liu ◽  
Weiwei Xing ◽  
Liqiang Wang ◽  
Shunli Zhang

2020 ◽  
Vol 50 (7) ◽  
pp. 3068-3080 ◽  
Author(s):  
Jianbing Shen ◽  
Xin Tang ◽  
Xingping Dong ◽  
Ling Shao

2021 ◽  
Vol 11 (4) ◽  
pp. 1963
Author(s):  
Shanshan Luo ◽  
Baoqing Li ◽  
Xiaobing Yuan ◽  
Huawei Liu

The Discriminative Correlation Filter (DCF) is widely recognized in visual object tracking thanks to its excellent accuracy and high speed. Nevertheless, DCF-based trackers perform poorly in long-term tracking, for two reasons: first, they adapt poorly to significant appearance changes and are prone to tracking failure; second, they lack a practical re-detection module to find the target again after tracking failure. In our work, we propose a new long-term tracking strategy to address these issues. First, we exploit both the static and dynamic information of the target by introducing motion features into our long-term tracker, which yields a more robust tracker. Second, we introduce a low-rank sparse dictionary learning method for re-detection. This re-detection module can exploit the correlation among training samples and alleviate the impact of occlusion and noise. Third, we propose a new reliability evaluation method to model an adaptive update, which can switch flexibly between the tracking module and the re-detection module. Extensive experiments demonstrate that our proposed approach clearly improves precision and success rate over state-of-the-art trackers.
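The switching between the tracking module and the re-detection module can be pictured with a small control-loop sketch. The abstract does not define the reliability measure, the threshold, or the update rule, so everything here is a hypothetical placeholder: the callables `track`, `redetect` and `reliability`, the `threshold` value, and the rule of committing the state only when the final box is judged reliable.

```python
def track_sequence(frames, track, redetect, reliability, threshold=0.5):
    """Per-frame switch between a tracker and a re-detector.

    All callables and the threshold are assumptions for illustration;
    the paper's actual reliability evaluation is not reproduced here.
    """
    results = []
    state = None  # last trusted target estimate
    for frame in frames:
        box = track(frame, state)
        mode = "track"
        if reliability(frame, box) < threshold:
            # The tracker output is judged unreliable: fall back to re-detection.
            box = redetect(frame)
            mode = "redetect"
        # Adaptive update: only commit the estimate when it is judged reliable.
        if reliability(frame, box) >= threshold:
            state = box
        results.append((mode, box))
    return results


# Stub modules: each "frame" is just a number used as the tracker's reliability.
frames = [0.9, 0.2, 0.8]
demo = track_sequence(
    frames,
    track=lambda frame, state: ("t", frame),
    redetect=lambda frame: ("r", frame),
    reliability=lambda frame, box: frame if box[0] == "t" else 0.9,
)
print([mode for mode, _ in demo])
```

In this toy run only the second frame falls below the threshold, so that is the only frame handed to the re-detector.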

