Object Modelling and Tracking in Videos via Multidimensional Features

2011 ◽  
Vol 2011 ◽  
pp. 1-15 ◽  
Author(s):  
Zhuhan Jiang

We propose to model a tracked object in a video sequence by locating a list of object features that are ranked according to their ability to differentiate the object from the image background. Bayesian inference is utilised to derive the probabilistic location of the object in the current frame, with the prior approximated from the previous frame and the posterior obtained via the current pixel distribution of the object. Consideration has also been given to a number of relevant aspects of object tracking, including multidimensional features and the mixture of colours, textures, and object motion. Experiments with the proposed method on video sequences have shown its effectiveness in capturing the target against a moving background and under nonrigid object motion.
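As a minimal illustration of the Bayesian update described above (the 1D grid, prior width, and likelihood shape are our own toy choices, not the paper's):

```python
import numpy as np

def bayesian_update(prior, likelihood):
    """Posterior over candidate locations: prior times likelihood, renormalized."""
    posterior = prior * likelihood
    return posterior / posterior.sum()

# 1D grid of candidate object positions (toy example)
xs = np.arange(10)

# prior: approximated from the previous frame (peaked near the old location, x = 4)
prior = np.exp(-0.5 * ((xs - 4) / 1.5) ** 2)
prior /= prior.sum()

# likelihood: how well each candidate matches the object's current pixel distribution
likelihood = np.exp(-0.5 * ((xs - 6) / 2.0) ** 2)

posterior = bayesian_update(prior, likelihood)
best = int(np.argmax(posterior))  # MAP estimate, pulled between prior and likelihood
```

The MAP location lands between the prior peak and the likelihood peak, which is exactly the behaviour the abstract describes: the previous frame constrains the search while the current pixel distribution refines it.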

Author(s):  
Lipeng Gu ◽  
Shaoyuan Sun ◽  
Xunhua Liu ◽  
Xiang Li

Abstract Compared with 2D multi-object tracking algorithms, 3D multi-object tracking algorithms have greater research significance and broader application prospects in the field of unmanned vehicle research. Aiming at the problem of 3D multi-object detection and tracking, this paper improves the multi-object tracker CenterTrack, which focuses on the 2D multi-object tracking task while ignoring object 3D information, mainly in two respects, detection and tracking; the improved network is called CenterTrack3D. In terms of detection, CenterTrack3D uses an attention mechanism to optimize the way the previous-frame image and the heatmap of previous-frame tracklets are added to the current-frame image as input, and the second convolutional layer of the output head is replaced by a dynamic convolution layer, which further improves the ability to detect occluded objects. In terms of tracking, a cascaded data association algorithm based on a 3D Kalman filter is proposed to make full use of the 3D information of objects in the image and increase the robustness of the 3D multi-object tracker. The experimental results show that, compared with the original CenterTrack and existing 3D multi-object tracking methods, CenterTrack3D achieves 88.75% MOTA for cars and 59.40% MOTA for pedestrians and is very competitive on the KITTI tracking benchmark test set.
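The 3D Kalman filter at the heart of such a cascaded association step can be sketched as a constant-velocity filter over 3D position (the state layout, noise levels, and class name below are generic textbook choices, not CenterTrack3D's actual parameters):

```python
import numpy as np

class Kalman3D:
    """Constant-velocity Kalman filter over 3D position (illustrative sketch)."""

    def __init__(self, xyz):
        self.x = np.zeros(6)                                # state: [x, y, z, vx, vy, vz]
        self.x[:3] = xyz
        self.P = np.eye(6)                                  # state covariance
        self.F = np.eye(6)
        self.F[:3, 3:] = np.eye(3)                          # position += velocity per step
        self.H = np.hstack([np.eye(3), np.zeros((3, 3))])   # only position is observed
        self.Q = np.eye(6) * 1e-2                           # process noise
        self.R = np.eye(3) * 1e-1                           # measurement noise

    def predict(self):
        self.x = self.F @ self.x
        self.P = self.F @ self.P @ self.F.T + self.Q
        return self.x[:3]

    def update(self, z):
        y = z - self.H @ self.x                             # innovation
        S = self.H @ self.P @ self.H.T + self.R
        K = self.P @ self.H.T @ np.linalg.inv(S)            # Kalman gain
        self.x = self.x + K @ y
        self.P = (np.eye(6) - K @ self.H) @ self.P
        return self.x[:3]

# object detected moving along the x axis at one unit per frame
kf = Kalman3D([0.0, 0.0, 0.0])
for t in range(1, 5):
    kf.predict()
    est = kf.update(np.array([t * 1.0, 0.0, 0.0]))
```

After a few frames the filter has both locked on to the position and inferred the velocity, which is what makes the predicted box useful for associating detections across frames.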


Author(s):  
B. A. Zalesky

The algorithm ACT (Adaptive Color Tracker), which tracks objects with a moving video camera, is presented. One feature of the algorithm is the adaptation of the feature set of the tracked object to the background of the current frame. At each step, the algorithm extracts from the object features those that are most specific to the object and at the same time least specific to the current frame background, since the remaining object features not only do not contribute to separating the tracked object from the background but also impede its correct detection. The features of the object and background are formed from the color representations of scenes and can be computed in two ways. The first way uses the 3D color vectors of the object and background images clustered by a fast version of the well-known k-means algorithm. The second way consists in a simpler and faster partitioning of the RGB color space into 3D parallelepipeds, with the color of each pixel subsequently replaced by the average of all colors belonging to the same parallelepiped as the pixel's color. Another specificity of the algorithm is its simplicity, which allows it to run on small mobile computers such as the Jetson TX1 or TX2. The algorithm was tested on video sequences captured by various camcorders, as well as on the well-known TV77 data set containing 77 different tagged video sequences. The tests have shown the efficiency of the algorithm: on the test images, its accuracy and speed exceed those of the trackers implemented in the computer vision library OpenCV 4.1.
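The second, faster color-reduction variant can be sketched as follows (the 4x4x4 bin count and function name are our illustrative choices; the abstract does not specify the partition granularity):

```python
import numpy as np

def quantize_colors(image, bins=4):
    """Split the RGB cube into bins^3 equal parallelepipeds and replace each
    pixel's color with the mean color of the pixels in its parallelepiped."""
    h, w, _ = image.shape
    pixels = image.reshape(-1, 3).astype(np.float64)
    idx = (pixels // (256 // bins)).astype(int)            # 3D bin index per pixel
    flat = idx[:, 0] * bins * bins + idx[:, 1] * bins + idx[:, 2]
    out = np.empty_like(pixels)
    for b in np.unique(flat):
        mask = flat == b
        out[mask] = pixels[mask].mean(axis=0)              # replace with bin average
    return out.reshape(h, w, 3)

# tiny 2x2 image: two dark-gray pixels share a bin, the others sit alone
img = np.array([[[10, 10, 10], [20, 20, 20]],
                [[200, 200, 200], [250, 0, 0]]], dtype=np.uint8)
q = quantize_colors(img)
```

Because the bin boundaries are fixed, this needs no clustering iterations, which is why it is the cheaper alternative to k-means on small embedded boards.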


Author(s):  
R. Bahmanyar ◽  
S. M. Azimi ◽  
P. Reinartz

Abstract. Geo-referenced real-time vehicle and person tracking in aerial imagery has a variety of applications, such as traffic and large-scale event monitoring, disaster management, and input to predictive traffic and crowd models. However, object tracking in aerial imagery is still an unsolved and challenging problem due to the tiny size of the objects, the different scales, and the limited temporal resolution of geo-referenced datasets. In this work, we propose a new approach based on Convolutional Neural Networks (CNNs) to track multiple vehicles and people in aerial image sequences. As the large number of objects in aerial images can exponentially increase the processing demands in multiple-object tracking scenarios, the proposed approach utilizes a stack of micro CNNs, where each micro CNN is responsible for a single-object tracking task. We call our approach Stack of Micro Single-Object-Tracking CNNs (SMSOT-CNN). More precisely, using a two-stream CNN, we extract a set of features from two consecutive frames for each object, given the location of the object in the previous frame. Then, we feed each micro CNN the extracted features of its object to predict the object's location in the current frame. We train and validate the proposed approach on the vehicle and person sets of the KIT AIS dataset of object tracking in aerial image sequences. Results indicate accurate and time-efficient tracking of multiple vehicles and people by the proposed approach.
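The "one lightweight tracker per object" dispatch pattern can be sketched with plain template matching standing in for each micro CNN (the SSD matcher, window sizes, and all names below are our simplifications, not the paper's network):

```python
import numpy as np

def track_single(prev_frame, cur_frame, loc, size=3, search=2):
    """One 'micro tracker': given the object's previous location, search a small
    window in the current frame for the best-matching patch (SSD similarity)."""
    y, x = loc
    tmpl = prev_frame[y:y + size, x:x + size]
    best, best_score = loc, -np.inf
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            yy, xx = y + dy, x + dx
            if yy < 0 or xx < 0 or yy + size > cur_frame.shape[0] \
                    or xx + size > cur_frame.shape[1]:
                continue
            cand = cur_frame[yy:yy + size, xx:xx + size]
            score = -np.sum((cand - tmpl) ** 2)   # negative SSD: higher is better
            if score > best_score:
                best_score, best = score, (yy, xx)
    return best

def track_all(prev_frame, cur_frame, locations):
    # one independent tracker call per object, mirroring the stacked design
    return [track_single(prev_frame, cur_frame, loc) for loc in locations]

# toy frames: a 3x3 bright patch moves from (2, 2) to (3, 3)
prev = np.zeros((10, 10)); prev[2:5, 2:5] = 1.0
cur = np.zeros((10, 10)); cur[3:6, 3:6] = 1.0
new_locs = track_all(prev, cur, [(2, 2)])
```

The point of the sketch is the structure, not the matcher: each object gets its own small, independent tracking computation conditioned on its previous location, so the per-object cost stays constant as the object count grows.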


Video analysis plays a vital role in commercial applications, sports, and military systems, and various methods are presented in the literature. The mean shift algorithm is presented in this paper for basketball tracking because it is more efficient than other histogram-based methods. Tracking is the important block in the detection and recognition of the basketball. Different object tracking algorithms are investigated. Tracking performance is evaluated on two video sequences, and the method achieves 91.3% precision on video sequence 1 and 93.6% on sequence 2.
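A minimal sketch of the mean shift iteration on a back-projection weight map (window radius, grid size, and convergence threshold are illustrative choices, not taken from the paper):

```python
import numpy as np

def mean_shift(weights, start, radius=3, max_iter=20):
    """Shift a square window to the weighted centroid of the back-projection
    map under it, repeating until the shift is below half a pixel."""
    cy, cx = start
    ys, xs = np.mgrid[0:weights.shape[0], 0:weights.shape[1]]
    for _ in range(max_iter):
        mask = (np.abs(ys - cy) <= radius) & (np.abs(xs - cx) <= radius)
        w = weights * mask
        total = w.sum()
        if total == 0:
            break                                     # window fell off the target
        ny, nx = (ys * w).sum() / total, (xs * w).sum() / total
        if abs(ny - cy) < 0.5 and abs(nx - cx) < 0.5:
            cy, cx = ny, nx
            break                                     # converged
        cy, cx = ny, nx
    return int(round(cy)), int(round(cx))

# toy weight map: the ball's color back-projection peaks in a 3x3 block at (10, 13)
weights = np.zeros((20, 20)); weights[9:12, 12:15] = 1.0
loc = mean_shift(weights, start=(7, 11))
```

Each iteration only touches the pixels under the window, which is why mean shift is cheap enough for per-frame ball tracking.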


Author(s):  
M O Elantcev ◽  
I O Arkhipov ◽  
R M Gafarov

The work deals with a method of eliminating the perspective distortion of an image acquired from an unmanned aerial vehicle (UAV) camera in order to transform it to match the parameters of a satellite image. The normalization is performed in one of two ways. The first variant consists in calculating an image transformation matrix from the camera position and orientation. The second variant is based on matching the current frame with the previous one. The matching yields shift, rotation, and scale parameters that are used to obtain an initial set of pairs of corresponding keypoints. From this set, four pairs are selected to calculate the perspective transformation matrix. This matrix is in turn used to obtain a new set of pairs of corresponding keypoints. The process is repeated as long as the number of pairs in the new set exceeds the number in the current one. The accumulated transformation matrix is then multiplied by the transformation matrix obtained during the normalization of the previous frame. The final part presents results showing that the proposed method can improve the accuracy of the visual navigation system at low computational cost.
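The step from four pairs of corresponding keypoints to a perspective (homography) matrix is the classical direct linear transform (DLT); a minimal sketch, with our own function names and a synthetic point set:

```python
import numpy as np

def homography_from_4pts(src, dst):
    """Build the 8x9 DLT system from 4 correspondences and take its null vector
    (smallest singular vector) as the 3x3 perspective matrix."""
    A = []
    for (x, y), (u, v) in zip(src, dst):
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y, -u])
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y, -v])
    _, _, Vt = np.linalg.svd(np.array(A, dtype=float))
    H = Vt[-1].reshape(3, 3)
    return H / H[2, 2]                      # normalize so H[2, 2] = 1

def apply_h(H, pt):
    """Apply a homography to a 2D point in homogeneous coordinates."""
    p = H @ np.array([pt[0], pt[1], 1.0])
    return p[:2] / p[2]

# synthetic check: unit square mapped by a pure scale-and-shift
src = [(0, 0), (1, 0), (1, 1), (0, 1)]
dst = [(2, 1), (4, 1), (4, 3), (2, 3)]
H = homography_from_4pts(src, dst)
center = apply_h(H, (0.5, 0.5))             # should land at the center of dst
```

In the described pipeline this matrix is then used to re-match keypoints, and the loop keeps the matrix that yields the largest consistent correspondence set.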


2009 ◽  
Vol 09 (04) ◽  
pp. 609-627 ◽  
Author(s):  
J. WANG ◽  
N. V. PATEL ◽  
W. I. GROSKY ◽  
F. FOTOUHI

In this paper, we address the problem of camera and object motion detection in the compressed domain. The estimation of camera motion and the segmentation of moving objects have been widely studied in a variety of contexts for video analysis, owing to their capability of providing essential clues for interpreting the high-level semantics of video sequences. A novel compressed-domain motion estimation and segmentation scheme is presented and applied in this paper. MPEG-2 compressed-domain information, namely Motion Vectors (MV) and Discrete Cosine Transform (DCT) coefficients, is filtered and manipulated to obtain a dense and reliable Motion Vector Field (MVF) over consecutive frames. An iterative segmentation scheme based on a generalized affine transformation model is exploited to perform global camera motion detection. The foreground spatiotemporal objects are then separated from the background by applying a temporal consistency check to the output of the iterative segmentation; this check coalesces the resulting foreground blocks and weeds out unqualified blocks. Illustrative examples are provided to demonstrate the efficacy of the proposed approach.
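Fitting the global camera motion with an affine model to the block motion vectors can be sketched as a least-squares problem (the data is synthetic and the paper's iterative outlier rejection is omitted; blocks whose vectors disagree with the fit would be the foreground candidates):

```python
import numpy as np

def fit_affine(points, vectors):
    """Least-squares fit of a 6-parameter affine motion model
    [u, v] = A @ [x, y] + t to block motion vectors."""
    X = np.array([[x, y, 1, 0, 0, 0] for x, y in points] +
                 [[0, 0, 0, x, y, 1] for x, y in points], dtype=float)
    mv = np.array(vectors, dtype=float)
    b = np.concatenate([mv[:, 0], mv[:, 1]])
    params, *_ = np.linalg.lstsq(X, b, rcond=None)
    return params  # [a11, a12, tx, a21, a22, ty]

# synthetic camera pan: every macroblock moves by (2, -1)
pts = [(0, 0), (16, 0), (0, 16), (16, 16), (8, 8)]
mvs = [(2, -1)] * len(pts)
p = fit_affine(pts, mvs)
```

For a pure pan the linear part of the model collapses to zero and the translation terms recover the camera motion; rotation or zoom would instead show up in the `a` coefficients.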

