Robust Individual-Cell/Object Tracking via PCANet Deep Network in Biomedicine and Computer Vision

BioMed Research International ◽

10.1155/2016/8182416 ◽

2016 ◽

Vol 2016 ◽

pp. 1-15

Author(s):

Bineng Zhong ◽

Shengnan Pan ◽

Cheng Wang ◽

Tian Wang ◽

Jixiang Du ◽

...

Keyword(s):

Deep Learning ◽

Object Tracking ◽

Large Scale ◽

Individual Cell ◽

Fundamental Problem ◽

Feature Learning ◽

Principal Component ◽

Feature Representation ◽

Appearance Model ◽

Extrinsic Factors

Tracking individual-cell/object over time is important in understanding drug treatment effects on cancer cells and video surveillance. A fundamental problem of individual-cell/object tracking is to simultaneously address the cell/object appearance variations caused by intrinsic and extrinsic factors. In this paper, inspired by the architecture of deep learning, we propose a robust feature learning method for constructing discriminative appearance models without large-scale pretraining. Specifically, in the initial frames, an unsupervised method is firstly used to learn the abstract feature of a target by exploiting both classic principal component analysis (PCA) algorithms with recent deep learning representation architectures. We use learned PCA eigenvectors as filters and develop a novel algorithm to represent a target by composing of a PCA-based filter bank layer, a nonlinear layer, and a patch-based pooling layer, respectively. Then, based on the feature representation, a neural network with one hidden layer is trained in a supervised mode to construct a discriminative appearance model. Finally, to alleviate the tracker drifting problem, a sample update scheme is carefully designed to keep track of the most representative and diverse samples during tracking. We test the proposed tracking method on two standard individual cell/object tracking benchmarks to show our tracker's state-of-the-art performance.

Download Full-text

Visual Object Tracking in RGB-D Data via Genetic Feature Learning

Complexity ◽

10.1155/2019/4539410 ◽

2019 ◽

Vol 2019 ◽

pp. 1-8

Author(s):

Ming-xin Jiang ◽

Xian-xian Luo ◽

Tao Hai ◽

Hai-yan Wang ◽

Song Yang ◽

...

Keyword(s):

Object Tracking ◽

Feature Learning ◽

Feature Representation ◽

Visual Object ◽

Visual Object Tracking ◽

Fitness Evaluation ◽

Candidate Solution ◽

Genetic Feature ◽

Computer Vision Applications ◽

Crossover And Mutation

Visual object tracking is a fundamental component in many computer vision applications. Extracting robust features of object is one of the most important steps in tracking. As trackers, only formulated on RGB data, are usually affected by occlusions, appearance, or illumination variations, we propose a novel RGB-D tracking method based on genetic feature learning in this paper. Our approach addresses feature learning as an optimization problem. As owning the advantage of parallel computing, genetic algorithm (GA) has fast speed of convergence and excellent global optimization performance. At the same time, unlike handcrafted feature and deep learning methods, GA can be employed to solve the problem of feature representation without prior knowledge, and it has no use for a large number of parameters to be learned. The candidate solution in RGB or depth modality is represented as an encoding of an image in GA, and genetic feature is learned through population initialization, fitness evaluation, selection, crossover, and mutation. The proposed RGB-D tracker is evaluated on popular benchmark dataset, and experimental results indicate that our method achieves higher accuracy and faster tracking speed.

Download Full-text

Online Multi-Object Tracking Based on Feature Representation and Bayesian Filtering Within a Deep Learning Architecture

IEEE Access ◽

10.1109/access.2019.2901520 ◽

2019 ◽

Vol 7 ◽

pp. 27923-27935 ◽

Cited By ~ 4

Author(s):

Jun Xiang ◽

Guoshuai Zhang ◽

Jianhua Hou

Keyword(s):

Deep Learning ◽

Object Tracking ◽

Feature Representation ◽

Bayesian Filtering

Download Full-text

Large-scale dynamic social data representation for structure feature learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189010 ◽

2020 ◽

Vol 39 (4) ◽

pp. 5253-5262

Author(s):

Xiaoxian Zhang ◽

Jianpei Zhang ◽

Jing Yang

Keyword(s):

Social Networks ◽

Social Network ◽

Large Scale ◽

Dynamic Change ◽

Feature Learning ◽

Data Representation ◽

Feature Representation ◽

Small Scale ◽

Dynamic Feature ◽

Dynamic Social Networks

The problems caused by network dimension disasters and computational complexity have become an important issue to be solved in the field of social network research. The existing methods for network feature learning are mostly based on static and small-scale assumptions, and there is no modified learning for the unique attributes of social networks. Therefore, existing learning methods cannot adapt to the dynamic and large-scale of current social networks. Even super large scale and other features. This paper mainly studies the feature representation learning of large-scale dynamic social network structure. In this paper, the positive and negative damping sampling of network nodes in different classes is carried out, and the dynamic feature learning method for newly added nodes is constructed, which makes the model feasible for the extraction of structural features of large-scale social networks in the process of dynamic change. The obtained node feature representation has better dynamic robustness. By selecting the real datasets of three large-scale dynamic social networks and the experiments of dynamic link prediction in social networks, it is found that DNPS has achieved a large performance improvement over the benchmark model in terms of prediction accuracy and time efficiency. When the α value is around 0.7, the model effect is optimal.

Download Full-text

Fast Image Search with Pixel-Based Deep Learning Framework via Efficient Compact Binary Code and Addictive Latent Layer

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001418590048 ◽

2017 ◽

Vol 32 (03) ◽

pp. 1859004 ◽

Cited By ~ 1

Author(s):

Jun Yi Li ◽

Jian Hua Li

Keyword(s):

Deep Learning ◽

Nearest Neighbor ◽

Feature Learning ◽

Nearest Neighbor Search ◽

Feature Representation ◽

Image Feature ◽

Retrieval Algorithm ◽

Image Search ◽

Learning Framework ◽

Compact Binary

As we know, the nearest neighbor search is a good and effective method for good-sized image search. This paper mainly introduced how to learn an outstanding image feature representation form and a series of compact binary Hash coding functions under deep learning framework. Our concept is that binary codes can be obtained using a hidden layer to present some latent concepts dominating the class labels with usable data labels. Our method is effective in obtaining hash codes and image representations, so it is suitable for good-sized dataset. It is demonstrated in our experiment that the performances of the proposed algorithms were then verified on three different databases, MNIST, CIFAR-10 and Caltech-101. The experimental results reveal that two-proposed image Hash retrieval algorithm based on pixel-level automatic feature learning show higher search accuracy than the other algorithms; moreover, these two algorithms were proved to be more favorable in scalability and generality.

Download Full-text

Deep Learning-Based Sentimental Analysis for Large-Scale Imbalanced Twitter Data

Future Internet ◽

10.3390/fi11090190 ◽

2019 ◽

Vol 11 (9) ◽

pp. 190 ◽

Cited By ~ 3

Author(s):

Jamal ◽

Xianqiao ◽

Aldabbas

Keyword(s):

Deep Learning ◽

Large Scale ◽

State Of The Art ◽

Hybrid Approach ◽

Principal Component ◽

Specific Topic ◽

Weighting Method ◽

Psychological Conditions ◽

Twitter Data ◽

Wide Range

Emotions detection in social media is very effective to measure the mood of people about a specific topic, news, or product. It has a wide range of applications, including identifying psychological conditions such as anxiety or depression in users. However, it is a challenging task to distinguish useful emotions’ features from a large corpus of text because emotions are subjective, with limited fuzzy boundaries that may be expressed in different terminologies and perceptions. To tackle this issue, this paper presents a hybrid approach of deep learning based on TensorFlow with Keras for emotions detection on a large scale of imbalanced tweets’ data. First, preprocessing steps are used to get useful features from raw tweets without noisy data. Second, the entropy weighting method is used to compute the importance of each feature. Third, class balancer is applied to balance each class. Fourth, Principal Component Analysis (PCA) is applied to transform high correlated features into normalized forms. Finally, the TensorFlow based deep learning with Keras algorithm is proposed to predict high-quality features for emotions classification. The proposed methodology is analyzed on a dataset of 1,600,000 tweets collected from the website ‘kaggle’. Comparison is made of the proposed approach with other state of the art techniques on different training ratios. It is proved that the proposed approach outperformed among other techniques.

Download Full-text

Appearance Model Based Moving Object Matching across Disjoint Camera Views

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.760-762.1322 ◽

2013 ◽

Vol 760-762 ◽

pp. 1322-1326

Author(s):

Kong Shuai Yu ◽

Dong Hu

Keyword(s):

Principal Component Analysis ◽

Transfer Function ◽

Object Tracking ◽

Real World ◽

Principal Component ◽

Appearance Model ◽

Color Spectrum ◽

Probabilistic Principal Component Analysis ◽

Camera Surveillance ◽

Disjoint Camera Views

A new object tracking scheme for multi-camera surveillance with non-overlapping views is proposed in this paper. Brightness transfer function (BTF) is used to establish relative appearance correspondence between different views. Mixtures of probabilistic principal component analysis (MPPCA) is incooperated to learn the subspace of brightness transfer function with the concern to deal with multiple different brightness areas in a scene. The incremental major color spectrum histogram (IMCSH) is used as similarity measure for reliable matching. Experimental results with real world videos show the effectiveness of the proposed algorithm.

Download Full-text

Hyperspectral Image Classification Algorithm Based on Principal Component Texture Feature Deep Learning

Journal of Medical Imaging and Health Informatics ◽

10.1166/jmihi.2020.3133 ◽

2020 ◽

Vol 10 (9) ◽

pp. 2027-2031

Author(s):

Xu Yifang

Keyword(s):

Deep Learning ◽

Image Classification ◽

Hyperspectral Image ◽

Texture Feature ◽

Feature Learning ◽

Principal Component ◽

Classification Algorithm ◽

Classification Problems ◽

Hyperspectral Image Classification ◽

Reconstruction Performance

Hyperspectral image classification refers to a key difficulty on the domain of remote sensing image processing. Feature learning is the basis of hyperspectral image classification problems. In addition, how to jointly use the space spectrum information is Also an important issue in hyperspectral image classification. Recent ages have seen that as further exploration is developing, the method of hyperspectral image cauterization according to deep learning has been rapidly developed. However, existing deep networks often only consider reconstruction performance while ignoring the task itself. In addition, for improving preciseness of classification, most categorization methods use the fixed-size neighborhood of per hyperspectral pixel as the object of feature extraction, ignoring the identification and difference between the neighborhood pixel and the current pixel. On the basis of exploration above, our research group put forward with an image classification algorithm based on principal component texture feature deep learning, and achieved good results.

Download Full-text

Learning Attention for Object Tracking with Adversarial Learning Network

10.21203/rs.3.rs-15512/v3 ◽

2020 ◽

Author(s):

Xu Cheng ◽

Chen Song ◽

Yongxiang Gu ◽

Beijing Chen ◽

Lin Zhou ◽

...

Keyword(s):

Deep Learning ◽

Object Tracking ◽

Large Scale ◽

Fine Tuning ◽

Object Localization ◽

Generative Adversarial Network ◽

Adversarial Learning ◽

Training Time ◽

Learning Network ◽

Object State

Abstract Artificial intelligence has been widely studied on solving intelligent surveillance analysis and security problems in recent years. Although many multimedia security approaches have been proposed by using deep learning network model, there are still some challenges on their performances which deserve in-depth research. On one hand, high computational complexity of current deep learning methods makes it hard to be applied to real-time scenario. On the other hand, it is difficult to obtain the specific features of a video by fine-tuning the network online with the object state of the first frame, which fails to capture rich appearance variations of the object. To solve above two issues, in this paper, an effective object tracking method with learning attention is proposed to achieve the object localization and reduce the training time in adversarial learning framework. First, a prediction network is designed to track the object in video sequences. The object positions of the first ten frames are employed to fine-tune prediction network, which can fully mine a specific features of an object. Second, the prediction network is integrated into the generative adversarial network framework, which randomly generates masks to capture object appearance variations via adaptively dropout input features. Third, we present a spatial attention mechanism to improve the tracking performance. The proposed network can identify the mask that maintains the most robust features of the objects over a long temporal span. Extensive experiments on two large-scale benchmarks demonstrate that the proposed algorithm performs favorably against state-of-the-art methods.

Download Full-text

VISUAL TRACKING UTILIZING OBJECT CONCEPT FROM DEEP LEARNING NETWORK

ISPRS Annals of Photogrammetry Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-annals-iv-1-w1-125-2017 ◽

2017 ◽

Vol IV-1/W1 ◽

pp. 125-132 ◽

Cited By ~ 1

Author(s):

C. Xiao ◽

A. Yilmaz ◽

S. Lia

Keyword(s):

Deep Learning ◽

Visual Tracking ◽

Large Scale ◽

Target Position ◽

Appearance Model ◽

Learning Network ◽

Concept Model ◽

Deep Learning Network ◽

High Level ◽

The Given

Despite having achieved good performance, visual tracking is still an open area of research, especially when target undergoes serious appearance changes which are not included in the model. So, in this paper, we replace the appearance model by a concept model which is learned from large-scale datasets using a deep learning network. The concept model is a combination of high-level semantic information that is learned from myriads of objects with various appearances. In our tracking method, we generate the target’s concept by combining the learned object concepts from classification task. We also demonstrate that the last convolutional feature map can be used to generate a heat map to highlight the possible location of the given target in new frames. Finally, in the proposed tracking framework, we utilize the target image, the search image cropped from the new frame and their heat maps as input into a localization network to find the final target position. Compared to the other state-of-the-art trackers, the proposed method shows the comparable and at times better performance in real-time.

Download Full-text

Video-Based Person Re-Identification: Methods, Datasets, and Deep Learning

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.c6524.029320 ◽

2020 ◽

Vol 9 (3) ◽

pp. 4249-4254

Keyword(s):

Deep Learning ◽

Large Scale ◽

Metric Learning ◽

Feature Representation ◽

Retrieval Models ◽

Still Images ◽

Research Attention ◽

Current Trends ◽

Huge Data ◽

New Applications

Video Analytics applications like security and surveillance face a critical problem of person re-identification abbreviated as re-ID. The last decade witnessed the emergence of large-scale datasets and deep learning methods to use these huge data volumes. Most current re-ID methods are classified into either image-based or video-based re-ID. Matching persons across multiple camera views have attracted lots of recent research attention. Feature representation and metric learning are major issues for person re-identification. The focus of re-ID work is now shifting towards developing end-to-end re-Id and tracking systems for practical use with dynamic datasets. Most previous works contributed to the significant progress of person re-identification on still images using image retrieval models. This survey considers the more informative and challenging video-based person re-ID problem, pedestrian re-ID in particular. Publicly available datasets and codes are listed as a part of this work. Current trends which include open re-identification systems, use of discriminative features and deep learning is marching towards new applications in security and surveillance, typically for tracking

Download Full-text