Tensor-Based Algorithms for Image Classification

Stefan Klus; Patrick Gelß

doi:10.3390/a12110240

Tensor-Based Algorithms for Image Classification

Algorithms ◽

10.3390/a12110240 ◽

2019 ◽

Vol 12 (11) ◽

pp. 240 ◽

Cited By ~ 1

Author(s):

Stefan Klus ◽

Patrick Gelß

Keyword(s):

Neural Network ◽

Image Classification ◽

State Of The Art ◽

The Other ◽

Learning Problems ◽

Governing Equations ◽

Data Set ◽

Tensor Networks ◽

Multidimensional Approximation ◽

Novel Approaches

Interest in machine learning with tensor networks has been growing rapidly in recent years. We show that tensor-based methods developed for learning the governing equations of dynamical systems from data can, in the same way, be used for supervised learning problems and propose two novel approaches for image classification. One is a kernel-based reformulation of the previously introduced multidimensional approximation of nonlinear dynamics (MANDy), the other an alternating ridge regression in the tensor train format. We apply both methods to the MNIST and fashion MNIST data set and show that the approaches are competitive with state-of-the-art neural network-based classifiers.

Download Full-text

Fast Accurate and Automatic Brushstroke Extraction

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3429742 ◽

2021 ◽

Vol 17 (2) ◽

pp. 1-24

Author(s):

Yunfei Fu ◽

Hongchuan Yu ◽

Chih-Kuo Yeh ◽

Tong-Yee Lee ◽

Jian J. Zhang

Keyword(s):

Neural Network ◽

Efficient Algorithm ◽

Deep Neural Network ◽

High Efficiency ◽

State Of The Art ◽

High Reliability ◽

The Other ◽

Manual Annotation ◽

Stroke Extraction ◽

Art Research

Brushstrokes are viewed as the artist’s “handwriting” in a painting. In many applications such as style learning and transfer, mimicking painting, and painting authentication, it is highly desired to quantitatively and accurately identify brushstroke characteristics from old masters’ pieces using computer programs. However, due to the nature of hundreds or thousands of intermingling brushstrokes in the painting, it still remains challenging. This article proposes an efficient algorithm for brush Stroke extraction based on a Deep neural network, i.e., DStroke. Compared to the state-of-the-art research, the main merit of the proposed DStroke is to automatically and rapidly extract brushstrokes from a painting without manual annotation, while accurately approximating the real brushstrokes with high reliability. Herein, recovering the faithful soft transitions between brushstrokes is often ignored by the other methods. In fact, the details of brushstrokes in a master piece of painting (e.g., shapes, colors, texture, overlaps) are highly desired by artists since they hold promise to enhance and extend the artists’ powers, just like microscopes extend biologists’ powers. To demonstrate the high efficiency of the proposed DStroke, we perform it on a set of real scans of paintings and a set of synthetic paintings, respectively. Experiments show that the proposed DStroke is noticeably faster and more accurate at identifying and extracting brushstrokes, outperforming the other methods.

Download Full-text

Dangerous Scenes Recognition During Hoisting Based on Faster Region-Based Convolutional Neural Network

Volume 2: Mechanics and Behavior of Active Materials; Structural Health Monitoring; Bioinspired Smart Materials and Systems; Energy Harvesting; Emerging Technologies ◽

10.1115/smasis2018-8226 ◽

2018 ◽

Author(s):

Hongguo Su ◽

Mingyuan Zhang ◽

Shengyuan Li ◽

Xuefeng Zhao

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Image Classification ◽

State Of The Art ◽

Spatial Location ◽

Average Precision ◽

Practical Applications ◽

Security Enhancement ◽

Human Interactions

In the last couple of years, advancements in the deep learning, especially in convolutional neural networks, proved to be a boon for the image classification and recognition tasks. One of the important practical applications of object detection and image classification can be for security enhancement. If dangerous objects or scenes can be identified automatically, then a lot of accidents can be prevented. For this purpose, in this paper we made use of state-of-the-art implementation of Faster Region-based Convolutional Neural Network (Faster R-CNN) based on the monitoring video of hoisting sites to train a model to detect the dangerous object and the worker. By extracting the locations of them, object-human interactions during hoisting, mainly for changes in their spatial location relationship, can be understood whereby estimating whether the scene is safe or dangerous. Experimental results showed that the pre-trained model achieved good performance with a high mean average precision of 97.66% on object detection and the proposed method fulfilled the goal of dangerous scenes recognition perfectly.

Download Full-text

IMPROVEMENT OF IMAGE CLASSIFICATION USING WAVELET COEFFICIENTS WITH STRUCTURED-BASED NEURAL NETWORK

International Journal of Neural Systems ◽

10.1142/s012906570800152x ◽

2008 ◽

Vol 18 (03) ◽

pp. 195-205 ◽

Cited By ~ 36

Author(s):

WEIBAO ZOU ◽

ZHERU CHI ◽

KING CHUEN LO

Keyword(s):

Neural Network ◽

Wavelet Analysis ◽

Image Classification ◽

Dimensional Space ◽

Training Data ◽

Wavelet Coefficients ◽

Classification Rate ◽

Data Set ◽

Low Dimensional ◽

High Level

Image classification is a challenging problem in organizing a large image database. However, an effective method for such an objective is still under investigation. A method based on wavelet analysis to extract features for image classification is presented in this paper. After an image is decomposed by wavelet, the statistics of its features can be obtained by the distribution of histograms of wavelet coefficients, which are respectively projected onto two orthogonal axes, i.e., x and y directions. Therefore, the nodes of tree representation of images can be represented by the distribution. The high level features are described in low dimensional space including 16 attributes so that the computational complexity is significantly decreased. 2800 images derived from seven categories are used in experiments. Half of the images were used for training neural network and the other images used for testing. The features extracted by wavelet analysis and the conventional features are used in the experiments to prove the efficacy of the proposed method. The classification rate on the training data set with wavelet analysis is up to 91%, and the classification rate on the testing data set reaches 89%. Experimental results show that our proposed approach for image classification is more effective.

Download Full-text

Adaptive Multisensor Acquisition via Spatial Contextual Information for Compressive Spectral Image Classification

10.36227/techrxiv.14988804.v1 ◽

2021 ◽

Author(s):

Nelson Diaz ◽

Juan Marcos ◽

Esteban Vera ◽

Henry Arguello

Keyword(s):

Image Classification ◽

State Of The Art ◽

Contextual Information ◽

Real Data ◽

Experimental Setup ◽

Spectral Image ◽

Data Set ◽

Adaptive Sensing

Results of extensive simulations are shown for two state-of-the-art databases: Pavia University and Indian Pines. Furthermore, an experimental setup that performs the adaptive sensing was built to test the performance of the proposed approach on a real data set.

Download Full-text

Image Classification Algorithm Based on Big Data and Multilabel Learning of Improved Convolutional Neural Network

Wireless Communications and Mobile Computing ◽

10.1155/2021/3138398 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Haibin Chang ◽

Ying Cui

Keyword(s):

Neural Network ◽

Neural Networks ◽

Convolutional Neural Network ◽

Image Classification ◽

Convolutional Neural Networks ◽

Classification Algorithm ◽

Large Set ◽

Data Set ◽

Multilabel Learning ◽

Average Maximum

More and more image materials are used in various industries these days. Therefore, how to collect useful images from a large set has become an urgent priority. Convolutional neural networks (CNN) have achieved good results in certain image classification tasks, but there are still problems such as poor classification ability, low accuracy, and slow convergence speed. This article mainly introduces the image classification algorithm (ICA) research based on the multilabel learning of the improved convolutional neural network and some improvement ideas for the research of the ICA based on the multilabel learning of the convolutional neural network. This paper proposes an ICA research method based on multilabel learning of improved convolutional neural networks, including the image classification process, convolutional network algorithm, and multilabel learning algorithm. The conclusions show that the average maximum classification accuracy of the improved CNN in this paper is 90.63%, and the performance is better, which is beneficial to improving the efficiency of image classification. The improved CNN network structure has reached the highest accuracy rate of 91.47% on the CIFAR-10 data set, which is much higher than the traditional CNN algorithm.

Download Full-text

Uncertainty-Aware Few-Shot Image Classification

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/471 ◽

2021 ◽

Author(s):

Zhizheng Zhang ◽

Cuiling Lan ◽

Wenjun Zeng ◽

Zhibo Chen ◽

Shih-Fu Chang

Keyword(s):

Neural Network ◽

Image Classification ◽

State Of The Art ◽

Metric Learning ◽

The State ◽

Modeling Uncertainty ◽

Support Set ◽

Art Performance ◽

Graph Based Model ◽

Query Sample

Few-shot image classification learns to recognize new categories from limited labelled data. Metric learning based approaches have been widely investigated, where a query sample is classified by finding the nearest prototype from the support set based on their feature similarities. A neural network has different uncertainties on its calculated similarities of different pairs. Understanding and modeling the uncertainty on the similarity could promote the exploitation of limited samples in few-shot optimization. In this work, we propose Uncertainty-Aware Few-Shot framework for image classification by modeling uncertainty of the similarities of query-support pairs and performing uncertainty-aware optimization. Particularly, we exploit such uncertainty by converting observed similarities to probabilistic representations and incorporate them to the loss for more effective optimization. In order to jointly consider the similarities between a query and the prototypes in a support set, a graph-based model is utilized to estimate the uncertainty of the pairs. Extensive experiments show our proposed method brings significant improvements on top of a strong baseline and achieves the state-of-the-art performance.

Download Full-text

Application Research of Deep Convolutional Neural Network in Computer Vision

Journal of Networking and Telecommunications ◽

10.18282/jnt.v2i2.886 ◽

2020 ◽

Vol 2 (2) ◽

pp. 23

Author(s):

Lei Wang

Keyword(s):

Neural Network ◽

Neural Networks ◽

Computer Vision ◽

Face Recognition ◽

Human Brain ◽

Image Classification ◽

Convolutional Neural Networks ◽

Convolution Neural Network ◽

Data Set ◽

Deep Convolution Neural Network

<p>As an important research achievement in the field of brain like computing, deep convolution neural network has been widely used in many fields such as computer vision, natural language processing, information retrieval, speech recognition, semantic understanding and so on. It has set off a wave of neural network research in industry and academia and promoted the development of artificial intelligence. At present, the deep convolution neural network mainly simulates the complex hierarchical cognitive laws of the human brain by increasing the number of layers of the network, using a larger training data set, and improving the network structure or training learning algorithm of the existing neural network, so as to narrow the gap with the visual system of the human brain and enable the machine to acquire the capability of "abstract concepts". Deep convolution neural network has achieved great success in many computer vision tasks such as image classification, target detection, face recognition, pedestrian recognition, etc. Firstly, this paper reviews the development history of convolutional neural networks. Then, the working principle of the deep convolution neural network is analyzed in detail. Then, this paper mainly introduces the representative achievements of convolution neural network from the following two aspects, and shows the improvement effect of various technical methods on image classification accuracy through examples. From the aspect of adding network layers, the structures of classical convolutional neural networks such as AlexNet, ZF-Net, VGG, GoogLeNet and ResNet are discussed and analyzed. From the aspect of increasing the size of data set, the difficulties of manually adding labeled samples and the effect of using data amplification technology on improving the performance of neural network are introduced. This paper focuses on the latest research progress of convolution neural network in image classification and face recognition. Finally, the problems and challenges to be solved in future brain-like intelligence research based on deep convolution neural network are proposed.</p>

Download Full-text

Image Classification for Vehicle Type Dataset Using State-of-the-art Convolutional Neural Network Architecture

Proceedings of the 2018 Artificial Intelligence and Cloud Computing Conference on ZZZ - AICCC '18 ◽

10.1145/3299819.3299822 ◽

2018 ◽

Author(s):

Yian Seo ◽

Kyung-shik Shin

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Image Classification ◽

Network Architecture ◽

State Of The Art ◽

Neural Network Architecture ◽

Vehicle Type

Download Full-text

Unsupervised Representation High-Resolution Remote Sensing Image Scene Classification via Contrastive Learning Convolutional Neural Network

Photogrammetric Engineering & Remote Sensing ◽

10.14358/pers.87.8.577 ◽

2021 ◽

Vol 87 (8) ◽

pp. 577-591

Author(s):

Fengpeng Li ◽

Jiabao Li ◽

Wei Han ◽

Ruyi Feng ◽

Lizhe Wang

Keyword(s):

Neural Network ◽

Remote Sensing ◽

Deep Learning ◽

High Resolution ◽

Convolutional Neural Network ◽

State Of The Art ◽

Remote Sensing Image ◽

Scene Classification ◽

Data Set ◽

Unsupervised Deep Learning

Inspired by the outstanding achievement of deep learning, supervised deep learning representation methods for high-spatial-resolution remote sensing image scene classification obtained state-of-the-art performance. However, supervised deep learning representation methods need a considerable amount of labeled data to capture class-specific features, limiting the application of deep learning-based methods while there are a few labeled training samples. An unsupervised deep learning representation, high-resolution remote sensing image scene classification method is proposed in this work to address this issue. The proposed method, called contrastive learning, narrows the distance between positive views: color channels belonging to the same images widens the gaps between negative view pairs consisting of color channels from different images to obtain class-specific data representations of the input data without any supervised information. The classifier uses extracted features by the convolutional neural network (CNN)-based feature extractor with labeled information of training data to set space of each category and then, using linear regression, makes predictions in the testing procedure. Comparing with existing unsupervised deep learning representation high-resolution remote sensing image scene classification methods, contrastive learning CNN achieves state-of-the-art performance on three different scale benchmark data sets: small scale RSSCN7 data set, midscale aerial image data set, and large-scale NWPU-RESISC45 data set.

Download Full-text

Bitstream-Based Neural Network for Scalable, Efficient, and Accurate Deep Learning Hardware

Frontiers in Neuroscience ◽

10.3389/fnins.2020.543472 ◽

2020 ◽

Vol 14 ◽

Author(s):

Hyeonuk Sim ◽

Jongeun Lee

Keyword(s):

Neural Network ◽

Deep Learning ◽

Recognition Accuracy ◽

Fault Tolerant ◽

State Of The Art ◽

The Other ◽

Stochastic Computing ◽

Hardware Implementations ◽

Neuromorphic Hardware ◽

Very High

While convolutional neural networks (CNNs) continue to renew state-of-the-art performance across many fields of machine learning, their hardware implementations tend to be very costly and inflexible. Neuromorphic hardware, on the other hand, targets higher efficiency but their inference accuracy lags far behind that of CNNs. To bridge the gap between deep learning and neuromorphic computing, we present bitstream-based neural network, which is both efficient and accurate as well as being flexible in terms of arithmetic precision and hardware size. Our bitstream-based neural network (called SC-CNN) is built on top of CNN but inspired by stochastic computing (SC), which uses bitstreams to represent numbers. Being based on CNN, our SC-CNN can be trained with backpropagation, ensuring very high inference accuracy. At the same time our SC-CNN is deterministic, hence repeatable, and is highly accurate and scalable even to large networks. Our experimental results demonstrate that our SC-CNN is highly accurate up to ImageNet-targeting CNNs, and improves efficiency over conventional digital designs ranging through 50–100% in operations-per-area depending on the CNN and the application scenario, while losing <1% in recognition accuracy. In addition, our SC-CNN implementations can be much more fault-tolerant than conventional digital implementations.

Download Full-text