A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Sukho Lee; Hyein Kim; Byeongseon Jeong; Jungho Yoon

doi:10.3390/app11020643

A Training Method for Low Rank Convolutional Neural Networks Based on Alternating Tensor Compose-Decompose Method

Applied Sciences ◽

10.3390/app11020643 ◽

2021 ◽

Vol 11 (2) ◽

pp. 643

Author(s):

Sukho Lee ◽

Hyein Kim ◽

Byeongseon Jeong ◽

Jungho Yoon

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Model ◽

Low Rank ◽

Training Method ◽

Training Stage ◽

Fixed Structure ◽

Deep Learning Model

Over the past decade, deep learning-based computer vision methods have been shown to surpass previous state-of-the-art computer vision techniques in various fields, and have made great progress in various computer vision problems, including object detection, object segmentation, face recognition, etc. Nowadays, major IT companies are adding new deep-learning-based computer technologies to edge devices such as smartphones. However, since the computational cost of deep learning-based models is still high for edge devices, research is being actively carried out to compress deep learning-based models while not sacrificing high performance. Recently, many lightweight architectures have been proposed for deep learning-based models which are based on low-rank approximation. In this paper, we propose an alternating tensor compose-decompose (ATCD) method for the training of low-rank convolutional neural networks. The proposed training method can better train a compressed low-rank deep learning model than the conventional fixed-structure based training method, so that a compressed deep learning model with higher performance can be obtained in the end of the training. As a representative and exemplary model to which the proposed training method can be applied, we propose a rank-1 convolutional neural network (CNN) which has a structure alternatively containing 3-D rank-1 filters and 1-D filters in the training stage and a 1-D structure in the testing stage. After being trained, the 3-D rank-1 filters can be permanently decomposed into 1-D filters to achieve a fast inference in the test time. The reason that the 1-D filters are not being trained directly in 1-D form in the training stage is that the training of the 3-D rank-1 filters is easier due to the better gradient flow, which makes the training possible even in the case when the fixed structured network with fixed consecutive 1-D filters cannot be trained at all. We also show that the same training method can be applied to the well-known MobileNet architecture so that better parameters can be obtained than with the conventional fixed-structure training method. Furthermore, we show that the 1-D filters in a ResNet like structure can also be trained with the proposed method, which shows the fact that the proposed method can be applied to various structures of networks.

Download Full-text

A Deep Learning Model Based on Convolutional Neural Networks for Classification of Magnetic Resonance Prostate Images

Artificial Intelligence and Applied Mathematics in Engineering Problems - Lecture Notes on Data Engineering and Communications Technologies ◽

10.1007/978-3-030-36178-5_59 ◽

2020 ◽

pp. 701-708

Author(s):

Fatih Uysal ◽

Fırat Hardalaç ◽

Mustafa Koç

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Magnetic Resonance ◽

Convolutional Neural Networks ◽

Learning Model ◽

Model Based ◽

Deep Learning Model

Download Full-text

A Novel Deep Learning Model Based on Convolutional Neural Networks for Employee Churn Prediction

Journal of Forecasting ◽

10.1002/for.2827 ◽

2021 ◽

Author(s):

Ebru Pekel Ozmen ◽

Tuncay Ozcan

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Model ◽

Churn Prediction ◽

Model Based ◽

Deep Learning Model

Download Full-text

Development and Validation of a Deep Learning Model Using Convolutional Neural Networks to Identify Scaphoid Fractures in Radiographs

JAMA Network Open ◽

10.1001/jamanetworkopen.2021.6096 ◽

2021 ◽

Vol 4 (5) ◽

pp. e216096

Author(s):

Alfred P. Yoon ◽

Yi-Lun Lee ◽

Robert L. Kane ◽

Chang-Fu Kuo ◽

Chihung Lin ◽

...

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Learning Model ◽

Scaphoid Fractures ◽

Development And Validation ◽

Deep Learning Model

Download Full-text

Cascaded one-shot deformable convolutional neural networks: Developing a deep learning model for respiratory motion estimation in ultrasound sequences

Medical Image Analysis ◽

10.1016/j.media.2020.101793 ◽

2020 ◽

Vol 65 ◽

pp. 101793 ◽

Cited By ~ 1

Author(s):

Fei Liu ◽

Dan Liu ◽

Jie Tian ◽

Xiaoyan Xie ◽

Xin Yang ◽

...

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Motion Estimation ◽

Convolutional Neural Networks ◽

Respiratory Motion ◽

Learning Model ◽

Deep Learning Model

Download Full-text

Errata for "Cerebro: a data system for optimized deep learning model selection"

Proceedings of the VLDB Endowment ◽

10.14778/3447689.3447691 ◽

2021 ◽

Vol 14 (6) ◽

pp. 863-863

Author(s):

Supun Nakandala ◽

Yuhao Zhang ◽

Arun Kumar

Keyword(s):

Deep Learning ◽

Model Selection ◽

Learning Model ◽

Communication Cost ◽

Data System ◽

Training Method ◽

Fine Grained ◽

Correct Communication ◽

Deep Learning Model

We discovered that there was an inconsistency in the communication cost formulation for the decentralized fine-grained training method in Table 2 of our paper [1]. We used Horovod as the archetype for decentralized fine-grained approaches, and its correct communication cost is higher than what we had reported. So, we amend the communication cost of decentralized fine-grained to [EQUATION]

Download Full-text

Image classification using Deep learning

International Journal of Engineering & Technology ◽

10.14419/ijet.v7i2.7.10892 ◽

2018 ◽

Vol 7 (2.7) ◽

pp. 614 ◽

Cited By ~ 5

Author(s):

M Manoj krishna ◽

M Neelima ◽

M Harshali ◽

M Venu Gopala Rao

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Image Processing ◽

Computer Vision ◽

Deep Learning ◽

Image Classification ◽

Convolutional Neural Networks ◽

Classical Problem

The image classification is a classical problem of image processing, computer vision and machine learning fields. In this paper we study the image classification using deep learning. We use AlexNet architecture with convolutional neural networks for this purpose. Four test images are selected from the ImageNet database for the classification purpose. We cropped the images for various portion areas and conducted experiments. The results show the effectiveness of deep learning based image classification using AlexNet.

Download Full-text

Flood susceptibility mapping and assessment using a novel deep learning model combining multilayer perceptron and autoencoder neural networks

Journal of Flood Risk Management ◽

10.1111/jfr3.12683 ◽

2020 ◽

Author(s):

Mohammad Ahmadlou ◽

A'kif Al‐Fugara ◽

Abdel Rahman Al‐Shabeeb ◽

Aman Arora ◽

Rida Al‐Adamat ◽

...

Keyword(s):

Neural Networks ◽

Deep Learning ◽

Multilayer Perceptron ◽

Learning Model ◽

Susceptibility Mapping ◽

Flood Susceptibility ◽

Model Combining ◽

Flood Susceptibility Mapping ◽

Deep Learning Model

Download Full-text

A Study of The Convolutional Neural Networks Applications

UKH Journal of Science and Engineering ◽

10.25079/ukhjse.v3n2y2019.pp31-40 ◽

2019 ◽

Vol 3 (2) ◽

pp. 31-40 ◽

Cited By ~ 2

Author(s):

Ahmed Shamsaldin ◽

Polla Fattah ◽

Tarik Rashid ◽

Nawzad Al-Salihi

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Natural Language Processing ◽

Face Recognition ◽

Convolutional Neural Networks ◽

Language Processing ◽

Text Classification ◽

Scene Labeling ◽

Real World Problems

At present, deep learning is widely used in a broad range of arenas. A convolutional neural networks (CNN) is becoming the star of deep learning as it gives the best and most precise results when cracking real-world problems. In this work, a brief description of the applications of CNNs in two areas will be presented: First, in computer vision, generally, that is, scene labeling, face recognition, action recognition, and image classification; Second, in natural language processing, that is, the fields of speech recognition and text classification.

Download Full-text

Masked Face Analysis via Multi-Task Deep Learning

Journal of Imaging ◽

10.3390/jimaging7100204 ◽

2021 ◽

Vol 7 (10) ◽

pp. 204

Author(s):

Vatsa S. Patel ◽

Zhongliang Nie ◽

Trung-Nghia Le ◽

Tam V. Nguyen

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Face Recognition ◽

Face Mask ◽

Learning Model ◽

Unified Framework ◽

Improve Performance ◽

Face Analysis ◽

And Gender ◽

Deep Learning Model

Face recognition with wearable items has been a challenging task in computer vision and involves the problem of identifying humans wearing a face mask. Masked face analysis via multi-task learning could effectively improve performance in many fields of face analysis. In this paper, we propose a unified framework for predicting the age, gender, and emotions of people wearing face masks. We first construct FGNET-MASK, a masked face dataset for the problem. Then, we propose a multi-task deep learning model to tackle the problem. In particular, the multi-task deep learning model takes the data as inputs and shares their weight to yield predictions of age, expression, and gender for the masked face. Through extensive experiments, the proposed framework has been found to provide a better performance than other existing methods.

Download Full-text

Multimodal Multi-tasking for Skin Lesion Classification Using Deep Neural Networks

10.1007/978-3-030-90439-5_3 ◽

2021 ◽

pp. 27-38

Author(s):

Rafaela Carvalho ◽

João Pedrosa ◽

Tudor Nedelcu

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Skin Lesion ◽

Deep Neural Networks ◽

Classification Performance ◽

Additional Information ◽

Deep Learning Model ◽

Lesion Classification

AbstractSkin cancer is one of the most common types of cancer and, with its increasing incidence, accurate early diagnosis is crucial to improve prognosis of patients. In the process of visual inspection, dermatologists follow specific dermoscopic algorithms and identify important features to provide a diagnosis. This process can be automated as such characteristics can be extracted by computer vision techniques. Although deep neural networks can extract useful features from digital images for skin lesion classification, performance can be improved by providing additional information. The extracted pseudo-features can be used as input (multimodal) or output (multi-tasking) to train a robust deep learning model. This work investigates the multimodal and multi-tasking techniques for more efficient training, given the single optimization of several related tasks in the latter, and generation of better diagnosis predictions. Additionally, the role of lesion segmentation is also studied. Results show that multi-tasking improves learning of beneficial features which lead to better predictions, and pseudo-features inspired by the ABCD rule provide readily available helpful information about the skin lesion.

Download Full-text