Deep Learning Model with Transfer Learning to Infer Personal Preferences in Images

Jaeho Oh; Mincheol Kim; Sang-Woo Ban

doi:10.3390/app10217641

Deep Learning Model with Transfer Learning to Infer Personal Preferences in Images

Applied Sciences ◽

10.3390/app10217641 ◽

2020 ◽

Vol 10 (21) ◽

pp. 7641

Author(s):

Jaeho Oh ◽

Mincheol Kim ◽

Sang-Woo Ban

Keyword(s):

Transfer Learning ◽

Learning Model ◽

Scene Recognition ◽

Public Image ◽

Preference Learning ◽

Indoor Scene ◽

Proposed Model ◽

In The Wild ◽

Visual Preferences ◽

Visual Characteristics

In this paper, we propose a deep convolutional neural network model with transfer learning that reflects personal preferences from inter-domain databases of images having atypical visual characteristics. The proposed model utilized three public image databases (Fashion-MNIST, Labeled Faces in the Wild [LFW], and Indoor Scene Recognition) that include images with atypical visual characteristics in order to train and infer personal visual preferences. The effectiveness of transfer learning for incremental preference learning was verified by experiments using inter-domain visual datasets with different visual characteristics. Moreover, a gradient class activation mapping (Grad-CAM) approach was applied to the proposed model, providing explanations about personal visual preference possibilities. Experiments showed that the proposed preference-learning model using transfer learning outperformed a preference model not using transfer learning. In terms of the accuracy of preference recognition, the proposed model showed a maximum of about 7.6% improvement for the LFW database and a maximum of about 9.4% improvement for the Indoor Scene Recognition database, compared to the model that did not reflect transfer learning.

Download Full-text

Deep Learning Scene Recognition Method Based on Localization Enhancement

Sensors ◽

10.3390/s18103376 ◽

2018 ◽

Vol 18 (10) ◽

pp. 3376 ◽

Cited By ~ 3

Author(s):

Wei Guo ◽

Ran Wu ◽

Yanhua Chen ◽

Xinyan Zhu

Keyword(s):

Indoor Localization ◽

Rapid Development ◽

Recognition Rate ◽

Image Data ◽

Scene Recognition ◽

Experimental Result ◽

Indoor Location ◽

Indoor Scene ◽

Proposed Model ◽

Signals Of Opportunity

With the rapid development of indoor localization in recent years; signals of opportunity have become a reliable and convenient source for indoor localization. The mobile device cannot only capture images of the indoor environment in real-time, but can also obtain one or more different types of signals of opportunity as well. Based on this, we design a convolutional neural network (CNN) model that concatenates features of image data and signals of opportunity for localization by using indoor scene datasets and simulating the situation of indoor location probability. Using the method of transfer learning on the Inception V3 network model feature information is added to assist in scene recognition. The experimental result shows that, for two different experiment sceneries, the accuracies of the prediction results are 97.0% and 96.6% using the proposed model, compared to 69.0% and 81.2% by the method of overlapping positioning information and the base map, and compared to 73.3% and 77.7% by using the fine-tuned Inception V3 model. The accuracy of indoor scene recognition is improved; in particular, the error rate at the spatial connection of different scenes is decreased, and the recognition rate of similar scenes is increased.

Download Full-text

EnsemV3X: a novel ensembled deep learning architecture for multi-label scene classification

PeerJ Computer Science ◽

10.7717/peerj-cs.557 ◽

2021 ◽

Vol 7 ◽

pp. e557

Author(s):

Priyal Sobti ◽

Anand Nayyar ◽

Niharika ◽

Preeti Nagrath

Keyword(s):

Computer Vision ◽

Transfer Learning ◽

Scene Recognition ◽

Fine Tuning ◽

Amazon Mechanical Turk ◽

Scene Classification ◽

Large Database ◽

Crowd Sourcing ◽

Proposed Model ◽

The Web

Convolutional neural network is widely used to perform the task of image classification, including pretraining, followed by fine-tuning whereby features are adapted to perform the target task, on ImageNet. ImageNet is a large database consisting of 15 million images belonging to 22,000 categories. Images collected from the Web are labeled using Amazon Mechanical Turk crowd-sourcing tool by human labelers. ImageNet is useful for transfer learning because of the sheer volume of its dataset and the number of object classes available. Transfer learning using pretrained models is useful because it helps to build computer vision models in an accurate and inexpensive manner. Models that have been pretrained on substantial datasets are used and repurposed for our requirements. Scene recognition is a widely used application of computer vision in many communities and industries, such as tourism. This study aims to show multilabel scene classification using five architectures, namely, VGG16, VGG19, ResNet50, InceptionV3, and Xception using ImageNet weights available in the Keras library. The performance of different architectures is comprehensively compared in the study. Finally, EnsemV3X is presented in this study. The proposed model with reduced number of parameters is superior to state-of-of-the-art models Inception and Xception because it demonstrates an accuracy of 91%.

Download Full-text

AN EFFICIENT MACHINE LEARNING MODEL FOR PREDICTION OF ACUTE MYOCARDIAL INFARCTION

Recent Advances in Computer Science and Communications ◽

10.2174/2666255813666200325104317 ◽

2020 ◽

Vol 13 ◽

Author(s):

Dhilsath Fathima.M ◽

S. Justin Samuel ◽

R. Hari Haran

Keyword(s):

Machine Learning ◽

Myocardial Infarction ◽

Acute Myocardial Infarction ◽

Logistic Regression ◽

Decision Tree ◽

Learning Model ◽

Training Dataset ◽

Data Set ◽

Machine Learning Model ◽

Proposed Model

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.

Download Full-text

Affective Expression Analysis in-the-wild using Multi-Task Temporal Statistical Deep Learning Model

2020 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020) ◽

10.1109/fg47880.2020.00093 ◽

2020 ◽

Author(s):

Nhu-Tai Do ◽

Tram-Tran Nguyen-Quynh ◽

Soo-Hyung Kim

Keyword(s):

Deep Learning ◽

Expression Analysis ◽

Learning Model ◽

Affective Expression ◽

In The Wild ◽

Deep Learning Model

Download Full-text

Novel deep transfer learning model for COVID-19 patient detection using X-ray chest images

Journal of Ambient Intelligence and Humanized Computing ◽

10.1007/s12652-021-03306-6 ◽

2021 ◽

Author(s):

N. Kumar ◽

M. Gupta ◽

D. Gupta ◽

S. Tiwari

Keyword(s):

Transfer Learning ◽

Learning Model ◽

X Ray

Download Full-text

Transfer Learning of a Deep Learning Model for Exploring Tourists’ Urban Image Using Geotagged Photos

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi10030137 ◽

2021 ◽

Vol 10 (3) ◽

pp. 137

Author(s):

Youngok Kang ◽

Nahye Cho ◽

Jiyoung Yoon ◽

Soyeon Park ◽

Jiyeon Kim

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Field Studies ◽

Learning Model ◽

Learning Technologies ◽

Regional Study ◽

Processing Technologies ◽

Final Model ◽

Urban Image ◽

Deep Learning Model

Recently, as computer vision and image processing technologies have rapidly advanced in the artificial intelligence (AI) field, deep learning technologies have been applied in the field of urban and regional study through transfer learning. In the tourism field, studies are emerging to analyze the tourists’ urban image by identifying the visual content of photos. However, previous studies have limitations in properly reflecting unique landscape, cultural characteristics, and traditional elements of the region that are prominent in tourism. With the purpose of going beyond these limitations of previous studies, we crawled 168,216 Flickr photos, created 75 scenes and 13 categories as a tourist’ photo classification by analyzing the characteristics of photos posted by tourists and developed a deep learning model by continuously re-training the Inception-v3 model. The final model shows high accuracy of 85.77% for the Top 1 and 95.69% for the Top 5. The final model was applied to the entire dataset to analyze the regions of attraction and the tourists’ urban image in Seoul. We found that tourists feel attracted to Seoul where the modern features such as skyscrapers and uniquely designed architectures and traditional features such as palaces and cultural elements are mixed together in the city. This work demonstrates a tourist photo classification suitable for local characteristics and the process of re-training a deep learning model to effectively classify a large volume of tourists’ photos.

Download Full-text

A KLIEP-based Transfer Learning Model for Gear Fault Diagnosis under Varying Working Conditions

2020 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD) ◽

10.1109/icsmd50554.2020.9261691 ◽

2020 ◽

Author(s):

Chao Chen ◽

Fei Shen ◽

Zhaoyan Fan ◽

Robert X. Gao ◽

Ruqiang Yan

Keyword(s):

Fault Diagnosis ◽

Transfer Learning ◽

Working Conditions ◽

Learning Model ◽

Gear Fault ◽

Gear Fault Diagnosis

Download Full-text

Application of a modified Inception-v3 model in the dynasty-based classification of ancient murals

EURASIP Journal on Advances in Signal Processing ◽

10.1186/s13634-021-00740-8 ◽

2021 ◽

Vol 2021 (1) ◽

Author(s):

Jianfang Cao ◽

Minmin Yan ◽

Yiming Jia ◽

Xiaodong Tian ◽

Zibang Zhang

Keyword(s):

Transfer Learning ◽

Adaptive Learning ◽

Deep Level ◽

Image Resolution ◽

Learning Rate ◽

Stochastic Gradient Descent ◽

Natural Image ◽

Historical Period ◽

Proposed Model

AbstractIt is difficult to identify the historical period in which some ancient murals were created because of damage due to artificial and/or natural factors; similarities in content, style, and color among murals; low image resolution; and other reasons. This study proposed a transfer learning-fused Inception-v3 model for dynasty-based classification. First, the model adopted Inception-v3 with frozen fully connected and softmax layers for pretraining over ImageNet. Second, the model fused Inception-v3 with transfer learning for parameter readjustment over small datasets. Third, the corresponding bottleneck files of the mural images were generated, and the deep-level features of the images were extracted. Fourth, the cross-entropy loss function was employed to calculate the loss value at each step of the training, and an algorithm for the adaptive learning rate on the stochastic gradient descent was applied to unify the learning rate. Finally, the updated softmax classifier was utilized for the dynasty-based classification of the images. On the constructed small datasets, the accuracy rate, recall rate, and F1 value of the proposed model were 88.4%, 88.36%, and 88.32%, respectively, which exhibited noticeable increases compared with those of typical deep learning models and modified convolutional neural networks. Comparisons of the classification outcomes for the mural dataset with those for other painting datasets and natural image datasets showed that the proposed model achieved stable classification outcomes with a powerful generalization capacity. The training time of the proposed model was only 0.7 s, and overfitting seldom occurred.

Download Full-text

Developing Games Using a Principles-Based Approach

Journal of Coaching Education ◽

10.1123/jce.4.2.88 ◽

2011 ◽

Vol 4 (2) ◽

pp. 88

Author(s):

Peter Baggetta

Keyword(s):

New Zealand ◽

Transfer Learning ◽

Learning Model ◽

Expertise Development ◽

Teaching Games For Understanding ◽

Theory To Practice ◽

Tactical Decision

The Teaching Games for Understanding (TGfU) model was first developed by Bunker and Thorpe in 1982 as a model for coaches to help players become more skillful players. Since then other versions of the model have been developed such as the tactical decision-learning model (Grehaigne, Godbout, & Bouthier, 2001) in France and the game–sense approach (Australian Sports Commission, 1991) in Australia and New Zealand. The key aspect of all the models is the design of well-structured conditioned and modified games that require players to make decisions to develop their game understanding and tactical awareness. However, both novice and experienced coaches often struggle with connecting theory to practice especially in the area of creating and developing contextualized games that actually transfer learning from training to performance in games. In order to effectively create and use games that transfer learning, coaches can use a Principles-Based approach to develop games. The Principles-Based approach removes the dichotomy of traditional drills versus games and instead combines the drills approach with a games-context approach that links principles to skills that allow for increased individual and team expertise development. This presentation will first describe a model for developing and connecting principles, policies, tactics and skills for team play. Following this the presentation will then describe how to use the principles to create contextualized games that connect practices with performance and progresses novice players toward becoming more competent performers.

Download Full-text

Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) ◽

10.1109/cvpr.2016.324 ◽

2016 ◽

Cited By ~ 31

Author(s):

Hongyuan Zhu ◽

Jean-Baptiste Weibel ◽

Shijian Lu

Keyword(s):

Feature Fusion ◽

Scene Recognition ◽

Indoor Scene

Download Full-text