Vision-Based Classification of Mosquito Species: Comparison of Conventional and Deep Learning Methods

Kazushige Okayasu; Kota Yoshida; Masataka Fuchida; Akio Nakamura

doi:10.3390/app9183935

Vision-Based Classification of Mosquito Species: Comparison of Conventional and Deep Learning Methods

Applied Sciences ◽

10.3390/app9183935 ◽

2019 ◽

Vol 9 (18) ◽

pp. 3935 ◽

Cited By ~ 3

Author(s):

Kazushige Okayasu ◽

Kota Yoshida ◽

Masataka Fuchida ◽

Akio Nakamura

Keyword(s):

Deep Learning ◽

Conventional Method ◽

Data Augmentation ◽

Mosquito Species ◽

Experimental Results ◽

Support Vector ◽

Learning Method ◽

Species Classification ◽

Scale Invariant ◽

Machine Method

This study aims to propose a vision-based method to classify mosquito species. To investigate the efficiency of the method, we compared two different classification methods: The handcraft feature-based conventional method and the convolutional neural network-based deep learning method. For the conventional method, 12 types of features were adopted for handcraft feature extraction, while a support vector machine method was adopted for classification. For the deep learning method, three types of architectures were adopted for classification. We built a mosquito image dataset, which included 14,400 images with three types of mosquito species. The dataset comprised 12,000 images for training, 1500 images for testing, and 900 images for validating. Experimental results revealed that the accuracy of the conventional method using the scale-invariant feature transform algorithm was 82.4% at maximum, whereas the accuracy of the deep learning method was 95.5% in a residual network using data augmentation. From the experimental results, deep learning can be considered to be effective for classifying the mosquito species of the proposed dataset. Furthermore, data augmentation improves the accuracy of mosquito species’ classification.

Download Full-text

Data augmentation and transfer learning strategies for reaction prediction in low chemical data regimes

Organic Chemistry Frontiers ◽

10.1039/d0qo01636e ◽

2021 ◽

Author(s):

Yun Zhang ◽

Ling Wang ◽

Xinqiao Wang ◽

Chengyun Zhang ◽

Jiamin Ge ◽

...

Keyword(s):

Organic Chemistry ◽

Deep Learning ◽

Drug Discovery ◽

Research And Development ◽

Learning Strategies ◽

Transfer Learning ◽

Chemical Reactions ◽

Data Augmentation ◽

Learning Method ◽

Reaction Prediction

An effective and rapid deep learning method to predict chemical reactions contributes to the research and development of organic chemistry and drug discovery.

Download Full-text

A deep learning method for extensible microstructural quantification of DP steel enhanced by physical metallurgy-guided data augmentation

Materials Characterization ◽

10.1016/j.matchar.2021.111392 ◽

2021 ◽

pp. 111392

Author(s):

Chunguang Shen ◽

Xiaolu Wei ◽

Chenchong Wang ◽

Wei Xu

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Physical Metallurgy ◽

Learning Method ◽

Dp Steel

Download Full-text

The Development of an Identification Photo Booth System based on a Deep Learning Automatic Image Capturing Method

Journal of Imaging Science and Technology ◽

10.2352/j.imagingsci.technol.2021.65.2.020403 ◽

2020 ◽

Author(s):

Yu-Xiang Zhao ◽

Yi-Zeng Hsieh ◽

Shih-Syun Lin

Keyword(s):

Deep Learning ◽

Experimental Results ◽

Automatic Annotation ◽

Learning Method ◽

Facial Region ◽

Facial Landmarks ◽

Image Capturing ◽

The Face ◽

Facial Contours

With advances in technology, photo booths equipped with automatic capturing systems have gradually replaced the identification (ID) photo service provided by photography studios, thereby enabling consumers to save a considerable amount of time and money. Common automatic capturing systems employ text and voice instructions to guide users in capturing their ID photos; however, the capturing results may not conform to ID photo specifications. To address this issue, this study proposes an ID photo capturing algorithm that can automatically detect facial contours and adjust the size of captured images. The authors adopted a deep learning method (You Only Look Once) to detect the face and applied a semi-automatic annotation technique of facial landmarks to find the lip and chin regions from the facial region. In the experiments, subjects were seated at various distances and heights for testing the performance of the proposed algorithm. The experimental results show that the proposed algorithm can effectively and accurately capture ID photos that satisfy the required specifications.

Download Full-text

Oversampling Based on Data Augmentation in Convolutional Neural Network for Silicon Wafer Defect Classification

Knowledge Innovation Through Intelligent Software Methodologies, Tools and Techniques - Frontiers in Artificial Intelligence and Applications ◽

10.3233/faia200547 ◽

2020 ◽

Author(s):

Uzma Batool ◽

Mohd Ibrahim Shapiai ◽

Nordinah Ismail ◽

Hilman Fauzi ◽

Syahrizal Salleh

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Silicon Wafer ◽

Data Augmentation ◽

Imbalanced Data ◽

Training Data ◽

Defect Classification ◽

Learning Method ◽

Test Set

Silicon wafer defect data collected from fabrication facilities is intrinsically imbalanced because of the variable frequencies of defect types. Frequently occurring types will have more influence on the classification predictions if a model gets trained on such skewed data. A fair classifier for such imbalanced data requires a mechanism to deal with type imbalance in order to avoid biased results. This study has proposed a convolutional neural network for wafer map defect classification, employing oversampling as an imbalance addressing technique. To have an equal participation of all classes in the classifier’s training, data augmentation has been employed, generating more samples in minor classes. The proposed deep learning method has been evaluated on a real wafer map defect dataset and its classification results on the test set returned a 97.91% accuracy. The results were compared with another deep learning based auto-encoder model demonstrating the proposed method, a potential approach for silicon wafer defect classification that needs to be investigated further for its robustness.

Download Full-text

DL-CRISPR: A Deep Learning Method for Off-Target Activity Prediction in CRISPR/Cas9 With Data Augmentation

IEEE Access ◽

10.1109/access.2020.2989454 ◽

2020 ◽

Vol 8 ◽

pp. 76610-76617

Author(s):

Yu Zhang ◽

Yahui Long ◽

Rui Yin ◽

Chee Keong Kwoh

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Learning Method ◽

Activity Prediction ◽

Target Activity

Download Full-text

pDeep3: Towards More Accurate Spectrum Prediction with Fast Few-Shot Learning

10.1101/2020.09.13.295105 ◽

2020 ◽

Author(s):

Ching Tarn ◽

Wen-Feng Zeng ◽

Zhengcong Fei ◽

Si-Min He

Keyword(s):

Deep Learning ◽

Data Quality ◽

Prediction Accuracy ◽

Source Code ◽

Experimental Results ◽

Collision Induced Dissociation ◽

Learning Method ◽

Energy Collision ◽

Spectrum Prediction ◽

The Difference

AbstractSpectrum prediction using deep learning has attracted a lot of attention in recent years. Although existing deep learning methods have dramatically increased the prediction accuracy, there is still considerable space for improvement, which is presently limited by the difference of fragmentation types or instrument settings. In this work, we use the few-shot learning method to fit the data online to make up for the shortcoming. The method is evaluated using ten datasets, where the instruments includes Velos, QE, Lumos, and Sciex, with collision energies being differently set. Experimental results show that few-shot learning can achieve higher prediction accuracy with almost negligible computing resources. For example, on the dataset from a untrained instrument Sciex-6600, within about 10 seconds, the prediction accuracy is increased from 69.7% to 86.4%; on the CID (collision-induced dissociation) dataset, the prediction accuracy of the model trained by HCD (higher energy collision dissociation) spectra is increased from 48.0% to 83.9%. It is also shown that, the method is not critical to data quality and is sufficiently efficient to fill the accuracy gap. The source code of pDeep3 is available at http://pfind.ict.ac.cn/software/pdeep3.

Download Full-text

Deep learning accurately predicts estrogen receptor status in breast cancer metabolomics data

10.1101/214254 ◽

2017 ◽

Author(s):

Fadhl M Alakwaa ◽

Kumardeep Chaudhary ◽

Lana X Garmire

Keyword(s):

Breast Cancer ◽

Machine Learning ◽

Estrogen Receptor ◽

Deep Learning ◽

Support Vector ◽

Integrated Analysis ◽

Learning Method ◽

Learning Methods ◽

Metabolomics Data ◽

Machine Learning Methods

ABSTRACTMetabolomics holds the promise as a new technology to diagnose highly heterogeneous diseases. Conventionally, metabolomics data analysis for diagnosis is done using various statistical and machine learning based classification methods. However, it remains unknown if deep neural network, a class of increasingly popular machine learning methods, is suitable to classify metabolomics data. Here we use a cohort of 271 breast cancer tissues, 204 positive estrogen receptor (ER+) and 67 negative estrogen receptor (ER-), to test the accuracies of autoencoder, a deep learning (DL) framework, as well as six widely used machine learning models, namely Random Forest (RF), Support Vector Machines (SVM), Recursive Partitioning and Regression Trees (RPART), Linear Discriminant Analysis (LDA), Prediction Analysis for Microarrays (PAM), and Generalized Boosted Models (GBM). DL framework has the highest area under the curve (AUC) of 0.93 in classifying ER+/ER-patients, compared to the other six machine learning algorithms. Furthermore, the biological interpretation of the first hidden layer reveals eight commonly enriched significant metabolomics pathways (adjusted P-value<0.05) that cannot be discovered by other machine learning methods. Among them, protein digestion & absorption and ATP-binding cassette (ABC) transporters pathways are also confirmed in integrated analysis between metabolomics and gene expression data in these samples. In summary, deep learning method shows advantages for metabolomics based breast cancer ER status classification, with both the highest prediction accurcy (AUC=0.93) and better revelation of disease biology. We encourage the adoption of autoencoder based deep learning method in the metabolomics research community for classification.

Download Full-text

Sparse coded spatial pyramid matching and multi-kernel integrated SVM for non-linear scene classification

Journal of Electrical Engineering ◽

10.2478/jee-2021-0053 ◽

2021 ◽

Vol 72 (6) ◽

pp. 374-380

Author(s):

Bhavinkumar Gajjar ◽

Hiren Mewada ◽

Ashwin Patani

Keyword(s):

Deep Learning ◽

Support Vector ◽

Scene Classification ◽

Scale Invariant ◽

Training Time ◽

Multi Scale ◽

Non Linear ◽

Pyramid Matching ◽

Scale Invariant Feature ◽

Distinct Features

Abstract Support vector machine (SVM) techniques and deep learning have been prevalent in object classification for many years. However, deep learning is computation-intensive and can require a long training time. SVM is significantly faster than Convolution Neural Network (CNN). However, the SVM has limited its applications in the mid-size dataset as it requires proper tuning. Recently the parameterization of multiple kernels has shown greater flexibility in the characterization of the dataset. Therefore, this paper proposes a sparse coded multi-scale approach to reduce training complexity and tuning of SVM using a non-linear fusion of kernels for large class natural scene classification. The optimum features are obtained by parameterizing the dictionary, Scale Invariant Feature Transform (SIFT) parameters, and fusion of multiple kernels. Experiments were conducted on a large dataset to examine the multi-kernel space capability to find distinct features for better classification. The proposed approach founds to be promising than the linear multi-kernel SVM approaches achieving 91.12 % maximum accuracy.

Download Full-text

Two Birds with One Stone: Series Saliency for Accurate and Interpretable Multivariate Time Series Forecasting

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2021/397 ◽

2021 ◽

Author(s):

Qingyi Pan ◽

Wenbo Hu ◽

Ning Chen

Keyword(s):

Time Series ◽

Deep Learning ◽

Data Augmentation ◽

Multivariate Time Series ◽

Time Series Forecasting ◽

Experimental Results ◽

Learning Methods ◽

Forecasting Accuracy ◽

Sliding Windows ◽

Scheme Of Series

It is important yet challenging to perform accurate and interpretable time series forecasting. Though deep learning methods can boost forecasting accuracy, they often sacrifice interpretability. In this paper, we present a new scheme of series saliency to boost both accuracy and interpretability. By extracting series images from sliding windows of the time series, we design series saliency as a mixup strategy with a learnable mask between the series images and their perturbed versions. Series saliency is model agnostic and performs as an adaptive data augmentation method for training deep models. Moreover, by slightly changing the objective, we optimize series saliency to find a mask for interpretable forecasting in both feature and time dimensions. Experimental results on several real datasets demonstrate that series saliency is effective to produce accurate time-series forecasting results as well as generate temporal interpretations.

Download Full-text

Mapping Paddy Rice Using a Convolutional Neural Network (CNN) with Landsat 8 Datasets in the Dongting Lake Area, China

Remote Sensing ◽

10.3390/rs10111840 ◽

2018 ◽

Vol 10 (11) ◽

pp. 1840 ◽

Cited By ~ 21

Author(s):

Meng Zhang ◽

Hui Lin ◽

Guangxing Wang ◽

Hua Sun ◽

Jing Fu

Keyword(s):

Neural Network ◽

Deep Learning ◽

Convolutional Neural Network ◽

Land Surface ◽

Single Channel ◽

Paddy Rice ◽

Support Vector ◽

Landsat 8 ◽

Lake Area ◽

Machine Method

Rice is one of the world’s major staple foods, especially in China. Highly accurate monitoring on rice-producing land is, therefore, crucial for assessing food supplies and productivity. Recently, the deep-learning convolutional neural network (CNN) has achieved considerable success in remote-sensing data analysis. A CNN-based paddy-rice mapping method using the multitemporal Landsat 8, phenology data, and land-surface temperature (LST) was developed during this study. First, the spatial–temporal adaptive reflectance fusion model (STARFM) was used to blend the moderate-resolution imaging spectroradiometer (MODIS) and Landsat data for obtaining multitemporal Landsat-like data. Subsequently, the threshold method is applied to derive the phenological variables from the Landsat-like (Normalized difference vegetation index) NDVI time series. Then, a generalized single-channel algorithm was employed to derive LST from the Landsat 8. Finally, multitemporal Landsat 8 spectral images, combined with phenology and LST data, were employed to extract paddy-rice information using a patch-based deep-learning CNN algorithm. The results show that the proposed method achieved an overall accuracy of 97.06% and a Kappa coefficient of 0.91, which are 6.43% and 0.07 higher than that of the support vector machine method, and 7.68% and 0.09 higher than that of the random forest method, respectively. Moreover, the Landsat-derived rice area is strongly correlated (R2 = 0.9945) with government statistical data, demonstrating that the proposed method has potential in large-scale paddy-rice mapping using moderate spatial resolution images.

Download Full-text