Convolutional Neural Network Based Approach to In Silico Non-Anticipating Prediction of Antigenic Distance for Influenza Virus

Majid Forghani; Michael Khachay

doi:10.3390/v12091019

Convolutional Neural Network Based Approach to In Silico Non-Anticipating Prediction of Antigenic Distance for Influenza Virus

Viruses ◽

10.3390/v12091019 ◽

2020 ◽

Vol 12 (9) ◽

pp. 1019

Author(s):

Majid Forghani ◽

Michael Khachay

Keyword(s):

Influenza Virus ◽

Protein Sequences ◽

Ground Truth ◽

Novel Approach ◽

Feature Spaces ◽

Hemagglutinin Protein ◽

Distance Approximation ◽

Immunological Assays ◽

Antigenic Cartography ◽

Physical And Chemical

Evaluation of the antigenic similarity degree between the strains of the influenza virus is highly important for vaccine production. The conventional method used to measure such a degree is related to performing the immunological assays of hemagglutinin inhibition. Namely, the antigenic distance between two strains is calculated on the basis of HI assays. Usually, such distances are visualized by using some kind of antigenic cartography method. The known drawback of the HI assay is that it is rather time-consuming and expensive. In this paper, we propose a novel approach for antigenic distance approximation based on deep learning in the feature spaces induced by hemagglutinin protein sequences and Convolutional Neural Networks (CNNs). To apply a CNN to compare the protein sequences, we utilize the encoding based on the physical and chemical characteristics of amino acids. By varying (hyper)parameters of the CNN architecture design, we find the most robust network. Further, we provide insight into the relationship between approximated antigenic distance and antigenicity by evaluating the network on the HI assay database for the H1N1 subtype. The results indicate that the best-trained network gives a high-precision approximation for the ground-truth antigenic distances, and can be used as a good exploratory tool in practical tasks.

Download Full-text

A Study on Host Tropism Determinants of Influenza Virus Using Machine Learning

Current Bioinformatics ◽

10.2174/1574893614666191104160927 ◽

2020 ◽

Vol 15 (2) ◽

pp. 121-134 ◽

Cited By ~ 2

Author(s):

Eunmi Kwon ◽

Myeongji Cho ◽

Hayeon Kim ◽

Hyeon S. Son

Keyword(s):

Machine Learning ◽

Amino Acids ◽

Influenza Virus ◽

Random Forest ◽

Physicochemical Properties ◽

Protein Sequences ◽

Influenza Viruses ◽

Host Tropism ◽

Post Hoc ◽

Ha Protein

Background: The host tropism determinants of influenza virus, which cause changes in the host range and increase the likelihood of interaction with specific hosts, are critical for understanding the infection and propagation of the virus in diverse host species. Methods: Six types of protein sequences of influenza viral strains isolated from three classes of hosts (avian, human, and swine) were obtained. Random forest, naïve Bayes classification, and knearest neighbor algorithms were used for host classification. The Java language was used for sequence analysis programming and identifying host-specific position markers. Results: A machine learning technique was explored to derive the physicochemical properties of amino acids used in host classification and prediction. HA protein was found to play the most important role in determining host tropism of the influenza virus, and the random forest method yielded the highest accuracy in host prediction. Conserved amino acids that exhibited host-specific differences were also selected and verified, and they were found to be useful position markers for host classification. Finally, ANOVA analysis and post-hoc testing revealed that the physicochemical properties of amino acids, comprising protein sequences combined with position markers, differed significantly among hosts. Conclusion: The host tropism determinants and position markers described in this study can be used in related research to classify, identify, and predict the hosts of influenza viruses that are currently susceptible or likely to be infected in the future.

Download Full-text

Reactivity of human convalescent sera with influenza virus hemagglutinin protein mutants at antigenic site A

Microbiology and Immunology ◽

10.1111/j.1348-0421.2012.00412.x ◽

2012 ◽

Vol 56 (2) ◽

pp. 99-106 ◽

Cited By ~ 5

Author(s):

Eri Nobusawa ◽

Katsumi Omagari ◽

Setsuko Nakajima ◽

Katsuhisa Nakajima

Keyword(s):

Influenza Virus ◽

Antigenic Site ◽

Influenza Virus Hemagglutinin ◽

Protein Mutants ◽

Hemagglutinin Protein

Download Full-text

The cleavage of the hemagglutinin protein of H5N2 avian influenza virus in yeast

Journal of Virological Methods ◽

10.1016/j.jviromet.2007.07.012 ◽

2007 ◽

Vol 146 (1-2) ◽

pp. 293-297 ◽

Cited By ~ 12

Author(s):

Chi Y. Wang ◽

Yu L. Luo ◽

Yu T. Chen ◽

Shu K. Li ◽

Chi H. Lin ◽

...

Keyword(s):

Influenza Virus ◽

Avian Influenza ◽

Avian Influenza Virus ◽

Hemagglutinin Protein

Download Full-text

Practical example for use of the supervised vicarious calibration (SVC) method on multisource hyperspectral imagery data – ValCalHyp airborne hyperspectral campaign under the EUFAR framework

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprsarchives-xl-7-51-2014 ◽

2014 ◽

Vol XL-7 ◽

pp. 51-53

Author(s):

A. Brook ◽

E. Ben Dor

Keyword(s):

Atmospheric Correction ◽

Ground Truth ◽

Critical Issue ◽

Radiometric Calibration ◽

Vicarious Calibration ◽

Flight Direction ◽

Novel Approach ◽

Novel Method ◽

General Goal ◽

Cross Calibration

A novel approach for radiometric calibration and atmospheric correction of airborne hyperspectral (HRS) data, termed supervised vicarious calibration (SVC) was proposed by Brook and Ben-Dor in 2010. The present study was aimed at validating this SVC approach by simultaneously using several different airborne HSR sensors that acquired HSR data over several selected sites at the same time. The general goal of this study was to apply a cross-calibration approach to examine the capability and stability of the SVC method and to examine its validity. This paper reports the result of the multi sensors campaign took place over Salon de Provenance, France on behalf of the ValCalHyp project took place in 2011. The SVC method enabled the rectification of the radiometric drift of each sensor and improves their performance significantly. The flight direction of the SVC targets was found to be a critical issue for such correction and recommendations have been set for future utilization of this novel method. The results of the SVC method were examined by comparing ground-truth spectra of several selected validation targets with the image spectra as well as by comparing the classified water quality images generated from all sensors over selected water bodies.

Download Full-text

COMPARING VIRUS CLASSIFICATION USING GENOMIC MATERIALS ACCORDING TO DIFFERENT TAXONOMIC LEVELS

Journal of Bioinformatics and Computational Biology ◽

10.1142/s0219720013430038 ◽

2013 ◽

Vol 11 (06) ◽

pp. 1343003 ◽

Cited By ~ 4

Author(s):

JING-DOO WANG

Keyword(s):

Molecular Biology ◽

Dna Sequences ◽

Protein Sequences ◽

Experimental Results ◽

Virus Classification ◽

Taxonomic Level ◽

Biological Classification ◽

Novel Approach

In this paper, three genomic materials — DNA sequences, protein sequences, and regions (domains) are used to compare methods of virus classification. Virus classes (categories) are divided by various taxonomic level of virus into three datasets for 6 order, 42 family, and 33 genera. To increase the robustness and comparability of experimental results of virus classification, the classes are selected that contain at least 10 instances, and meanwhile each instance contains at least one region name. Experimental results show that the approach using region names achieved the best accuracies — reaching 99.9%, 97.3%, and 99.0% for 6 orders, 42 families, and 33 genera, respectively. This paper not only involves exhaustive experiments that compare virus classifications using different genomic materials, but also proposes a novel approach to biological classification based on molecular biology instead of traditional morphology.

Download Full-text

Toward Development of a Vocal Fold Contact Pressure Probe: Sensor Characterization and Validation Using Synthetic Vocal Fold Models

Applied Sciences ◽

10.3390/app9153002 ◽

2019 ◽

Vol 9 (15) ◽

pp. 3002 ◽

Cited By ~ 6

Author(s):

Mohsen Motie-Shirazi ◽

Matías Zañartu ◽

Sean D. Peterson ◽

Daryush D. Mehta ◽

James B. Kobler ◽

...

Keyword(s):

Contact Pressure ◽

Vocal Fold ◽

Ground Truth ◽

Pressure Probe ◽

Primary Role ◽

Novel Approach ◽

Sensor Position ◽

Probe Sensor ◽

Contact Pressures

Excessive vocal fold collision pressures during phonation are considered to play a primary role in the formation of benign vocal fold lesions, such as nodules. The ability to accurately and reliably acquire intraglottal pressure has the potential to provide unique insights into the pathophysiology of phonotrauma. Difficulties arise, however, in directly measuring vocal fold contact pressures due to physical intrusion from the sensor that may disrupt the contact mechanics, as well as difficulty in determining probe/sensor position relative to the contact location. These issues are quantified and addressed through the implementation of a novel approach for identifying the timing and location of vocal fold contact, and measuring intraglottal and vocal fold contact pressures via a pressure probe embedded in the wall of a hemi-laryngeal flow facility. The accuracy and sensitivity of the pressure measurements are validated against ground truth values. Application to in vivo approaches are assessed by acquiring intraglottal and VF contact pressures using a synthetic, self-oscillating vocal fold model in a hemi-laryngeal configuration, where the sensitivity of the measured intraglottal and vocal fold contact pressure relative to the sensor position is explored.

Download Full-text

A recombinant avian paramyxovirus serotype 3 expressing the hemagglutinin protein protects chickens against H5N1 highly pathogenic avian influenza virus challenge

Scientific Reports ◽

10.1038/s41598-020-59124-x ◽

2020 ◽

Vol 10 (1) ◽

Cited By ~ 2

Author(s):

Edris Shirvani ◽

Berin P. Varghese ◽

Anandan Paldurai ◽

Siba K. Samal

Keyword(s):

Influenza Virus ◽

Avian Influenza ◽

Avian Influenza Virus ◽

Highly Pathogenic Avian Influenza ◽

Avian Paramyxovirus ◽

Highly Pathogenic ◽

Virus Challenge ◽

Pathogenic Avian Influenza ◽

Hemagglutinin Protein ◽

Pathogenic Avian Influenza Virus

Download Full-text

The S190R mutation in the hemagglutinin protein of pandemic H1N1 2009 influenza virus increased its pathogenicity in mice

Science China Life Sciences ◽

10.1007/s11427-017-9156-1 ◽

2018 ◽

Vol 61 (7) ◽

pp. 836-843 ◽

Cited By ~ 1

Author(s):

Yongkun Chen ◽

Tian Bai ◽

Wenfei Zhu ◽

Rongbao Gao ◽

Zhihong Deng ◽

...

Keyword(s):

Influenza Virus ◽

Pandemic H1n1 ◽

Hemagglutinin Protein ◽

Pandemic H1n1 2009

Download Full-text

Soluble Recombinant Hemagglutinin Protein of H1N1pdm09 Influenza Virus Elicits Cross-Protection Against a Lethal H5N1 Challenge in Mice

Frontiers in Microbiology ◽

10.3389/fmicb.2019.02031 ◽

2019 ◽

Vol 10 ◽

Cited By ~ 3

Author(s):

Shinya Yamada ◽

Atsuhiro Yasuhara ◽

Yoshihiro Kawaoka

Keyword(s):

Influenza Virus ◽

Cross Protection ◽

Hemagglutinin Protein ◽

Recombinant Hemagglutinin

Download Full-text

Learning Compositional Representations of Interacting Systems with Restricted Boltzmann Machines: Comparative Study of Lattice Proteins

Neural Computation ◽

10.1162/neco_a_01210 ◽

2019 ◽

Vol 31 (8) ◽

pp. 1671-1717 ◽

Cited By ~ 1

Author(s):

Jérôme Tubiana ◽

Simona Cocco ◽

Rémi Monasson

Keyword(s):

Graphical Model ◽

A Priori ◽

Protein Sequences ◽

Ground Truth ◽

Representation Learning ◽

Statistical Features ◽

Restricted Boltzmann Machines ◽

Interacting Systems ◽

Hidden Layer ◽

Stochastic Mapping

A restricted Boltzmann machine (RBM) is an unsupervised machine learning bipartite graphical model that jointly learns a probability distribution over data and extracts their relevant statistical features. RBMs were recently proposed for characterizing the patterns of coevolution between amino acids in protein sequences and for designing new sequences. Here, we study how the nature of the features learned by RBM changes with its defining parameters, such as the dimensionality of the representations (size of the hidden layer) and the sparsity of the features. We show that for adequate values of these parameters, RBMs operate in a so-called compositional phase in which visible configurations sampled from the RBM are obtained by recombining these features. We then compare the performance of RBM with other standard representation learning algorithms, including principal or independent component analysis (PCA, ICA), autoencoders (AE), variational autoencoders (VAE), and their sparse variants. We show that RBMs, due to the stochastic mapping between data configurations and representations, better capture the underlying interactions in the system and are significantly more robust with respect to sample size than deterministic methods such as PCA or ICA. In addition, this stochastic mapping is not prescribed a priori as in VAE, but learned from data, which allows RBMs to show good performance even with shallow architectures. All numerical results are illustrated on synthetic lattice protein data that share similar statistical features with real protein sequences and for which ground-truth interactions are known.

Download Full-text