1,2-Chlorine atom migration in 3-chloro-2-butyl radicals: a computational studyElectronic supplementary information (ESI) available: Calculated energies and enthalpies of stationary points, full structural information, and structures of trans-but-2-ene (9) and trans-but-2-ene radical (10). See http://www.rsc.org/suppdata/ob/b2/b209981k/

Bernd Neumann; Hendrik Zipse

doi:10.1039/b209981k

MDIPA: a microRNA–drug interaction prediction approach based on non-negative matrix factorization

Bioinformatics ◽

10.1093/bioinformatics/btaa577 ◽

2020 ◽

Vol 36 (20) ◽

pp. 5061-5067

Author(s):

Ali Akbar Jamali ◽

Anthony Kusalik ◽

Fang-Xiang Wu

Keyword(s):

Drug Interaction ◽

Drug Interactions ◽

Matrix Factorization ◽

Structural Information ◽

Superior Performance ◽

Supplementary Information ◽

Similarity Matrix ◽

Interaction Prediction ◽

Prediction Approach ◽

Drug Similarity

Abstract Motivation Evidence has shown that microRNAs, one type of small biomolecule, regulate the expression level of genes and play an important role in the development or treatment of diseases. Drugs, as important chemical compounds, can interact with microRNAs and change their functions. The experimental identification of microRNA–drug interactions is time-consuming and expensive. Therefore, it is appealing to develop effective computational approaches for predicting microRNA–drug interactions. Results In this study, a matrix factorization-based method, called the microRNA–drug interaction prediction approach (MDIPA), is proposed for predicting unknown interactions among microRNAs and drugs. Specifically, MDIPA utilizes experimentally validated interactions between drugs and microRNAs, drug similarity and microRNA similarity to predict undiscovered interactions. A path-based microRNA similarity matrix is constructed, while the structural information of drugs is used to establish a drug similarity matrix. To evaluate its performance, our MDIPA is compared with four state-of-the-art prediction methods with an independent dataset and cross-validation. The results of both evaluation methods confirm the superior performance of MDIPA over other methods. Finally, the results of molecular docking in a case study with breast cancer confirm the efficacy of our approach. In conclusion, MDIPA can be effective in predicting potential microRNA–drug interactions. Availability and implementation All code and data are freely available from https://github.com/AliJam82/MDIPA. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

VarMap: a web tool for mapping genomic coordinates to protein sequence and structure and retrieving protein structural annotations

Bioinformatics ◽

10.1093/bioinformatics/btz482 ◽

2019 ◽

Vol 35 (22) ◽

pp. 4854-4856 ◽

Cited By ~ 8

Author(s):

James D Stephenson ◽

Roman A Laskowski ◽

Andrew Nightingale ◽

Matthew E Hurles ◽

Janet M Thornton

Keyword(s):

Protein Sequence ◽

Structural Information ◽

Protein Structures ◽

Supplementary Information ◽

Supplementary Data ◽

Web Tool ◽

Genomic Variants ◽

Structural Context ◽

Pathogenic Variants ◽

Transcript Evidence

Abstract Motivation Understanding the protein structural context and patterning on proteins of genomic variants can help to separate benign from pathogenic variants and reveal molecular consequences. However, mapping genomic coordinates to protein structures is non-trivial, complicated by alternative splicing and transcript evidence. Results Here we present VarMap, a web tool for mapping a list of chromosome coordinates to canonical UniProt sequences and associated protein 3D structures, including validation checks, and annotating them with structural information. Availability and implementation https://www.ebi.ac.uk/thornton-srv/databases/VarMap. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

SphereCon—a method for precise estimation of residue relative solvent accessible area from limited structural information

Bioinformatics ◽

10.1093/bioinformatics/btaa159 ◽

2020 ◽

Vol 36 (11) ◽

pp. 3372-3378

Author(s):

Alexander Gress ◽

Olga V Kalinina

Keyword(s):

Protein Function ◽

Structural Information ◽

Solvent Accessibility ◽

Three Dimensional ◽

Structural Data ◽

Supplementary Information ◽

Dimensional Structure ◽

Relative Solvent Accessibility ◽

Precise Measure ◽

The Impact

Abstract Motivation In proteins, solvent accessibility of individual residues is a factor contributing to their importance for protein function and stability. Hence one might wish to calculate solvent accessibility in order to predict the impact of mutations, their pathogenicity and for other biomedical applications. A direct computation of solvent accessibility is only possible if all atoms of a protein three-dimensional structure are reliably resolved. Results We present SphereCon, a new precise measure that can estimate residue relative solvent accessibility (RSA) from limited data. The measure is based on calculating the volume of intersection of a sphere with a cone cut out in the direction opposite of the residue with surrounding atoms. We propose a method for estimating the position and volume of residue atoms in cases when they are not known from the structure, or when the structural data are unreliable or missing. We show that in cases of reliable input structures, SphereCon correlates almost perfectly with the directly computed RSA, and outperforms other previously suggested indirect methods. Moreover, SphereCon is the only measure that yields accurate results when the identities of amino acids are unknown. A significant novel feature of SphereCon is that it can estimate RSA from inter-residue distance and contact matrices, without any information about the actual atom coordinates. Availability and implementation https://github.com/kalininalab/spherecon. Contact [email protected] Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Experimental and theoretical studies of gas phase NO3 and OH radical reactions with formaldehyde, acetaldehyde and their isotopomersElectronic supplementary information (ESI) available: program package FACSIMILE for simulation of the complex set of reactions and the kinetics of the reactor system. Kinetic data from the acetaldehyde studies, the spectral ranges and the compounds included in the subtraction procedures, and a full report of the statistical analyses. Calculated energies of the reactants, intermediate stationary points of the potential energy surface, and the products. Calculated vibrational wavenumbers of the reactants and transition states. See http://www.rsc.org/suppdata/cp/b2/b211234p/

Physical Chemistry Chemical Physics ◽

10.1039/b211234p ◽

2003 ◽

Vol 5 (9) ◽

pp. 1790-1805 ◽

Cited By ~ 48

Author(s):

Barbara D'Anna ◽

Vebjørn Bakken ◽

Jon Are Beukes ◽

Claus J. Nielsen ◽

Katarzyna Brudnik ◽

...

Keyword(s):

Gas Phase ◽

Energy Surface ◽

Stationary Points ◽

Supplementary Information ◽

Theoretical Studies ◽

Oh Radical ◽

Spectral Ranges ◽

Kinetics Of ◽

Experimental And Theoretical Studies ◽

Vibrational Wavenumbers

Download Full-text

Streamlined use of protein structures in variant analysis

10.1101/2021.09.10.459756 ◽

2021 ◽

Author(s):

Sandeep Kaur ◽

Neblina Sikta ◽

Andrea Schafferhans ◽

Nicola Bordin ◽

Mark J. Cowley ◽

...

Keyword(s):

Protein Function ◽

Molecular Mechanisms ◽

Structural Information ◽

Protein Structures ◽

Structural Data ◽

Supplementary Information ◽

3D Structures ◽

Link Type ◽

Variant Analysis ◽

Many Sources

AbstractMotivationVariant analysis is a core task in bioinformatics that requires integrating data from many sources. This process can be helped by using 3D structures of proteins, which can provide a spatial context that can provide insight into how variants affect function. Many available tools can help with mapping variants onto structures; but each has specific restrictions, with the result that many researchers fail to benefit from valuable insights that could be gained from structural data.ResultsTo address this, we have created a streamlined system for incorporating 3D structures into variant analysis. Variants can be easily specified via URLs that are easily readable and writable, and use the notation recommended by the Human Genome Variation Society (HGVS). For example, ‘https://aquaria.app/SARS-CoV-2/S/?N501Y’ specifies the N501Y variant of SARS-CoV-2 S protein. In addition to mapping variants onto structures, our system provides summary information from multiple external resources, including COSMIC, CATH-FunVar, and PredictProtein. Furthermore, our system identifies and summarizes structures containing the variant, as well as the variant-position. Our system supports essentially any mutation for any well-studied protein, and uses all available structural data — including models inferred via very remote homology — integrated into a system that is fast and simple to use. By giving researchers easy, streamlined access to a wealth of structural information during variant analysis, our system will help in revealing novel insights into the molecular mechanisms underlying protein function in health and disease.AvailabilityOur resource is freely available at the project home page (https://aquaria.app). After peer review, the code will be openly available via a GPL version 2 license at https://github.com/ODonoghueLab/Aquaria. PSSH2, the database of sequence-to-structure alignments, is also freely available for download at https://zenodo.org/record/[email protected] informationNone.

Download Full-text

Analysis of several key factors influencing deep learning-based inter-residue contact prediction

Bioinformatics ◽

10.1093/bioinformatics/btz679 ◽

2019 ◽

Cited By ~ 1

Author(s):

Tianqi Wu ◽

Jie Hou ◽

Badri Adhikari ◽

Jianlin Cheng

Keyword(s):

Deep Learning ◽

Structural Information ◽

Protein Structures ◽

Supplementary Information ◽

Prediction Methods ◽

Key Factors ◽

Sequence Alignments ◽

Multiple Sequence ◽

Contact Prediction ◽

Ab Initio Approach

Abstract Motivation Deep learning has become the dominant technology for protein contact prediction. However, the factors that affect the performance of deep learning in contact prediction have not been systematically investigated. Results We analyzed the results of our three deep learning-based contact prediction methods (MULTICOM-CLUSTER, MULTICOM-CONSTRUCT and MULTICOM-NOVEL) in the CASP13 experiment and identified several key factors [i.e. deep learning technique, multiple sequence alignment (MSA), distance distribution prediction and domain-based contact integration] that influenced the contact prediction accuracy. We compared our convolutional neural network (CNN)-based contact prediction methods with three coevolution-based methods on 75 CASP13 targets consisting of 108 domains. We demonstrated that the CNN-based multi-distance approach was able to leverage global coevolutionary coupling patterns comprised of multiple correlated contacts for more accurate contact prediction than the local coevolution-based methods, leading to a substantial increase of precision by 19.2 percentage points. We also tested different alignment methods and domain-based contact prediction with the deep learning contact predictors. The comparison of the three methods showed deeper sequence alignments and the integration of domain-based contact prediction with the full-length contact prediction improved the performance of contact prediction. Moreover, we demonstrated that the domain-based contact prediction based on a novel ab initio approach of parsing domains from MSAs alone without using known protein structures was a simple, fast approach to improve contact prediction. Finally, we showed that predicting the distribution of inter-residue distances in multiple distance intervals could capture more structural information and improve binary contact prediction. Availability and implementation https://github.com/multicom-toolbox/DNCON2/. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

The influence of hydrogen bonding interactions on the C–H bond activation step in class I ribonucleotide reductasesElectronic supplementary information (ESI) available: Energies and structures of stationary points for studied reactions. See http://www.rsc.org/suppdata/ob/b2/b210536p/

Organic & Biomolecular Chemistry ◽

10.1039/b210536p ◽

2003 ◽

Vol 1 (4) ◽

pp. 692-699 ◽

Cited By ~ 15

Author(s):

Hendrik Zipse

Keyword(s):

Hydrogen Bonding ◽

Bond Activation ◽

Class I ◽

Stationary Points ◽

Supplementary Information ◽

Hydrogen Bonding Interactions

Download Full-text

Fold recognition by scoring protein maps using the congruence coefficient

Bioinformatics ◽

10.1093/bioinformatics/btaa833 ◽

2020 ◽

Author(s):

Pietro Di Lena ◽

Pierre Baldi

Keyword(s):

Structure Prediction ◽

Structural Information ◽

Statistical Significance ◽

Fold Recognition ◽

Homology Search ◽

Supplementary Information ◽

Map Comparison ◽

Contact Distance ◽

Congruence Coefficient ◽

Mean And Variance

Abstract Motivation Protein fold recognition is a key step for template-based modeling approaches to protein structure prediction. Although closely related folds can be easily identified by sequence homology search in sequence databases, fold recognition is notoriously more difficult when it involves the identification of distantly related homologs. Recent progress in residue–residue contact and distance prediction opens up the possibility of improving fold recognition by using structural information contained in predicted distance and contact maps. Results Here we propose to use the congruence coefficient as a metric of similarity between maps. We prove that this metric has several interesting mathematical properties which allow one to compute in polynomial time its exact mean and variance over all possible (exponentially many) alignments between two symmetric matrices, and assess the statistical significance of similarity between aligned maps. We perform fold recognition tests by recovering predicted target contact/distance maps from the two most recent Critical Assessment of Structure Prediction editions and over 27 000 non-homologous structural templates from the ECOD database. On this large benchmark, we compare fold recognition performances of different alignment tools with their own similarity scores against those obtained using the congruence coefficient. We show that the congruence coefficient overall improves fold recognition over other methods, proving its effectiveness as a general similarity metric for protein map comparison. Availability and implementation The congruence coefficient software CCpro is available as part of the SCRATCH suite at: http://scratch.proteomics.ics.uci.edu/. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Direct measurement of the energy barrier for 1,2-chlorine atom migration in ?-methyl-?-chlorobenzyl(chloro)crbene

Journal of Physical Organic Chemistry ◽

10.1002/poc.610050510 ◽

1992 ◽

Vol 5 (5) ◽

pp. 285-286 ◽

Cited By ~ 7

Author(s):

Michael T. H. Liu ◽

John E. Chateauneuf

Keyword(s):

Chlorine Atom ◽

Direct Measurement ◽

Energy Barrier ◽

Atom Migration

Download Full-text

Matrix isolation FTIR and molecular orbital study of E and Z acetaldoxime monomersElectronic supplementary information (ESI) available: Internal coordinates for the Z and E isomers, calculated barriers for the rotation of methyl groups and the relative energies of the relevant stationary points for acetaldoxime. See http://www.rsc.org/suppdata/cp/b2/b203240f/

Physical Chemistry Chemical Physics ◽

10.1039/b203240f ◽

2002 ◽

Vol 4 (14) ◽

pp. 3289-3296 ◽

Cited By ~ 25

Author(s):

Agnieszka Andrzejewska ◽

Leszek Lapinski ◽

Igor Reva ◽

Rui Fausto

Keyword(s):

Molecular Orbital ◽

Matrix Isolation ◽

Stationary Points ◽

Supplementary Information ◽

Methyl Groups ◽

Molecular Orbital Study ◽

Internal Coordinates ◽

Orbital Study

Download Full-text