Analysis and prediction of functional sub-types from protein sequence alignments

2000 ◽  
Vol 303 (1) ◽  
pp. 61-76 ◽  
Author(s):  
Sridhar S. Hannenhalli ◽  
Robert B. Russell
Author(s):  
Geraldine Buysschaert ◽  
Kenneth Verstraete ◽  
Savvas N. Savvides ◽  
Bjorn Vergauwen

Short-chain dehydrogenases/reductases (SDRs) are a rapidly expanding superfamily of enzymes that are found in all kingdoms of life. Hallmarked by a highly conserved Asn-Ser-Tyr-Lys catalytic tetrad, SDRs have a broad substrate spectrum and play diverse roles in key metabolic processes. Locus tag VVA1599 inVibrio vulnificusencodes a short-chain dehydrogenase (hereafter referred to as SDRvv) which lacks the signature catalytic tetrad of SDR members. Structure-based protein sequence alignments have suggested that SDRvv may harbour a unique binding site for its nicotinamide cofactor. To date, structural studies of SDRs with altered catalytic centres are underrepresented in the scientific literature, thus limiting understanding of their spectrum of substrate and cofactor preferences. Here, the expression, purification and crystallization of recombinant SDRvv are presented. Two well diffracting crystal forms could be obtained by cocrystallization in the presence of the reduced form of the phosphorylated nicotinamide cofactor NADPH. The collected data were of sufficient quality for successful structure determination by molecular replacement and subsequent refinement. This work sets the stage for deriving the identity of the natural substrate of SDRvv and the structure–function landscape of typical and atypical SDRs.


2007 ◽  
Vol 35 (Web Server) ◽  
pp. W649-W652 ◽  
Author(s):  
J. Pei ◽  
B.-H. Kim ◽  
M. Tang ◽  
N. V. Grishin

2015 ◽  
Vol 13 (05) ◽  
pp. 1550028 ◽  
Author(s):  
Westley Arthur Sherman ◽  
Durga Bhavani Kuchibhatla ◽  
Vachiranee Limviphuvadh ◽  
Sebastian Maurer-Stroh ◽  
Birgit Eisenhaber ◽  
...  

Next-generation sequencing advances are rapidly expanding the number of human mutations to be analyzed for causative roles in genetic disorders. Our Human Protein Mutation Viewer (HPMV) is intended to explore the biomolecular mechanistic significance of non-synonymous human mutations in protein-coding genomic regions. The tool helps to assess whether protein mutations affect the occurrence of sequence-architectural features (globular domains, targeting signals, post-translational modification sites, etc.). As input, HPMV accepts protein mutations — as UniProt accessions with mutations (e.g. HGVS nomenclature), genome coordinates, or FASTA sequences. As output, HPMV provides an interactive cartoon showing the mutations in relation to elements of the sequence architecture. A large variety of protein sequence architectural features were selected for their particular relevance to mutation interpretation. Clicking a sequence feature in the cartoon expands a tree view of additional information including multiple sequence alignments of conserved domains and a simple 3D viewer mapping the mutation to known PDB structures, if available. The cartoon is also correlated with a multiple sequence alignment of similar sequences from other organisms. In cases where a mutation is likely to have a straightforward interpretation (e.g. a point mutation disrupting a well-understood targeting signal), this interpretation is suggested. The interactive cartoon can be downloaded as standalone viewer in Java jar format to be saved and viewed later with only a standard Java runtime environment. The HPMV website is: http://hpmv.bii.a-star.edu.sg/ .


2015 ◽  
Vol 112 (22) ◽  
pp. 7003-7008 ◽  
Author(s):  
Jing Tong ◽  
Ruslan I. Sadreyev ◽  
Jimin Pei ◽  
Lisa N. Kinch ◽  
Nick V. Grishin

Inference of homology from protein sequences provides an essential tool for analyzing protein structure, function, and evolution. Current sequence-based homology search methods are still unable to detect many similarities evident from protein spatial structures. In computer science a search engine can be improved by considering networks of known relationships within the search database. Here, we apply this idea to protein-sequence–based homology search and show that it dramatically enhances the search accuracy. Our new method, COMPADRE (COmparison of Multiple Protein sequence Alignments using Database RElationships) assesses the relationship between the query sequence and a hit in the database by considering the similarity between the query and hit’s known homologs. This approach increases detection quality, boosting the precision rate from 18% to 83% at half-coverage of all database homologs. The increased precision rate allows detection of a large fraction of protein structural relationships, thus providing structure and function predictions for previously uncharacterized proteins. Our results suggest that this general approach is applicable to a wide variety of methods for detection of biological similarities. The web server is available at prodata.swmed.edu/compadre.


2002 ◽  
Vol 18 (2) ◽  
pp. 306-314 ◽  
Author(s):  
M. Cline ◽  
R. Hughey ◽  
K. Karplus

Author(s):  
Mu Gao ◽  
Jeffrey Skolnick

Abstract Motivation From evolutionary interference, function annotation to structural prediction, protein sequence comparison has provided crucial biological insights. While many sequence alignment algorithms have been developed, existing approaches often cannot detect hidden structural relationships in the ‘twilight zone’ of low sequence identity. To address this critical problem, we introduce a computational algorithm that performs protein Sequence Alignments from deep-Learning of Structural Alignments (SAdLSA, silent ‘d’). The key idea is to implicitly learn the protein folding code from many thousands of structural alignments using experimentally determined protein structures. Results To demonstrate that the folding code was learned, we first show that SAdLSA trained on pure α-helical proteins successfully recognizes pairs of structurally related pure β-sheet protein domains. Subsequent training and benchmarking on larger, highly challenging datasets show significant improvement over established approaches. For challenging cases, SAdLSA is ∼150% better than HHsearch for generating pairwise alignments and ∼50% better for identifying the proteins with the best alignments in a sequence library. The time complexity of SAdLSA is O(N) thanks to GPU acceleration. Availability and implementation Datasets and source codes of SAdLSA are available free of charge for academic users at http://sites.gatech.edu/cssb/sadlsa/. Contact [email protected] or [email protected] Supplementary information Supplementary data are available at Bioinformatics online.


Sign in / Sign up

Export Citation Format

Share Document