RaacLogo: a new sequence logo generator by using reduced amino acid clusters

Author(s):  
Lei Zheng ◽  
Dongyang Liu ◽  
Wuritu Yang ◽  
Lei Yang ◽  
Yongchun Zuo

Abstract Sequence logos give a fast and concise display in visualizing consensus sequence. Protein exhibits greater complexity and diversity than DNA, which usually affects the graphical representation of the logo. Reduced amino acids perform powerful ability for simplifying complexity of sequence alignment, which motivated us to establish RaacLogo. As a new sequence logo generator by using reduced amino acid alphabets, RaacLogo can easily generate many different simplified logos tailored to users by selecting various reduced amino acid alphabets that consisted of more than 40 clustering algorithms. This current web server provides 74 types of reduced amino acid alphabet, which were manually extracted to generate 673 reduced amino acid clusters (RAACs) for dealing with protein alignment. A two-dimensional selector was proposed for easily selecting desired RAACs with underlying biology knowledge. It is anticipated that the RaacLogo web server will play more high-potential roles for protein sequence alignment, topological estimation and protein design experiments. RaacLogo is freely available at http://bioinfor.imu.edu.cn/raaclogo.

2020 ◽  
Vol 17 (1) ◽  
pp. 59-77
Author(s):  
Anand Kumar Nelapati ◽  
JagadeeshBabu PonnanEttiyappan

Background:Hyperuricemia and gout are the conditions, which is a response of accumulation of uric acid in the blood and urine. Uric acid is the product of purine metabolic pathway in humans. Uricase is a therapeutic enzyme that can enzymatically reduces the concentration of uric acid in serum and urine into more a soluble allantoin. Uricases are widely available in several sources like bacteria, fungi, yeast, plants and animals.Objective:The present study is aimed at elucidating the structure and physiochemical properties of uricase by insilico analysis.Methods:A total number of sixty amino acid sequences of uricase belongs to different sources were obtained from NCBI and different analysis like Multiple Sequence Alignment (MSA), homology search, phylogenetic relation, motif search, domain architecture and physiochemical properties including pI, EC, Ai, Ii, and were performed.Results:Multiple sequence alignment of all the selected protein sequences has exhibited distinct difference between bacterial, fungal, plant and animal sources based on the position-specific existence of conserved amino acid residues. The maximum homology of all the selected protein sequences is between 51-388. In singular category, homology is between 16-337 for bacterial uricase, 14-339 for fungal uricase, 12-317 for plants uricase, and 37-361 for animals uricase. The phylogenetic tree constructed based on the amino acid sequences disclosed clusters indicating that uricase is from different source. The physiochemical features revealed that the uricase amino acid residues are in between 300- 338 with a molecular weight as 33-39kDa and theoretical pI ranging from 4.95-8.88. The amino acid composition results showed that valine amino acid has a high average frequency of 8.79 percentage compared to different amino acids in all analyzed species.Conclusion:In the area of bioinformatics field, this work might be informative and a stepping-stone to other researchers to get an idea about the physicochemical features, evolutionary history and structural motifs of uricase that can be widely used in biotechnological and pharmaceutical industries. Therefore, the proposed in silico analysis can be considered for protein engineering work, as well as for gout therapy.


2020 ◽  
Vol 14 (3) ◽  
pp. 235-246
Author(s):  
Sara Abdollahi ◽  
Mohammad H. Morowvat ◽  
Amir Savardashtaki ◽  
Cambyz Irajie ◽  
Sohrab Najafipour ◽  
...  

Background: Arginine deiminase is a bacterial enzyme, which degrades L-arginine. Some human cancers such as hepatocellular carcinoma (HCC) and melanoma are auxotrophic for arginine. Therefore, PEGylated arginine deiminase (ADI-PEG20) is a good anticancer candidate with antitumor effects. It causes local depletion of L-arginine and growth inhibition in arginineauxotrophic tumor cells. The FDA and EMA have granted orphan status to this drug. Some recently published patents have dealt with this enzyme or its PEGylated form. Objective: Due to increasing attention to it, we aimed to evaluate and compare 30 arginine deiminase proteins from different bacterial species through in silico analysis. Methods: The exploited analyses included the investigation of physicochemical properties, multiple sequence alignment (MSA), motif, superfamily, phylogenetic and 3D comparative analyses of arginine deiminase proteins thorough various bioinformatics tools. Results: The most abundant amino acid in the arginine deiminase proteins is leucine (10.13%) while the least amino acid ratio is cysteine (0.98%). Multiple sequence alignment showed 47 conserved patterns between 30 arginine deiminase amino acid sequences. The results of sequence homology among 30 different groups of arginine deiminase enzymes revealed that all the studied sequences located in amidinotransferase superfamily. Based on the phylogenetic analysis, two major clusters were identified. Considering the results of various in silico studies; we selected the five best candidates for further investigations. The 3D structures of the best five arginine deiminase proteins were generated by the I-TASSER server and PyMOL. The RAMPAGE analysis revealed that 81.4%-91.4%, of the selected sequences, were located in the favored region of arginine deiminase proteins. Conclusion: The results of this study shed light on the basic physicochemical properties of thirty major arginine deiminase sequences. The obtained data could be employed for further in vivo and clinical studies and also for developing the related therapeutic enzymes.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Siddhartha Kundu

Abstract Objective Non-haem iron(II)- and 2-oxoglutarate-dependent dioxygenases (i2OGdd), are a taxonomically and functionally diverse group of enzymes. The active site comprises ferrous iron in a hexa-coordinated distorted octahedron with the apoenzyme, 2-oxoglutarate and a displaceable water molecule. Current information on novel i2OGdd members is sparse and relies on computationally-derived annotation schema. The dissimilar amino acid composition and variable active site geometry thereof, results in differing reaction chemistries amongst i2OGdd members. An additional need of researchers is a curated list of sequences with putative i2OGdd function which can be probed further for empirical data. Results This work reports the implementation of $$Fe\left(2\right)OG$$ F e 2 O G , a web server with dual functionality and an extension of previous work on i2OGdd enzymes $$\left(Fe\left(2\right)OG\equiv \{H2OGpred,DB2OG\}\right)$$ F e 2 O G ≡ { H 2 O G p r e d , D B 2 O G } . $$Fe\left(2\right)OG$$ F e 2 O G , in this form is completely revised, updated (URL, scripts, repository) and will strengthen the knowledge base of investigators on i2OGdd biochemistry and function. $$Fe\left(2\right)OG$$ F e 2 O G , utilizes the superior predictive propensity of HMM-profiles of laboratory validated i2OGdd members to predict probable active site geometries in user-defined protein sequences. $$Fe\left(2\right)OG$$ F e 2 O G , also provides researchers with a pre-compiled list of analyzed and searchable i2OGdd-like sequences, many of which may be clinically relevant. $$Fe(2)OG$$ F e ( 2 ) O G , is freely available (http://204.152.217.16/Fe2OG.html) and supersedes all previous versions, i.e., H2OGpred, DB2OG.


1993 ◽  
Vol 13 (4) ◽  
pp. 2554-2563 ◽  
Author(s):  
D Wojciechowicz ◽  
C F Lu ◽  
J Kurjan ◽  
P N Lipke

alpha-Agglutinin is a cell adhesion glycoprotein expressed on the cell wall of Saccharomyces cerevisiae alpha cells. Binding of alpha-agglutinin to its ligand a-agglutinin, expressed by a cells, mediates cell-cell contact during mating. Analysis of truncations of the 650-amino-acid alpha-agglutinin structural gene AG alpha 1 delineated functional domains of alpha-agglutinin. Removal of the C-terminal hydrophobic sequence allowed efficient secretion of the protein and loss of cell surface attachment. This cell surface anchorage domain was necessary for linkage to a glycosyl phosphatidylinositol anchor. A construct expressing the N-terminal 350 amino acid residues retained full a-agglutinin-binding activity, localizing the binding domain to the N-terminal portion of alpha-agglutinin. A 278-residue N-terminal peptide was inactive; therefore, the binding domain includes residues between 278 and 350. The segment of alpha-agglutinin between amino acid residues 217 and 308 showed significant structural and sequence similarity to a consensus sequence for immunoglobulin superfamily variable-type domains. The similarity of the alpha-agglutinin-binding domain to mammalian cell adhesion proteins suggests that this structure is a highly conserved feature of adhesion proteins in diverse eukaryotes.


2012 ◽  
Vol 60 (3) ◽  
pp. 495-505
Author(s):  
M. Startek ◽  
S. Lasota ◽  
M. Sykulski ◽  
A. Bułak ◽  
L. Noé ◽  
...  

Abstract In this paper we present two algorithms that may serve as efficient alternatives to the well-known PSI BLAST tool: SeedBLAST and CTX-PSI Blast. Both may benefit from the knowledge about amino acid composition specific to a given protein family: SeedBLAST uses the advisedly designed seed, while CTX-PSI BLAST extends PSI BLAST with the context-specific substitution model. The seeding technique became central in the theory of sequence alignment. There are several efficient tools applying seeds to DNA homology search, but not to protein homology search. In this paper we fill this gap. We advocate the use of multiple subset seeds derived from a hierarchical tree of amino acid residues. Our method computes, by an evolutionary algorithm, seeds that are specifically designed for a given protein family. The seeds are represented by deterministic finite automata (DFAs) and built into the NCBI-BLAST software. This extended tool, named SeedBLAST, is compared to the original BLAST and PSI-BLAST on several protein families. Our results demonstrate a superiority of SeedBLAST in terms of efficiency, especially in the case of twilight zone hits. The contextual substitution model has been proven to increase sensitivity of protein alignment. In this paper we perform a next step in the contextual alignment program. We announce a contextual version of the PSI-BLAST algorithm, an iterative version of the NCBI-BLAST tool. The experimental evaluation has been performed demonstrating a significantly higher sensitivity compared to the ordinary PSI-BLAST algorithm.


1988 ◽  
Vol 8 (2) ◽  
pp. 737-746
Author(s):  
D Eide ◽  
P Anderson

The transposable element Tc1 is responsible for most spontaneous mutations that occur in Caenorhabditis elegans variety Bergerac. We investigated the genetic and molecular properties of Tc1 transposition and excision. We show that Tc1 insertion into the unc-54 myosin heavy-chain gene was strongly site specific. The DNA sequences of independent Tc1 insertion sites were similar to each other, and we present a consensus sequence for Tc1 insertion that describes these similarities. We show that Tc1 excision was usually imprecise. Tc1 excision was imprecise in both germ line and somatic cells. Imprecise excision generated novel unc-54 alleles that had amino acid substitutions, amino acid insertions, and, in certain cases, probably altered mRNA splicing. The DNA sequences remaining after Tc1 somatic excision were the same as those remaining after germ line excision, but the frequency of somatic excision was at least 1,000-fold higher than that of germ line excision. The genetic properties of Tc1 excision, combined with the DNA sequences of the resulting unc-54 alleles, demonstrated that excision was dependent on Tc1 transposition functions in both germ line and somatic cells. Somatic excision was not regulated in the same strain-specific manner as germ-line excision was. In a genetic background where Tc1 transposition and excision in the germ line was not detectable, Tc1 excision in the soma still occurred at high frequency.


2007 ◽  
Vol 23 (20) ◽  
pp. 2795-2796 ◽  
Author(s):  
M. Ganapathiraju ◽  
C. J. Jursa ◽  
H. A. Karimi ◽  
J. Klein-Seetharaman

Sign in / Sign up

Export Citation Format

Share Document