scholarly journals Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase

1988 ◽  
Vol 254 (2) ◽  
pp. 509-517 ◽  
Author(s):  
D Bowen ◽  
J A Littlechild ◽  
J E Fothergill ◽  
H C Watson ◽  
L Hall

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability.

1995 ◽  
Vol 107 (4) ◽  
pp. 1483-1484 ◽  
Author(s):  
P. G. Jones ◽  
C. A. Raines ◽  
J. C. Lloyd

1999 ◽  
Vol 26 (5) ◽  
pp. 495 ◽  
Author(s):  
Kazumasa Yoshida ◽  
Kiyoshi Tazaki

Three genomic clones (Rplec2, Rplec5 and Rplec6) and a cDNA clone (LECRPA4) that encoded lectin or lectin-related polypeptides were isolated from Robinia pseudoacacia L. A comparison of the nucleotide sequences of Rplec2 and a previously reported cDNA for the subunit indicated that Rplec2 encoded the 29 kDa subunit of the inner-bark lectin RPbAI. Rplec5 encoded a polypeptide whose deduced amino acid sequence was 96.1% identical to that of a subunit of seed lectin. The amino acid sequence deduced from the open reading frame of Rplec6 showed 61.1% identity to that encoded by Rplec5. LECRPA4 was isolated from an inner bark cDNA library and appeared to encode the 26 kDa subunit of inner-bark lectin RPbAII. The expression patterns of the various genes in tissues were examined by the reverse transcriptase-polymerase chain reaction (RT-PCR) with appropriate primers. Rplec2 transcripts were detected in the inner bark and roots. Rplec5 transcripts were detected in the inner bark, seeds and roots. No Rplec6 transcripts were detected in all tissues examined. LECRPA4 transcripts were found in leaves and in the inner bark. The level of expression of Rplec2 in the inner bark appeared to be similar in samples collected in different years and from different trees, whereas levels of expression of Rplec5 and LECRPA4 varied. These results suggest the differential regulation of expression of members of the lectin gene family in tissues of R. pseudoacacia. The nucleotide sequence data reported herein will appear in the DDBJ, EMBL and GenBank Nucleotide Sequence Databases under the accession numbers AB 012632 (Rplec2), AB012633 (Rplec5), AB012634 (Rplec6) and AB012635 (LECRPA4).


2004 ◽  
Vol 70 (3) ◽  
pp. 1570-1575 ◽  
Author(s):  
Dae Heoun Baek ◽  
Jae Jun Song ◽  
Seok-Joon Kwon ◽  
Chung Park ◽  
Chang-Min Jung ◽  
...  

ABSTRACT A new thermostable dipeptidase gene was cloned from the thermophile Brevibacillus borstelensis BCS-1 by genetic complementation of the d-Glu auxotroph Escherichia coli WM335 on a plate containing d-Ala-d-Glu. Nucleotide sequence analysis revealed that the gene included an open reading frame coding for a 307-amino-acid sequence with an M r of 35,000. The deduced amino acid sequence of the dipeptidase exhibited 52% similarity with the dipeptidase from Listeria monocytogenes. The enzyme was purified to homogeneity from recombinant E. coli WM335 harboring the dipeptidase gene from B. borstelensis BCS-1. Investigation of the enantioselectivity (E) to the P1 and P1′ site of Ala-Ala revealed that the ratio of the specificity constant (k cat /Km ) for l-enantioselectivity to the P1 site of Ala-Ala was 23.4 � 2.2 [E = (k cat /Km ) l,d /(k cat /Km ) d,d ], while the d-enantioselectivity to the P1′ site of Ala-Ala was 16.4 � 0.5 [E = (k cat /Km ) l,d /(k cat /Km ) l,l ] at 55�C. The enzyme was stable up to 55�C, and the optimal pH and temperature were 8.5 and 65�C, respectively. The enzyme was able to hydrolyze l-Asp-d-Ala, l-Asp-d-AlaOMe, Z-d-Ala-d-AlaOBzl, and Z-l-Asp-d-AlaOBzl, yet it could not hydrolyze d-Ala-l-Asp, d-Ala-l-Ala, d-AlaNH2, and l-AlaNH2. The enzyme also exhibited β-lactamase activity similar to that of a human renal dipeptidase. The dipeptidase successfully synthesized the precursor of the dipeptide sweetener Z-l-Asp-d-AlaOBzl.


1987 ◽  
Vol 7 (9) ◽  
pp. 3107-3112
Author(s):  
P H Boer ◽  
C N Adra ◽  
Y F Lau ◽  
M W McBurney

In both humans and mice, two genes encode phosphoglycerate kinase, a key enzyme in the glycolytic pathway. The pgk-1 gene is expressed in all somatic cells, is located on the X chromosome, and contains 10 introns. The pgk-2 gene is expressed only in sperm cells, is located on an autosome, and has no introns. The nucleotide sequence of the pgk-2 gene suggests that it arose from pgk-1 more than 100 million years ago by RNA-mediated gene duplication. The pgk-2 gene may, then, be a transcribed retroposon. Thus, gene duplication by retroposition may have been used as a mechanism for evolutionary diversification.


1991 ◽  
Vol 277 (2) ◽  
pp. 469-475 ◽  
Author(s):  
R Dumas ◽  
M Lebrun ◽  
R Douce

Acetohydroxy acid reductoisomerase (AHRI), the second enzyme in the parallel isoleucine/valine-biosynthetic pathway, catalyses an unusual two-step reaction in which the substrate, either 2-acetolactate or 2-aceto-2-hydroxybutyrate, is converted via an alkyl migration and an NADPH-dependent reduction to give 2,3-dihydroxy-3-methylbutyrate or 2,3-dihydroxy-3-methylvalerate respectively. We have isolated and characterized a full-length cDNA from a lambda gt11 spinach library encoding the complete acetohydroxy acid reductoisomerase protein precursor. The 2050-nucleotide sequence contains a 1785-nucleotide open reading frame. The derived amino acid sequence indicates that the protein precursor consists of 595 amino acid residues including a presequence peptide of 72 amino acid residues. The N-terminal sequence of the first 16 amino acid residues of the purified AHRI confirms the identity of the cDNA. The derived amino acid sequence from this open reading frame shows 23% identity with the deduced amino acid sequences of the Escherichia coli and Saccharomyces cerevisiae AHRI proteins. There are two blocks of conserved amino acid residues in these three proteins. One of these is a sequence similar to the ‘fingerprint’ region of the NAD(P)H-binding site found in a large number of NAD(P)H-dependent oxidoreductases. The other, a short sequence (Lys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Ser-His-Gly-Phe) containing the amino acids lysine and histidine, could well be the catalytic site of the first step of the AHRI reaction. Southern-blot analysis indicated that AHRI is encoded by a single gene per haploid genome of about 7.5 kbp containing at least four introns.


1986 ◽  
Vol 238 (2) ◽  
pp. 475-483 ◽  
Author(s):  
K Duncan ◽  
S Chaudhuri ◽  
M S Campbell ◽  
J R Coggins

The enzyme 3-dehydroquinase was purified in milligram quantities from an overproducing strain of Escherichia coli. The amino acid sequence was deduced from the nucleotide sequence of the aroD gene and confirmed by determining the amino acid composition of the overproduced enzyme and its N-terminal amino acid sequence. The complete polypeptide chain consists of 240 amino acid residues and has a calculated subunit Mr of 26,377. Transcript mapping revealed that aroD is a typical monocistronic gene.


1996 ◽  
Vol 319 (3) ◽  
pp. 829-837 ◽  
Author(s):  
William S HAYS ◽  
Steven A. JENISON ◽  
Takashi YAMADA ◽  
Andrzej PASTUSZYN ◽  
Robert H. GLEW

The cytosolic β-glucosidase (EC 3.2.1.21) present in the livers of mammalian species is distinguished by its broad specificity for sugars and its preference for hydrophobic aglycones. We purified the cytosolic β-glucosidase from guinea pig liver and sequenced 142 amino acid residues contained within 12 trypsin digest fragments. Using degenerate oligonucleotide primers deduced from the peptide sequences, a 622 bp cytosolic β-glucosidase cDNA was amplified by reverse-transcriptase PCR, using total guinea pig liver RNA as template. The ‘rapid amplification of cDNA ends (RACE)’ method [Frohman (1993) Methods Enzymol. 218, 340–356] was used to synthesize the remaining segments of the full-length cDNA. The complete cDNA contained 1671 nucleotides with an open reading frame coding for 469 amino acid residues. The amino acid sequence deduced from the cDNA sequence included the amino acid sequences of all 12 trypsin digest fragments derived from the purified enzyme. Amino acid sequence analysis indicates that the guinea pig liver cytosolic β-glucosidase is a Family 1 β-glycosidase and that it is most closely related to mammalian lactase-phlorizin hydrolase. These results suggest that the cytosolic β-glucosidase and lactase-phlorizin hydrolase diverged from a common evolutionary precursor.


2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


Toxins ◽  
2021 ◽  
Vol 13 (12) ◽  
pp. 862
Author(s):  
Stefania Maiello ◽  
Rosario Iglesias ◽  
Letizia Polito ◽  
Lucía Citores ◽  
Massimo Bortolotti ◽  
...  

Kirkiin is a new type 2 ribosome-inactivating protein (RIP) purified from the caudex of Adenia kirkii with a cytotoxicity compared to that of stenodactylin. The high toxicity of RIPs from Adenia genus plants makes them interesting tools for biotechnology and therapeutic applications, particularly in cancer therapy. The complete amino acid sequence and 3D structure prediction of kirkiin are here reported. Gene sequence analysis revealed that kirkiin is encoded by a 1572 bp open reading frame, corresponding to 524 amino acid residues, without introns. The amino acid sequence analysis showed a high degree of identity with other Adenia RIPs. The 3D structure of kirkiin preserves the overall folding of type 2 RIPs. The key amino acids of the active site, described for ricin and other RIPs, are also conserved in the kirkiin A chain. Sugar affinity studies and docking experiments revealed that both the 1α and 2γ sites of the kirkiin B chain exhibit binding activity toward lactose and D-galactose, being lower than ricin. The replacement of His246 in the kirkiin 2γ site instead of Tyr248 in ricin causes a different structure arrangement that could explain the lower sugar affinity of kirkiin with respect to ricin.


Sign in / Sign up

Export Citation Format

Share Document