Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase

D Bowen; J A Littlechild; J E Fothergill; H C Watson; L Hall

doi:10.1042/bj2540509

Biochemical Journal ◽

10.1042/bj2540509 ◽

1988 ◽

Vol 254 (2) ◽

pp. 509-517 ◽

Cited By ~ 42

Author(s):

D Bowen ◽

J A Littlechild ◽

J E Fothergill ◽

H C Watson ◽

L Hall

Keyword(s):

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Thermus Thermophilus ◽

Phosphoglycerate Kinase ◽

Amino Acid Residues ◽

Extreme Thermophile ◽

Kinase Gene ◽

Reading Frame ◽

Phosphoglycerate Kinase Gene

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability.
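
The third-position G or C bias quoted above can be computed directly from any in-frame coding sequence. The following is a minimal Python sketch, not the authors' method: the function name `gc3_fraction` and the toy sequence are hypothetical, and the abstract does not say whether the stop codon was counted, so this sketch simply counts every codon it is given.

```python
def gc3_fraction(orf: str) -> float:
    """Return the fraction of codons whose third base is G or C.

    Assumes `orf` is an in-frame coding sequence (length divisible by 3).
    Every codon passed in is counted, including a stop codon if present.
    """
    seq = orf.upper()
    if not seq:
        raise ValueError("empty sequence")
    if len(seq) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    third_bases = seq[2::3]  # every third base, starting at codon position 3
    gc_count = sum(base in "GC" for base in third_bases)
    return gc_count / len(third_bases)


# Hypothetical six-codon toy sequence, NOT the T. thermophilus gene:
# ATG GCC CTG AAG GAC TAA -> third bases G, C, G, G, C, A
toy_orf = "ATGGCCCTGAAGGACTAA"
print(f"{gc3_fraction(toy_orf):.3f}")  # prints 0.833
```

Applied to the 390-codon open reading frame described in the abstract, a function of this kind would be expected to return a value close to 0.931, subject to the stop-codon caveat noted above.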

Nucleotide Sequence of a Wheat Chloroplastic Phosphoglycerate Kinase Gene

PLANT PHYSIOLOGY ◽

10.1104/pp.107.4.1483 ◽

1995 ◽

Vol 107 (4) ◽

pp. 1483-1484 ◽

Cited By ~ 2

Author(s):

P. G. Jones ◽

C. A. Raines ◽

J. C. Lloyd

Keyword(s):

Nucleotide Sequence ◽

Phosphoglycerate Kinase ◽

Kinase Gene ◽

Phosphoglycerate Kinase Gene

Expression patterns of the genes that encode lectin or lectin-related polypeptides in Robinia pseudoacacia

Functional Plant Biology ◽

10.1071/pp99048 ◽

1999 ◽

Vol 26 (5) ◽

pp. 495 ◽

Cited By ~ 3

Author(s):

Kazumasa Yoshida ◽

Kiyoshi Tazaki

Keyword(s):

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Sequence Data ◽

Expression Patterns ◽

Robinia Pseudoacacia ◽

Regulation Of Expression ◽

Nucleotide Sequence Data ◽

Reading Frame ◽

Inner Bark

Three genomic clones (Rplec2, Rplec5 and Rplec6) and a cDNA clone (LECRPA4) that encoded lectin or lectin-related polypeptides were isolated from Robinia pseudoacacia L. A comparison of the nucleotide sequences of Rplec2 and a previously reported cDNA for the subunit indicated that Rplec2 encoded the 29 kDa subunit of the inner-bark lectin RPbAI. Rplec5 encoded a polypeptide whose deduced amino acid sequence was 96.1% identical to that of a subunit of seed lectin. The amino acid sequence deduced from the open reading frame of Rplec6 showed 61.1% identity to that encoded by Rplec5. LECRPA4 was isolated from an inner-bark cDNA library and appeared to encode the 26 kDa subunit of inner-bark lectin RPbAII. The expression patterns of the various genes in tissues were examined by reverse transcriptase-polymerase chain reaction (RT-PCR) with appropriate primers. Rplec2 transcripts were detected in the inner bark and roots. Rplec5 transcripts were detected in the inner bark, seeds and roots. No Rplec6 transcripts were detected in any of the tissues examined. LECRPA4 transcripts were found in leaves and in the inner bark. The level of expression of Rplec2 in the inner bark appeared to be similar in samples collected in different years and from different trees, whereas levels of expression of Rplec5 and LECRPA4 varied. These results suggest the differential regulation of expression of members of the lectin gene family in tissues of R. pseudoacacia. The nucleotide sequence data reported herein will appear in the DDBJ, EMBL and GenBank Nucleotide Sequence Databases under the accession numbers AB012632 (Rplec2), AB012633 (Rplec5), AB012634 (Rplec6) and AB012635 (LECRPA4).

Characteristics of a New Enantioselective Thermostable Dipeptidase from Brevibacillus borstelensis BCS-1 and Its Application to Synthesis of a d-Amino-Acid-Containing Dipeptide

Applied and Environmental Microbiology ◽

10.1128/aem.70.3.1570-1575.2004 ◽

2004 ◽

Vol 70 (3) ◽

pp. 1570-1575 ◽

Cited By ~ 4

Author(s):

Dae Heoun Baek ◽

Jae Jun Song ◽

Seok-Joon Kwon ◽

Chung Park ◽

Chang-Min Jung ◽

...

Keyword(s):

Escherichia Coli ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Open Reading Frame ◽

Nucleotide Sequence Analysis ◽

Reading Frame ◽

E Coli ◽

Specificity Constant ◽

Optimal Ph

A new thermostable dipeptidase gene was cloned from the thermophile Brevibacillus borstelensis BCS-1 by genetic complementation of the d-Glu auxotroph Escherichia coli WM335 on a plate containing d-Ala-d-Glu. Nucleotide sequence analysis revealed that the gene included an open reading frame coding for a 307-amino-acid sequence with an Mr of 35,000. The deduced amino acid sequence of the dipeptidase exhibited 52% similarity with the dipeptidase from Listeria monocytogenes. The enzyme was purified to homogeneity from recombinant E. coli WM335 harboring the dipeptidase gene from B. borstelensis BCS-1. Investigation of the enantioselectivity (E) to the P1 and P1′ site of Ala-Ala revealed that the ratio of the specificity constant (kcat/Km) for l-enantioselectivity to the P1 site of Ala-Ala was 23.4 ± 2.2 [E = (kcat/Km)l,d/(kcat/Km)d,d], while the d-enantioselectivity to the P1′ site of Ala-Ala was 16.4 ± 0.5 [E = (kcat/Km)l,d/(kcat/Km)l,l] at 55°C. The enzyme was stable up to 55°C, and the optimal pH and temperature were 8.5 and 65°C, respectively. The enzyme was able to hydrolyze l-Asp-d-Ala, l-Asp-d-AlaOMe, Z-d-Ala-d-AlaOBzl, and Z-l-Asp-d-AlaOBzl, yet it could not hydrolyze d-Ala-l-Asp, d-Ala-l-Ala, d-AlaNH2, and l-AlaNH2. The enzyme also exhibited β-lactamase activity similar to that of a human renal dipeptidase. The dipeptidase successfully synthesized the precursor of the dipeptide sweetener Z-l-Asp-d-AlaOBzl.

The testis-specific phosphoglycerate kinase gene pgk-2 is a recruited retroposon

Molecular and Cellular Biology ◽

10.1128/mcb.7.9.3107-3112.1987 ◽

1987 ◽

Vol 7 (9) ◽

pp. 3107-3112

Author(s):

P H Boer ◽

C N Adra ◽

Y F Lau ◽

M W McBurney

Keyword(s):

Nucleotide Sequence ◽

Gene Duplication ◽

X Chromosome ◽

Phosphoglycerate Kinase ◽

Sperm Cells ◽

Kinase Gene ◽

Evolutionary Diversification ◽

Key Enzyme ◽

Phosphoglycerate Kinase Gene ◽

Kinase A

In both humans and mice, two genes encode phosphoglycerate kinase, a key enzyme in the glycolytic pathway. The pgk-1 gene is expressed in all somatic cells, is located on the X chromosome, and contains 10 introns. The pgk-2 gene is expressed only in sperm cells, is located on an autosome, and has no introns. The nucleotide sequence of the pgk-2 gene suggests that it arose from pgk-1 more than 100 million years ago by RNA-mediated gene duplication. The pgk-2 gene may, then, be a transcribed retroposon. Thus, gene duplication by retroposition may have been used as a mechanism for evolutionary diversification.

Isolation, characterization and sequence analysis of a full-length cDNA clone encoding acetohydroxy acid reductoisomerase from spinach chloroplasts

Biochemical Journal ◽

10.1042/bj2770469 ◽

1991 ◽

Vol 277 (2) ◽

pp. 469-475 ◽

Cited By ~ 13

Author(s):

R Dumas ◽

M Lebrun ◽

R Douce

Keyword(s):

Amino Acid ◽

Amino Acid Sequence ◽

Single Gene ◽

Amino Acid Sequences ◽

Full Length ◽

Open Reading Frame ◽

Amino Acid Residues ◽

Protein Precursor ◽

Reading Frame ◽

Full Length Cdna

Acetohydroxy acid reductoisomerase (AHRI), the second enzyme in the parallel isoleucine/valine-biosynthetic pathway, catalyses an unusual two-step reaction in which the substrate, either 2-acetolactate or 2-aceto-2-hydroxybutyrate, is converted via an alkyl migration and an NADPH-dependent reduction to give 2,3-dihydroxy-3-methylbutyrate or 2,3-dihydroxy-3-methylvalerate respectively. We have isolated and characterized a full-length cDNA from a lambda gt11 spinach library encoding the complete acetohydroxy acid reductoisomerase protein precursor. The 2050-nucleotide sequence contains a 1785-nucleotide open reading frame. The derived amino acid sequence indicates that the protein precursor consists of 595 amino acid residues including a presequence peptide of 72 amino acid residues. The N-terminal sequence of the first 16 amino acid residues of the purified AHRI confirms the identity of the cDNA. The derived amino acid sequence from this open reading frame shows 23% identity with the deduced amino acid sequences of the Escherichia coli and Saccharomyces cerevisiae AHRI proteins. There are two blocks of conserved amino acid residues in these three proteins. One of these is a sequence similar to the ‘fingerprint’ region of the NAD(P)H-binding site found in a large number of NAD(P)H-dependent oxidoreductases. The other, a short sequence (Lys-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Xaa-Ser-His-Gly-Phe) containing the amino acids lysine and histidine, could well be the catalytic site of the first step of the AHRI reaction. Southern-blot analysis indicated that AHRI is encoded by a single gene per haploid genome of about 7.5 kbp containing at least four introns.

The overexpression and complete amino acid sequence of Escherichia coli 3-dehydroquinase

Biochemical Journal ◽

10.1042/bj2380475 ◽

1986 ◽

Vol 238 (2) ◽

pp. 475-483 ◽

Cited By ~ 36

Author(s):

K Duncan ◽

S Chaudhuri ◽

M S Campbell ◽

J R Coggins

Keyword(s):

Escherichia Coli ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Polypeptide Chain ◽

Amino Acid Residues ◽

Transcript Mapping ◽

Terminal Amino Acid ◽

Terminal Amino ◽

Terminal Amino Acid Sequence

The enzyme 3-dehydroquinase was purified in milligram quantities from an overproducing strain of Escherichia coli. The amino acid sequence was deduced from the nucleotide sequence of the aroD gene and confirmed by determining the amino acid composition of the overproduced enzyme and its N-terminal amino acid sequence. The complete polypeptide chain consists of 240 amino acid residues and has a calculated subunit Mr of 26,377. Transcript mapping revealed that aroD is a typical monocistronic gene.

Primary structure of the cytosolic β-glucosidase of guinea pig liver

Biochemical Journal ◽

10.1042/bj3190829 ◽

1996 ◽

Vol 319 (3) ◽

pp. 829-837 ◽

Cited By ~ 16

Author(s):

William S HAYS ◽

Steven A. JENISON ◽

Takashi YAMADA ◽

Andrzej PASTUSZYN ◽

Robert H. GLEW

Keyword(s):

Amino Acid ◽

Amino Acid Sequence ◽

Guinea Pig ◽

Mammalian Species ◽

Amino Acid Sequences ◽

Amino Acid Residues ◽

Pig Liver ◽

Reading Frame ◽

Degenerate Oligonucleotide ◽

Guinea Pig Liver

The cytosolic β-glucosidase (EC 3.2.1.21) present in the livers of mammalian species is distinguished by its broad specificity for sugars and its preference for hydrophobic aglycones. We purified the cytosolic β-glucosidase from guinea pig liver and sequenced 142 amino acid residues contained within 12 trypsin digest fragments. Using degenerate oligonucleotide primers deduced from the peptide sequences, a 622 bp cytosolic β-glucosidase cDNA was amplified by reverse-transcriptase PCR, using total guinea pig liver RNA as template. The ‘rapid amplification of cDNA ends (RACE)’ method [Frohman (1993) Methods Enzymol. 218, 340–356] was used to synthesize the remaining segments of the full-length cDNA. The complete cDNA contained 1671 nucleotides with an open reading frame coding for 469 amino acid residues. The amino acid sequence deduced from the cDNA sequence included the amino acid sequences of all 12 trypsin digest fragments derived from the purified enzyme. Amino acid sequence analysis indicates that the guinea pig liver cytosolic β-glucosidase is a Family 1 β-glycosidase and that it is most closely related to mammalian lactase-phlorizin hydrolase. These results suggest that the cytosolic β-glucosidase and lactase-phlorizin hydrolase diverged from a common evolutionary precursor.

Nucleotide sequence of the marmoset herpesvirus thymidine kinase gene and predicted amino acid sequence of thymidine kinase polypeptide

Virology ◽

10.1016/0042-6822(84)90189-2 ◽

1984 ◽

Vol 135 (2) ◽

pp. 316-330 ◽

Cited By ~ 33

Author(s):

Haruki Otsuka ◽

Saul Kit

Keyword(s):

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Thymidine Kinase ◽

Predicted Amino Acid Sequence ◽

Thymidine Kinase Gene ◽

Kinase Gene

MATLAB software for extracting protein name and sequence information from FASTA formatted proteome file

10.7287/peerj.preprints.27856 ◽

2019 ◽

Author(s):

Wenfa Ng

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Sequence Information ◽

Amino Acid Residues ◽

Protein Database ◽

Matlab Software ◽

Amino Acid Sequence Information ◽

New Protein

The FASTA file format is a common file type for distributing proteome information, especially proteomes obtained from UniProt. While MATLAB can read FASTA files automatically using the built-in function fastaread, important information such as protein name and organism name remains enmeshed in a character array. This makes it difficult to automatically extract protein names from a FASTA proteome file when building a database with fields comprising protein name and amino acid sequence. The objective of this work was to develop MATLAB software that automatically extracts protein name and amino acid sequence information from a FASTA proteome file and assigns them to a new database comprising fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of the protein and nucleotide sequence of the protein. The number of amino acid residues was obtained by applying the MATLAB built-in function length to the amino acid sequence of each protein. The final two fields were provided by the MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight is useful for a variety of applications, while nucleotide sequence is essential for gene synthesis in molecular cloning. Finally, the software includes an error-check function that detects letters in the amino acid sequence that are not among the 20 natural amino acids. Sequences containing such letters would be erroneous inputs to molweight and aa2nt and are not processed. Collectively, using built-in functions, the number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated, yielding a new protein database with improved functionality that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.
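
The extraction step described in this abstract can be illustrated with a short sketch. It is written in Python as a stand-in and is not the author's MATLAB software: the function names, the record fields, and the two demo entries are hypothetical, and the regular expression assumes UniProt-style headers of the form `>db|accession|entry_name Protein name OS=Organism OX=...`. Molecular weight and back-translation (molweight and aa2nt in MATLAB) are deliberately left out.

```python
import re

STANDARD_AA = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 natural amino acids

# Assumed UniProt-style header: >db|accession|entry_name Protein name OS=Organism OX=...
HEADER_RE = re.compile(r"^>(?:\w+)\|([^|]+)\|(\S+)\s+(.*?)\s+OS=(.*?)\s+OX=")


def _make_record(header, sequence):
    """Build one database row from a header line and its joined sequence."""
    match = HEADER_RE.match(header)
    sequence = sequence.upper()
    return {
        # Fall back to the raw header text if the header is not UniProt-style.
        "protein_name": match.group(3) if match else header[1:],
        "organism": match.group(4) if match else "",
        "sequence": sequence,
        "length": len(sequence),
        # Error check: flag sequences with letters outside the 20 natural amino acids.
        "valid": set(sequence) <= STANDARD_AA,
    }


def parse_fasta(text):
    """Split FASTA text into records, joining multi-line sequences."""
    records, header, chunks = [], None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append(_make_record(header, "".join(chunks)))
            header, chunks = line, []
        else:
            chunks.append(line)
    if header is not None:
        records.append(_make_record(header, "".join(chunks)))
    return records


# Hypothetical demo entries, not real UniProt records.
example = """>sp|X00001|DEMO1_TEST Demo protein one OS=Example organism OX=0 GN=demo1 PE=4 SV=1
MKV
LAG
>sp|X00002|DEMO2_TEST Demo protein two OS=Example organism OX=0 PE=4 SV=1
MKXB
"""

for rec in parse_fasta(example):
    print(rec["protein_name"], rec["length"], rec["valid"])
```

The second demo entry contains the letters X and B, so it is flagged as invalid, mirroring the error check that the abstract says prevents such sequences from being passed to the molecular weight and back-translation steps.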

Sequence, Structure, and Binding Site Analysis of Kirkiin in Comparison with Ricin and Other Type 2 RIPs

Toxins ◽

10.3390/toxins13120862 ◽

2021 ◽

Vol 13 (12) ◽

pp. 862

Author(s):

Stefania Maiello ◽

Rosario Iglesias ◽

Letizia Polito ◽

Lucía Citores ◽

Massimo Bortolotti ◽

...

Keyword(s):

Amino Acid ◽

Sequence Analysis ◽

Amino Acid Sequence ◽

Structure Prediction ◽

3D Structure ◽

Binding Activity ◽

Amino Acid Residues ◽

Reading Frame ◽

A Chain

Kirkiin is a new type 2 ribosome-inactivating protein (RIP) purified from the caudex of Adenia kirkii with a cytotoxicity comparable to that of stenodactylin. The high toxicity of RIPs from Adenia genus plants makes them interesting tools for biotechnology and therapeutic applications, particularly in cancer therapy. The complete amino acid sequence and 3D structure prediction of kirkiin are reported here. Gene sequence analysis revealed that kirkiin is encoded by a 1572 bp open reading frame, corresponding to 524 amino acid residues, without introns. The amino acid sequence analysis showed a high degree of identity with other Adenia RIPs. The 3D structure of kirkiin preserves the overall folding of type 2 RIPs. The key amino acids of the active site, described for ricin and other RIPs, are also conserved in the kirkiin A chain. Sugar affinity studies and docking experiments revealed that both the 1α and 2γ sites of the kirkiin B chain exhibit binding activity toward lactose and D-galactose, although lower than that of ricin. The presence of His246 in the kirkiin 2γ site, in place of Tyr248 in ricin, causes a different structural arrangement that could explain the lower sugar affinity of kirkiin with respect to ricin.
