Amino acid sequence of the acidic acrosin inhibitor (BUSI I B2) from bull seminal plasma

Bedřich Meloun; Věra Jonáková; Dana Čechová

doi:10.1135/cccc19832558

Amino acid sequence of basic acrosin inhibitor from bull seminal plasma

Collection of Czechoslovak Chemical Communications ◽

10.1135/cccc19792710 ◽

1979 ◽

Vol 44 (9) ◽

pp. 2710-2721 ◽

Cited By ~ 10

Author(s):

Bedřich Meloun ◽

Dana Čechová

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Carboxylic Acid ◽

Amino Acid Sequence ◽

Seminal Plasma ◽

Terminal Group ◽

Sequential Data ◽

Amino Acid Residues ◽

Pyrrolidone Carboxylic Acid

The molecule of the inhibitor contains 57 amino acid residues whose sequence is the following: Pyr-Gly-Ala-Gln-Val-Asp-Cys-Ala-Glu-Phe-Lys-Asp-Pro-Lys-Val-Tyr-Cys-Thr-Arg-His-Ser-Asp-Pro-Gln-Cys-Gly-Ser-Asn-Gly-Glu-Thr-Tyr-Gly-Asn-Lys-Cys-Ala-Phe-Cys-Lys-Ala-Val-Met-Lys-Ser-Gly-Gly-Lys-Ile-Asn-Leu-Lys-His-Arg-Gly-Cys-Lys. The N-terminal group of the inhibitor is pyrrolidone carboxylic acid. The sequential data were obtained by analyses of peptides isolated from tryptic and chymotryptic digests of the oxidized or carboxymethylated inhibitor. The molecular weight of the inhibitor is 6 200.

Download Full-text

Amino acid sequences around the cysteine residue of calf lens α-crystallin

Biochemical Journal ◽

10.1042/bj1240061 ◽

1971 ◽

Vol 124 (1) ◽

pp. 61-67 ◽

Cited By ~ 11

Author(s):

P. H. Corran ◽

S. G. Waley

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Amino Acid Sequence ◽

Thiol Group ◽

Cysteine Residue ◽

Amino Acid Sequences ◽

Amino Acid Residues ◽

Radioactive Sodium ◽

Group 2

1. Calf lens α-crystallin was carboxymethylated with radioactive sodium iodoacetate to label the thiol group. 2. The protein was then digested with trypsin or alternatively fractionated in urea to obtain the acidic (A) chains, which were then digested with trypsin. Either procedure gave two radioactive peptides containing carboxymethylcysteine. 3. These two peptides were closely related: the longer form contained 28 amino acid residues, and the shorter lacked two residues at the N-terminal end of the longer form. 4. The amino acid sequence of the peptides have been determined. 5. No evidence for the presence of more than one cysteine residue/chain was found. 6. The question of the molecular weight of the chains is discussed.

Download Full-text

MATLAB software for extracting protein name and sequence information from FASTA formatted proteome file

10.7287/peerj.preprints.27856 ◽

2019 ◽

Author(s):

Wenfa Ng

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Sequence Information ◽

Amino Acid Residues ◽

Protein Database ◽

Matlab Software ◽

Amino Acid Sequence Information ◽

New Protein

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.

Download Full-text

The amino acid sequence of troponin I from rabbit skeletal muscle

Biochemical Journal ◽

10.1042/bj1490493 ◽

1975 ◽

Vol 149 (2) ◽

pp. 493-496 ◽

Cited By ~ 71

Author(s):

J M Wilkinson ◽

R J A. Grand

Keyword(s):

Skeletal Muscle ◽

Molecular Weight ◽

Amino Acid ◽

Amino Acid Sequence ◽

Cyanogen Bromide ◽

Troponin I ◽

Rabbit Skeletal Muscle ◽

British Library ◽

Amino Acid Residues ◽

N Terminus

The complete amino acid sequence of rabbit skeletal muscle troponin I was determined by the isolation of the cyanogen bromide fragments and the tryptic methionine-containing peptides. Troponin I contains 179 amino acid residues and has a molecular weight of 20864. Its N-terminus is acetylated. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50055 (23 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5.

Download Full-text

Disulfide bonds of basic acrosin inhibitor (BUSI II) from bull seminal plasma

Collection of Czechoslovak Chemical Communications ◽

10.1135/cccc19841204 ◽

1984 ◽

Vol 49 (5) ◽

pp. 1204-1210 ◽

Cited By ~ 1

Author(s):

Alena Rzavská ◽

Bedřich Meloun ◽

Dana Čechová

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Amino Acid Sequence ◽

Seminal Plasma ◽

Disulfide Bonds ◽

Cysteine Residues ◽

Chromatographic Techniques

The thermolysin digest of the basic acrosin inhibitor (molecular weight 6 200) was resolved on a column of Sephadex G-15 and subsequently by electrophoretic and chromatographic techniques. Cysteine peptides, which link together half-cysteine residues 7 and 39, 17 and 36, and 25 and 57 by disulfide bonds, were isolated. The structure of the molecule of the basic acrosin inhibitor corresponds to structures of inhibitors of the Kazal type. The amino acid sequence of a few residues in the basic acrosin inhibitor has been revised.

Download Full-text

The amino acid sequence of wheat β-purothionin

Canadian Journal of Biochemistry ◽

10.1139/o76-120 ◽

1976 ◽

Vol 54 (10) ◽

pp. 835-842 ◽

Cited By ~ 46

Author(s):

A. S. Mak ◽

B. L. Jones

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Amino Acid Sequence ◽

Primary Structure ◽

Basic Protein ◽

Viscum Album ◽

Low Molecular Weight ◽

Amino Acid Residues ◽

Basic Proteins ◽

Considerable Homology

The complete amino acid sequence of β-purothionin, a low molecular weight, very basic, protein isolated from wheat endosperm material, has been determined. β-purothionin is toxic to some bacteria, to yeasts, and to animals when injected. The protein contains 45 amino acid residues and has a molecular weight of 4913. The 8 cysteine and 10 basic residues are distributed throughout the molecule. The primary structure of the protein shows considerable homology to those of the viscotoxins, which are toxic, small, basic proteins found in the leaves and stems of European mistletoe (Viscum album L.).

Download Full-text

Thermitase from Thermoactinomyces vulgaris; amino acid sequence of the large N-terminal cyanogen bromide peptide

Collection of Czechoslovak Chemical Communications ◽

10.1135/cccc19850885 ◽

1985 ◽

Vol 50 (4) ◽

pp. 885-896 ◽

Cited By ~ 4

Author(s):

Bedřich Meloun ◽

Miroslav Baudyš ◽

Manfred Pavlík ◽

Vladimír Kostka ◽

Gert Hausdorf ◽

...

Keyword(s):

Amino Acid ◽

Sequence Analysis ◽

Amino Acid Sequence ◽

High Performance ◽

Cyanogen Bromide ◽

Sequential Data ◽

Amino Acid Residues ◽

Methionine Residue ◽

Large N ◽

Thermoactinomyces Vulgaris

The large cyanogen bromide fragment (CB1) represents the N-terminal part of the molecule of thermitase and contains 226 amino acid residues. Its molecular weight calculated from sequential data is 22 932 (the C-terminal residue is regarded as a methionine residue in the calculations). The amino acid sequence of fragment CB1 was determined by analysis of peptides obtained by tryptic hydrolysis of the fragment; these data were complemented by sequence analysis of the chymotryptic digest of fragment Mf (residues 75 through 226) and of chymotryptic fragment ET3 (residues 103 through 226) isolated from the limited tryptic digest of fragment CB1. The peptides were purified by high performance liquid chromatography and by thin layer techniques. The sequence analysis of the large peptides was effected in the sequenator, small peptides were sequenced manually by the DABITC/PITC double coupling technique. The results obtained in this study together with those of previous work5 permitted the complete amino acid sequence of fragment CB1 to be determined.

Download Full-text

MATLAB software for extracting protein name and sequence information from FASTA formatted proteome file

10.7287/peerj.preprints.27856v2 ◽

2019 ◽

Author(s):

Wenfa Ng

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Sequence Information ◽

Amino Acid Residues ◽

Protein Database ◽

Matlab Software ◽

Amino Acid Sequence Information ◽

New Protein

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.

Download Full-text

MATLAB software for extracting protein name and sequence information from FASTA formatted proteome file

10.7287/peerj.preprints.27856v1 ◽

2019 ◽

Author(s):

Wenfa Ng

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Nucleotide Sequence ◽

Amino Acid Sequence ◽

Sequence Information ◽

Amino Acid Residues ◽

Protein Database ◽

Matlab Software ◽

Amino Acid Sequence Information ◽

New Protein

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.

Download Full-text

Characterization of cry1Cb3 and cry1Fb7 from Bacillus thuringiensis subsp. galleriae

Open Life Sciences ◽

10.1515/biol-2015-0054 ◽

2015 ◽

Vol 10 (1) ◽

Cited By ~ 1

Author(s):

Tianpei Huang ◽

Ying Xiao ◽

Jieru Pan ◽

Lingling Zhang ◽

Ivan Gelbič ◽

...

Keyword(s):

Molecular Weight ◽

Amino Acid ◽

Bacillus Thuringiensis ◽

Amino Acid Sequence ◽

Amino Acid Residues ◽

Reading Frame ◽

Bacillus Thuringiensis Subsp ◽

Calculated Molecular Weight ◽

Pcr Rflp ◽

Genes Encoding

AbstractTwo cry1-type genes encoding insecticidal crystal proteins (ICPs) were detected by PCR-RFLP and cloned from Bacillus thuringiensis subsp. galleriae 87. The nucleotide sequences were deposited in GenBank with accession numbers EU679501 and EU679502, and designated as cry1Fb7 and cry1Cb3 respectively by B. thuringiensis Delta- Endotoxin Nomenclature Committee. cry1Cb3 shared 99% homology with other cry1Cb genes. The existence of two additional stop codons indicated cry1Cb3 was a silent gene. The cry1Cb3 was 3531 bp with 38.98% G+C content and its first open reading frame (ORF) encoded a protein of 213 amino acid residues with a calculated molecular weight of 23.8 kDa and a predicted pI value of 4.63. Five amino acid sequence blocks (block 1, block 2, block 3, block 4 and block 5) were found in Cry1Cb3. Translation of cry1Fb7 revealed an ORF of 3525 bp with 39.12% G+C content and a protein with a calculated molecular weight of 133.2 kDa and a predicted pI value of 5.18. Cry1Fb7 had five amino acid sequence blocks (blocks 1, 2, 3, 4 and 5) and three domains (I, II and III), which consisted of 218 residues (Leu

Download Full-text