Amino acid sequence of the acidic acrosin inhibitor (BUSI I B2) from bull seminal plasma

1983 ◽  
Vol 48 (9) ◽  
pp. 2558-2568 ◽  
Author(s):  
Bedřich Meloun ◽  
Věra Jonáková ◽  
Dana Čechová

The molecule of the inhibitor consists of 63 amino acid residues whose sequence is the following: Glu-Ile-Tyr-Phe-Glu-Pro-Asp-Phe-Gly-Phe-Pro-Pro-Asp-Cys-Lys-Val-Tyr-Thr-Glu-Ala-Cys-Thr-Arg-Glu-Tyr-Asn-Pro-Ile-Cys-Asp-Ser-Ala-Ala-Lys-Thr-Tyr-Ser-Asn-Glu-Cys-Thr-Phe-Cys-Asn-Glu-Lys-Met-Asn-Asn-Asp-Ala-Asp-Ile-His-Phe-Gln-His-Phe-Gly-Glu-Cys-Glu-Tyr. The sequential data were obtained by the analysis of peptides isolated from the tryptic and chymotryptic digest of the carboxymethylated inhibitor. The molecular weight of the inhibitor calculated from its amino acid sequence is 7377.

1979 ◽  
Vol 44 (9) ◽  
pp. 2710-2721 ◽  
Author(s):  
Bedřich Meloun ◽  
Dana Čechová

The molecule of the inhibitor contains 57 amino acid residues whose sequence is the following: Pyr-Gly-Ala-Gln-Val-Asp-Cys-Ala-Glu-Phe-Lys-Asp-Pro-Lys-Val-Tyr-Cys-Thr-Arg-His-Ser-Asp-Pro-Gln-Cys-Gly-Ser-Asn-Gly-Glu-Thr-Tyr-Gly-Asn-Lys-Cys-Ala-Phe-Cys-Lys-Ala-Val-Met-Lys-Ser-Gly-Gly-Lys-Ile-Asn-Leu-Lys-His-Arg-Gly-Cys-Lys. The N-terminal group of the inhibitor is pyrrolidone carboxylic acid. The sequential data were obtained by analyses of peptides isolated from tryptic and chymotryptic digests of the oxidized or carboxymethylated inhibitor. The molecular weight of the inhibitor is 6 200.


1971 ◽  
Vol 124 (1) ◽  
pp. 61-67 ◽  
Author(s):  
P. H. Corran ◽  
S. G. Waley

1. Calf lens α-crystallin was carboxymethylated with radioactive sodium iodoacetate to label the thiol group. 2. The protein was then digested with trypsin or alternatively fractionated in urea to obtain the acidic (A) chains, which were then digested with trypsin. Either procedure gave two radioactive peptides containing carboxymethylcysteine. 3. These two peptides were closely related: the longer form contained 28 amino acid residues, and the shorter lacked two residues at the N-terminal end of the longer form. 4. The amino acid sequence of the peptides have been determined. 5. No evidence for the presence of more than one cysteine residue/chain was found. 6. The question of the molecular weight of the chains is discussed.


2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


1975 ◽  
Vol 149 (2) ◽  
pp. 493-496 ◽  
Author(s):  
J M Wilkinson ◽  
R J A. Grand

The complete amino acid sequence of rabbit skeletal muscle troponin I was determined by the isolation of the cyanogen bromide fragments and the tryptic methionine-containing peptides. Troponin I contains 179 amino acid residues and has a molecular weight of 20864. Its N-terminus is acetylated. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50055 (23 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5.


1984 ◽  
Vol 49 (5) ◽  
pp. 1204-1210 ◽  
Author(s):  
Alena Rzavská ◽  
Bedřich Meloun ◽  
Dana Čechová

The thermolysin digest of the basic acrosin inhibitor (molecular weight 6 200) was resolved on a column of Sephadex G-15 and subsequently by electrophoretic and chromatographic techniques. Cysteine peptides, which link together half-cysteine residues 7 and 39, 17 and 36, and 25 and 57 by disulfide bonds, were isolated. The structure of the molecule of the basic acrosin inhibitor corresponds to structures of inhibitors of the Kazal type. The amino acid sequence of a few residues in the basic acrosin inhibitor has been revised.


1976 ◽  
Vol 54 (10) ◽  
pp. 835-842 ◽  
Author(s):  
A. S. Mak ◽  
B. L. Jones

The complete amino acid sequence of β-purothionin, a low molecular weight, very basic, protein isolated from wheat endosperm material, has been determined. β-purothionin is toxic to some bacteria, to yeasts, and to animals when injected. The protein contains 45 amino acid residues and has a molecular weight of 4913. The 8 cysteine and 10 basic residues are distributed throughout the molecule. The primary structure of the protein shows considerable homology to those of the viscotoxins, which are toxic, small, basic proteins found in the leaves and stems of European mistletoe (Viscum album L.).


1985 ◽  
Vol 50 (4) ◽  
pp. 885-896 ◽  
Author(s):  
Bedřich Meloun ◽  
Miroslav Baudyš ◽  
Manfred Pavlík ◽  
Vladimír Kostka ◽  
Gert Hausdorf ◽  
...  

The large cyanogen bromide fragment (CB1) represents the N-terminal part of the molecule of thermitase and contains 226 amino acid residues. Its molecular weight calculated from sequential data is 22 932 (the C-terminal residue is regarded as a methionine residue in the calculations). The amino acid sequence of fragment CB1 was determined by analysis of peptides obtained by tryptic hydrolysis of the fragment; these data were complemented by sequence analysis of the chymotryptic digest of fragment Mf (residues 75 through 226) and of chymotryptic fragment ET3 (residues 103 through 226) isolated from the limited tryptic digest of fragment CB1. The peptides were purified by high performance liquid chromatography and by thin layer techniques. The sequence analysis of the large peptides was effected in the sequenator, small peptides were sequenced manually by the DABITC/PITC double coupling technique. The results obtained in this study together with those of previous work5 permitted the complete amino acid sequence of fragment CB1 to be determined.


2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


2019 ◽  
Author(s):  
Wenfa Ng

FASTA file format is a common file type for distributing proteome information, especially those obtained from Uniprot. While MATLAB could automatically read fasta files using the built-in function, fastaread, important information such as protein name and organism name remain enmeshed in a character array. Hence, difficulty exists in automatic extraction of protein names from fasta proteome file to help in building a database with fields comprising protein name and its amino acid sequence. The objective of this work was in developing a MATLAB software that could automatically extract protein name and amino acid sequence information from fasta proteome file and assign them to a new database that comprises fields such as protein name, amino acid sequence, number of amino acid residues, molecular weight of protein and nucleotide sequence of protein. Information on number of amino acid residues came from the use of the length built-in function in MATLAB analyzing the length of the amino acid sequence of a protein. The final two fields were provided by MATLAB built-in functions molweight and aa2nt, respectively. Molecular weight of proteins is useful for a variety of applications while nucleotide sequence is essential for gene synthesis applications in molecular cloning. Finally, the MATLAB software is also equipped with an error check function to help detect letters in the amino acid sequence that are not part of the family of 20 natural amino acids. Sequences with such letters would constitute as error inputs to molweight and aa2nt, and would not be processed. Collectively, given that important information such as protein name is enmeshed in a character array in fasta proteome file, this work sets out to develop a MATLAB software that could automatically extract protein name and amino acid sequence information, and assigns them to a new protein database. Using built-in functions, number of amino acid residues, molecular weight and nucleotide sequence of each protein were calculated; thereby, yielding a new protein database with improved functionalities that could support a variety of biology workflows ranging from sequence alignment to molecular cloning.


2015 ◽  
Vol 10 (1) ◽  
Author(s):  
Tianpei Huang ◽  
Ying Xiao ◽  
Jieru Pan ◽  
Lingling Zhang ◽  
Ivan Gelbič ◽  
...  

AbstractTwo cry1-type genes encoding insecticidal crystal proteins (ICPs) were detected by PCR-RFLP and cloned from Bacillus thuringiensis subsp. galleriae 87. The nucleotide sequences were deposited in GenBank with accession numbers EU679501 and EU679502, and designated as cry1Fb7 and cry1Cb3 respectively by B. thuringiensis Delta- Endotoxin Nomenclature Committee. cry1Cb3 shared 99% homology with other cry1Cb genes. The existence of two additional stop codons indicated cry1Cb3 was a silent gene. The cry1Cb3 was 3531 bp with 38.98% G+C content and its first open reading frame (ORF) encoded a protein of 213 amino acid residues with a calculated molecular weight of 23.8 kDa and a predicted pI value of 4.63. Five amino acid sequence blocks (block 1, block 2, block 3, block 4 and block 5) were found in Cry1Cb3. Translation of cry1Fb7 revealed an ORF of 3525 bp with 39.12% G+C content and a protein with a calculated molecular weight of 133.2 kDa and a predicted pI value of 5.18. Cry1Fb7 had five amino acid sequence blocks (blocks 1, 2, 3, 4 and 5) and three domains (I, II and III), which consisted of 218 residues (Leu


Sign in / Sign up

Export Citation Format

Share Document