Human mucin gene MUC5AC: organization of its 5′-region and central repetitive region

2001 ◽  
Vol 358 (3) ◽  
pp. 763-772 ◽  
Author(s):  
Fabienne ESCANDE ◽  
Jean-Pierre AUBERT ◽  
Nicole PORCHET ◽  
Marie-Pierre BUISINE

Human mucin gene MUC5AC is clustered with MUC2, MUC5B and MUC6 on chromosome 11p15.5. We report here the full length cDNA sequence upstream of the repetitive region of human MUC5AC. We have also determined the sequence of its large central tandem repeat array. The 5′-region reveals high degree of sequence similarity with MUC2 and MUC5B and codes for 1336 amino acids organized into a signal peptide, four pro-von Willebrand factor-like D domains (D1, D2, D′ and D3) and a short domain which connects to the central repetitive region. In the central region, 17 major domains have been identified. Nine code for cysteine-rich domains (Cys-domains 1–9) and exhibit high sequence similarity to the cysteine-rich domains described in the central region of MUC2 and MUC5B. Cys-domains 1–5 are interspersed by domains enriched with serine, threonine, and proline residues. Cys-domains 1–9 are interspersed by four domains (TR1–TR4) composed of various numbers of MUC5AC-type repeats. Southern-blot analyses reveal allelic variations both in length and nucleotide sequence. The length polymorphism which is due to variable numbers of tandem repeats is located in TR1 and TR4, whereas a mutation polymorphism detected with TaqI is located in Cys-domain 6. In this study, the organization of MUC5AC has been entirely elucidated showing extensive similarity to the other chromosome 11p15 MUC genes, particularly MUC5B, and providing additional arguments for common evolution from a single ancestral gene.

1998 ◽  
Vol 332 (3) ◽  
pp. 739-748 ◽  
Author(s):  
Séverine NOLLET ◽  
Nicolas MONIAUX ◽  
Jacques MAURY ◽  
Danièle PETITPREZ ◽  
Pierre DEGAND ◽  
...  

In a previous study we isolated a partial cDNA with a tandem repeat of 48 bp, which allowed us to map a novel human mucin gene named MUC4to chromosome 3q29. Here we report the organization and sequence of the 5´-region and its junction with the tandem repeat array of MUC4. Analysis of three overlapping genomic clones allowed us to obtain a partial restriction map of MUC4 and to locate the complete 48 bp tandem repeat domain on a PstI/EcoRI genomic fragment that exhibits a very large variation in number of tandem repeats (7–19 kb). cDNA clonal extension allowed us to obtain the entire 5´ coding region of MUC4. Exon 1 consists of a 5´ untranslated region and an 82 bp fragment encoding the signal peptide. This latter shows a high degree of similarity to the signal peptide of another apomucin, ASGP-1. Exon 2 is extremely large and contains a unique sequence that is followed by the whole tandem repeat domain. It encodes only one cysteine residue, making MUC4 different from mucin genes belonging to the 11p15.5 family. Moreover, an intron downstream from the tandem repeat array consists mainly of a 15 bp tandem repeat that exhibits a polymorphism in having a variable number of tandem repeats.


2002 ◽  
Vol 363 (3) ◽  
pp. 589-598 ◽  
Author(s):  
Fabienne ESCANDE ◽  
Nicole PORCHET ◽  
Jean-Pierre AUBERT ◽  
Marie-Pierre BUISINE

We report here the isolation and characterization of the mouse Muc5b mucin gene (mMuc5b). We determined its complete cDNA sequence, its genomic organization, and chromosomal localization. Moreover, we analyzed the expression of this gene by reverse-transcription PCR and in situ hybridization. The structure of the gene was determined from a genomic cosmid clone that encompasses the entire mMuc5b gene, including the 5′-flanking region. The mMuc5b gene spans approximately 36kb and contains 49 exons. It is located on mouse distal chromosome 7. mMuc5b encodes at least two transcripts by alternative splicing of the second exon, the longest one being 14.9kb in length. The deduced peptide contains 4782 amino acids. Its central region can be subdivided into 10 imperfect repeats, each composed of a cysteine-rich domain followed by a threonine, serine, and proline-rich mucin-type domain. It is flanked by cysteine-rich domains similar to cysteine-rich domains of pre-pro-von Willebrand factor. Comparison with its human homologue MUC5B revealed common features including high sequence similarities in the 5′ and 3′ regions, and the conservation of the genomic organization. In contrast, mMuc5b differs from its human homologue, since no highly tandemly repeated sequences could be identified within its central region. mMuc5b is expressed mainly in laryngeal mucous glands, and at a lesser extend in stomach and duodenum.


Archaea ◽  
2006 ◽  
Vol 2 (2) ◽  
pp. 109-115 ◽  
Author(s):  
Luigi Mandrich ◽  
Margherita Pezzullo ◽  
Mosè Rossi ◽  
Giuseppe Manco

Previously, we reported from theSulfolobus solfataricusopen reading frame (ORF) SSO2517 the cloning, overexpression and characterization of an esterase belonging to the hormone-sensitive lipase (HSL) family and apparently having a deletion at the N-terminus, which we namedSsoNΔ. Searching the recently reportedSulfolobus acidocaldariusgenome by sequence alignment, using SSO2517 as a query, allowed identity of a putative esterase (ORF SAC1105) sharing high sequence similarity (82%) with SSO2517. This esterase displays an N-terminus and total length similar to other known esterases of the HSL family. Analysis of the upstream DNA sequence of SS02517 revealed the possibility of expressing a longer version of the protein with an extended N-terminus; however, no clear translation signal consistent with a longer protein version was detected. This new version of SSO2517 was cloned, over-expressed, purified and characterized. The resulting protein, namedSsoNΔlong, was 15-fold more active with the substratep-nitrophenyl hexanoate thanSsoNΔ. Furthermore,SsoNΔlong andSsoNΔ displayed different substrate specificities for triacylglycerols. These results and the phylogenetic relationship betweenS. solfataricusandS. acidocaldariussuggest a common origin of SSO2517 and SAC1105 from an ancestral gene, followed by divergent evolution. Alternatively, a yet-to-be discovered mechanism of translation that directs the expression ofSsoNΔlong under specific metabolic conditions could be hypothesized.


1995 ◽  
Vol 311 (3) ◽  
pp. 775-785 ◽  
Author(s):  
L L Shekels ◽  
C Lyftogt ◽  
M Kieliszewski ◽  
J D Filie ◽  
C A Kozak ◽  
...  

Mucins protect gastric epithelium by maintaining a favourable pH gradient and preventing autodigestion. The purpose of this study was to clone a mouse gastric mucin which would provide a foundation for analysis of mucin gene regulation. Mucin was purified from the glandular portion of gastric specimens and deglycosylated by HF solvolysis. Antibodies against native and deglycosylated mouse gastric mucin (MGM) were raised in chickens. Screening of a mouse stomach cDNA library with the anti-(deglycosylated MGM) antibody yielded partial clones containing a 48 bp tandem repeat and 768 bp of non-repetitive sequence. The 16-amino-acid tandem repeat has a consensus sequence of QTSSPNTGKTSTISTT with 25% serine and 38% threonine. The MGM tandem repeat sequence bears no similarity to previously identified mucins. The MGM non-repetitive region shares sequence similarity with human MUC5AC and, to a lesser extent, human MUC2 and rat intestinal mucin. Northern blot analysis reveals a polydisperse message beginning at 13.5 kb in mouse stomach with no expression in oesophagus, trachea, small intestine, large intestine, caecum, lung or kidney. Immunoreactivity of antibodies against deglycosylated MGM and against a synthetic MGM tandem repeat peptide was restricted to superficial mucous cells, antral glands and Brunner's glands in the pyloric-duodenal region. DNA analysis shows that MGM recognizes mouse and rat DNA but not hamster, rabbit or human DNA. The MGM gene maps to a site on mouse chromosome 7 homologous to the location of a human secretory mucin gene cluster on human chromosome 11p15. Due to sequence similarity and predominant expression in the stomach, the MGM gene may be considered a MUC5AC homologue and named Muc5ac.


Plant Methods ◽  
2021 ◽  
Vol 17 (1) ◽  
Author(s):  
Mohamed Ramadan ◽  
Muna Alariqi ◽  
Yizan Ma ◽  
Yanlong Li ◽  
Zhenping Liu ◽  
...  

Abstract Background Upland cotton (Gossypium hirsutum), harboring a complex allotetraploid genome, consists of A and D sub-genomes. Every gene has multiple copies with high sequence similarity that makes genetic, genomic and functional analyses extremely challenging. The recent accessibility of CRISPR/Cas9 tool provides the ability to modify targeted locus efficiently in various complicated plant genomes. However, current cotton transformation method targeting one gene requires a complicated, long and laborious regeneration process. Hence, optimizing strategy that targeting multiple genes is of great value in cotton functional genomics and genetic engineering. Results To target multiple genes in a single experiment, 112 plant development-related genes were knocked out via optimized CRISPR/Cas9 system. We optimized the key steps of pooled sgRNAs assembly method by which 116 sgRNAs pooled together into 4 groups (each group consisted of 29 sgRNAs). Each group of sgRNAs was compiled in one PCR reaction which subsequently went through one round of vector construction, transformation, sgRNAs identification and also one round of genetic transformation. Through the genetic transformation mediated Agrobacterium, we successfully generated more than 800 plants. For mutants identification, Next Generation Sequencing technology has been used and results showed that all generated plants were positive and all targeted genes were covered. Interestingly, among all the transgenic plants, 85% harbored a single sgRNA insertion, 9% two insertions, 3% three different sgRNAs insertions, 2.5% mutated sgRNAs. These plants with different targeted sgRNAs exhibited numerous combinations of phenotypes in plant flowering tissues. Conclusion All targeted genes were successfully edited with high specificity. Our pooled sgRNAs assembly offers a simple, fast and efficient method/strategy to target multiple genes in one time and surely accelerated the study of genes function in cotton.


Author(s):  
Angélique Buton ◽  
Louis-Marie Bobay

Abstract Homologous recombination is a key pathway found in nearly all bacterial taxa. The recombination complex allows bacteria to repair DNA double strand breaks but also promotes adaption through the exchange of DNA between cells. In Proteobacteria, this process is mediated by the RecBCD complex, which relies on the recognition of a DNA motif named Chi to initiate recombination. The Chi motif has been characterized in Escherichia coli and analogous sequences have been found in several other species from diverse families, suggesting that this mode of action is widespread across bacteria. However, the sequences of Chi-like motifs are known for only five bacterial species: E. coli, Haemophilus influenzae, Bacillus subtilis, Lactococcus lactis and Staphylococcus aureus. In this study we detected putative Chi motifs in a large dataset of Proteobacteria and we identified four additional motifs sharing high sequence similarity and similar properties to the Chi motif of E. coli in 85 species of Proteobacteria. Most Chi motifs were detected in Enterobacteriaceae and this motif appears well conserved in this family. However, we did not detect Chi motifs for the majority of Proteobacteria, suggesting that different motifs are used in these species. Altogether these results substantially expand our knowledge on the evolution of Chi motifs and on the recombination process in bacteria.


Genetics ◽  
2004 ◽  
Vol 166 (2) ◽  
pp. 947-957 ◽  
Author(s):  
John G Jelesko ◽  
Kristy Carter ◽  
Whitney Thompson ◽  
Yuki Kinoshita ◽  
Wilhelm Gruissem

Abstract Paralogous genes organized as a gene cluster can rapidly evolve by recombination between misaligned paralogs during meiosis, leading to duplications, deletions, and novel chimeric genes. To model unequal recombination within a specific gene cluster, we utilized a synthetic RBCSB gene cluster to isolate recombinant chimeric genes resulting from meiotic recombination between paralogous genes on sister chromatids. Several F1 populations hemizygous for the synthRBCSB1 gene cluster gave rise to Luc+ F2 plants at frequencies ranging from 1 to 3 × 10-6. A nonuniform distribution of recombination resolution sites resulted in the biased formation of recombinant RBCS3B/1B::LUC genes with nonchimeric exons. The positioning of approximately half of the mapped resolution sites was effectively modeled by the fractional length of identical DNA sequences. In contrast, the other mapped resolution sites fit an alternative model in which recombination resolution was stimulated by an abrupt transition from a region of relatively high sequence similarity to a region of low sequence similarity. Thus, unequal recombination between paralogous RBCSB genes on sister chromatids created an allelic series of novel chimeric genes that effectively resulted in the diversification rather than the homogenization of the synthRBCSB1 gene cluster.


Plants ◽  
2020 ◽  
Vol 9 (12) ◽  
pp. 1692
Author(s):  
Li Gu ◽  
Ting Su ◽  
Ming-Tai An ◽  
Guo-Xiong Hu

Oreocharis esquirolii, a member of Gesneriaceae, is known as Thamnocharis esquirolii, which has been regarded a synonym of the former. The species is endemic to Guizhou, southwestern China, and is evaluated as vulnerable (VU) under the International Union for Conservation of Nature (IUCN) criteria. Until now, the sequence and genome information of O. esquirolii remains unknown. In this study, we assembled and characterized the complete chloroplast (cp) genome of O. esquirolii using Illumina sequencing data for the first time. The total length of the cp genome was 154,069 bp with a typical quadripartite structure consisting of a pair of inverted repeats (IRs) of 25,392 bp separated by a large single copy region (LSC) of 85,156 bp and a small single copy region (SSC) of18,129 bp. The genome comprised 114 unique genes with 80 protein-coding genes, 30 tRNA genes, and four rRNA genes. Thirty-one repeat sequences and 74 simple sequence repeats (SSRs) were identified. Genome alignment across five plastid genomes of Gesneriaceae indicated a high sequence similarity. Four highly variable sites (rps16-trnQ, trnS-trnG, ndhF-rpl32, and ycf 1) were identified. Phylogenetic analysis indicated that O. esquirolii grouped together with O. mileensis, supporting resurrection of the name Oreocharis esquirolii from Thamnocharisesquirolii. The complete cp genome sequence will contribute to further studies in molecular identification, genetic diversity, and phylogeny.


2016 ◽  
Vol 198 (9) ◽  
pp. 1393-1400 ◽  
Author(s):  
Guangyu E. Chen ◽  
Andrew Hitchcock ◽  
Philip J. Jackson ◽  
Roy R. Chaudhuri ◽  
Mark J. Dickman ◽  
...  

ABSTRACTThe major photopigment of the cyanobacteriumAcaryochloris marinais chlorophylld, while its direct biosynthetic precursor, chlorophylla, is also present in the cell. These pigments, along with the majority of chlorophylls utilized by oxygenic phototrophs, carry an ethyl group at the C-8 position of the molecule, having undergone reduction of a vinyl group during biosynthesis. Two unrelated classes of 8-vinyl reductase involved in the biosynthesis of chlorophylls are known to exist, BciA and BciB. The genome ofAcaryochloris marinacontains open reading frames (ORFs) encoding proteins displaying high sequence similarity to BciA or BciB, although they are annotated as genes involved in transcriptional control (nmrA) and methanogenesis (frhB), respectively. These genes were introduced into an 8-vinyl chlorophylla-producing ΔbciBstrain ofSynechocystissp. strain PCC 6803, and both were shown to restore synthesis of the pigment with an ethyl group at C-8, demonstrating their activities as 8-vinyl reductases. We propose thatnmrAandfrhBbe reassigned asbciAandbciB, respectively; transcript and proteomic analysis ofAcaryochloris marinareveal that bothbciAandbciBare expressed and their encoded proteins are present in the cell, possibly in order to ensure that all synthesized chlorophyll pigment carries an ethyl group at C-8. Potential reasons for the presence of two 8-vinyl reductases in this strain, which is unique for cyanobacteria, are discussed.IMPORTANCEThe cyanobacteriumAcaryochloris marinais the best-studied phototrophic organism that uses chlorophylldfor photosynthesis. Unique among cyanobacteria sequenced to date, its genome contains ORFs encoding two unrelated enzymes that catalyze the reduction of the C-8 vinyl group of a precursor molecule to an ethyl group. Carrying a reduced C-8 group may be of particular importance to organisms containing chlorophylld. Plant genomes also contain orthologs of both of these genes; thus, the bacterial progenitor of the chloroplast may also have contained bothbciAandbciB.


2007 ◽  
Vol 57 (6) ◽  
pp. 1243-1249 ◽  
Author(s):  
Ngoc-Phuc Hua ◽  
Atsuko Kanekiyo ◽  
Katsunori Fujikura ◽  
Hisato Yasuda ◽  
Takeshi Naganuma

Two Gram-positive, rod-shaped, moderately halophilic bacteria were isolated from a deep-sea carbonate rock at a methane cold seep in Kuroshima Knoll, Japan. These bacteria, strains IS-Hb4T and IS-Hb7T, were spore-forming and non-motile. They were able to grow at temperatures as low as 9 °C and hydrostatic pressures up to 30 MPa. Based on high sequence similarity of their 16S rRNA genes to those of type strains of the genus Halobacillus, from 96.4 % (strain IS-Hb7T to Halobacillus halophilus NCIMB 9251T) to 99.4 % (strain IS-Hb4T to Halobacillus dabanensis D-8T), the strains were shown to belong to this genus. DNA–DNA relatedness values of 49.5 % and 1.0–33.0 %, respectively, were determined between strains IS-Hb4T and IS-Hb7T and between these strains and other Halobacillus type strains. Both strains showed the major menaquinone MK7 and l-orn–d-Asp cell-wall peptidoglycan type. Straight-chain C16 : 0, unsaturated C16 : 1 ω7c alcohol and C18 : 1 ω7c and cyclopropane C19 : 0 cyc fatty acids were predominant in both strains. The DNA G+C contents of IS-Hb4T and IS-Hb7T were respectively 43.3 and 42.1 mol%. Physiological and biochemical analyses combined with DNA–DNA hybridization results allowed us to place strains IS-Hb4T (=JCM 14154T=DSM 18394T) and IS-Hb7T (=JCM 14155T=DSM 18393T) in the genus Halobacillus as the respective type strains of the novel species Halobacillus profundi sp. nov. and Halobacillus kuroshimensis sp. nov.


Sign in / Sign up

Export Citation Format

Share Document