scholarly journals A local duplication of the Melanocortin receptor 1 locus in Astyanax

Genome ◽  
2018 ◽  
Vol 61 (4) ◽  
pp. 254-265 ◽  
Author(s):  
Joshua B. Gross ◽  
James Weagley ◽  
Bethany A. Stahl ◽  
Li Ma ◽  
Luis Espinasa ◽  
...  

In this study, we report evidence of a novel duplication of Melanocortin receptor 1 (Mc1r) in the cavefish genome. This locus was discovered following the observation of excessive allelic diversity in a ∼820 bp fragment of Mc1r amplified via degenerate PCR from a natural population of Astyanax aeneus fish from Guerrero, Mexico. The cavefish genome reveals the presence of two closely related Mc1r open reading frames separated by a 1.46 kb intergenic region. One open reading frame corresponds to the previously reported Mc1r receptor, and the other open reading frame (duplicate copy) is 975 bp in length, encoding a receptor of 325 amino acids. Sequence similarity analyses position both copies in the syntenic region of the single Mc1r locus in 16 representative craniate genomes spanning bony fish (including Astyanax) to mammals, suggesting we discovered tandem duplicates of this important gene. The two Mc1r copies share ∼89% sequence similarity and, within Astyanax, are more similar to one another compared to other melanocortin family members. Future studies will inform the precise functional significance of the duplicated Mc1r locus and if this novel copy number variant may have adaptive significance for the Astyanax lineage.

1999 ◽  
Vol 10 (04) ◽  
pp. 635-643 ◽  
Author(s):  
AGNIESZKA GIERLIK ◽  
PAWEŁ MACKIEWICZ ◽  
MARIA KOWALCZUK ◽  
STANISŁAW CEBRAT ◽  
MIROSŁAW R. DUDEK

Coding sequences of DNA generate Open Reading Frames (ORFs) inside them with much higher frequency than random DNA sequences do, especially in the antisense strand. This is a specific feature of the genetic code. Since coding sequences are selected for their length, the generated ORFs are indirect results of this selection and their length is also influenced by selection. That is why ORFs found in any genome, even much longer ones than those spontaneously generated in random DNA sequences, should be considered as two different sets of ORFs: The first one coding for proteins, the second one generated by the coding ORFs. Even intergenic sequences possess greater capacity for generating ORFs than random DNA sequences of the same nucleotide composition, which seems to be a premise that intergenic sequences were generated from coding sequences by recombinational mechanisms.


2004 ◽  
Vol 78 (21) ◽  
pp. 11544-11550 ◽  
Author(s):  
Paul Kraft ◽  
Andrea Oeckinghaus ◽  
Daniel Kümmel ◽  
George H. Gauss ◽  
John Gilmore ◽  
...  

ABSTRACT Sulfolobus spindle-shaped viruses (SSVs), or Fuselloviridae, are ubiquitous crenarchaeal viruses found in high-temperature acidic hot springs around the world (pH ≤4.0; temperature of ≥70°C). Because they are relatively easy to isolate, they represent the best studied of the crenarchaeal viruses. This is particularly true for the type virus, SSV1, which contains a double-stranded DNA genome of 15.5 kilobases, encoding 34 putative open reading frames. Interestingly, the genome shows little sequence similarity to organisms other than its SSV homologues. Together, sequence similarity and biochemical analyses have suggested functions for only 6 of the 34 open reading frames. Thus, even though SSV1 is the best-studied crenarchaeal virus, functions for most (28) of its open reading frames remain unknown. We have undertaken biochemical and structural studies for the gene product of open reading frame F-93. We find that F-93 exists as a homodimer in solution and that a tight dimer is also present in the 2.7-Å crystal structure. Further, the crystal structure reveals a fold that is homologous to the SlyA and MarR subfamilies of winged-helix DNA binding proteins. This strongly suggests that F-93 functions as a transcription factor that recognizes a (pseudo-)palindromic DNA target sequence.


2008 ◽  
Vol 82 (17) ◽  
pp. 8917-8921 ◽  
Author(s):  
Christopher J. McCormick ◽  
Omar Salim ◽  
Paul R. Lambden ◽  
Ian N. Clarke

ABSTRACT A generally accepted view of norovirus replication is that capsid expression requires production of a subgenomic transcript, the presence of capsid often being used as a surrogate marker to indicate the occurrence of viral replication. Using a polymerase II-based baculovirus delivery system, we observed capsid expression following introduction of a full-length genogroup 3 norovirus genome into HepG2 cells. However, capsid expression occurred as a result of a novel translation termination/reinitiation event between the nonstructural-protein and capsid open reading frames, a feature that may be unique to genogroup 3 noroviruses.


1987 ◽  
Vol 7 (8) ◽  
pp. 2728-2734 ◽  
Author(s):  
C A Strick ◽  
T D Fox

The yeast nuclear gene PET111 is required specifically for translation of the mitochondrion-coded mRNA for cytochrome c oxidase subunit II. We have determined the nucleotide sequence of a 3-kilobase segment of DNA that carries PET111. The sequence contains a single long open reading frame that predicts a basic protein of 718 amino acids. The PET111 gene product is a mitochondrial protein, since a hybrid protein which includes the amino-terminal 154 amino acids of PET111 fused to beta-galactosidase is specifically associated with mitochondria. PET111 is translated from a 2.9-kilobase mRNA which, interestingly, has an extended 5'-leader sequence containing four short open reading frames upstream of the long open reading frame. These open reading frames exhibit an interesting pattern of overlap with each other and with the PET111 reading frame.


1987 ◽  
Vol 7 (8) ◽  
pp. 2728-2734
Author(s):  
C A Strick ◽  
T D Fox

The yeast nuclear gene PET111 is required specifically for translation of the mitochondrion-coded mRNA for cytochrome c oxidase subunit II. We have determined the nucleotide sequence of a 3-kilobase segment of DNA that carries PET111. The sequence contains a single long open reading frame that predicts a basic protein of 718 amino acids. The PET111 gene product is a mitochondrial protein, since a hybrid protein which includes the amino-terminal 154 amino acids of PET111 fused to beta-galactosidase is specifically associated with mitochondria. PET111 is translated from a 2.9-kilobase mRNA which, interestingly, has an extended 5'-leader sequence containing four short open reading frames upstream of the long open reading frame. These open reading frames exhibit an interesting pattern of overlap with each other and with the PET111 reading frame.


2019 ◽  
Vol 15 (2) ◽  
pp. 108-116 ◽  
Author(s):  
Alexandra Khitun ◽  
Travis J. Ness ◽  
Sarah A. Slavoff

Increasing evidence suggests that some small open reading frame-encoded polypeptides (SEPs) function in prokaryotic and eukaryotic cellular stress responses.


2020 ◽  
Author(s):  
Xiangwen Ji ◽  
Chunmei Cui ◽  
Qinghua Cui

Abstract Background Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop computational methods to annotate the functions of these smORFs. Results In this study, we collected 617,462 unique smORFs from three studies. The expression of smORF RNAs was estimated by reannotated microarray probes. Using a speed-optimized correlation algorism, the functions of smORFs were predicted by their correlated genes with known functional annotations. After applying our method to 5 known microproteins from literatures, our method successfully predicted their functions. Further validation from the UniProt database showed that at least one function of 202 out of 270 microproteins was predicted. Conclusions We developed a method, smORFunction, to provide function predictions of smORFs/microproteins in at most 265 models generated from 173 datasets, including 48 tissues/cells, 82 diseases (and normal). The tool can be available at http://www.cuilab.cn/smorfunction.


2002 ◽  
Vol 83 (9) ◽  
pp. 2303-2307 ◽  
Author(s):  
Masanori Terai ◽  
Robert D. Burk

We have characterized the complete genome (8300 bp) of an isolate of Felis domesticus papillomavirus (FdPV) from a domestic cat with cutaneous papillomatosis. A BLAST homology search using the nucleotide sequence of the L1 open reading frame demonstrated that the FdPV genome was most closely related to canine oral papillomavirus (COPV). A 384 bp non-coding region (NCR) was found between the end of L1 and the beginning of E6, and a 1·3 kbp NCR was located between the end of E2 and the beginning of L2. Phylogenetic analysis placed FdPV in the E3 clade with COPV. Both viruses contain the atypical second NCR, which has no homology with sequences in existing databases.


2020 ◽  
Author(s):  
Xiangwen Ji ◽  
Chunmei Cui ◽  
Qinghua Cui

Abstract Background Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop computational methods to annotate the functions of these smORFs. Results In this study, we collected 617,462 unique smORFs from three studies. The expression of smORF RNAs was estimated by reannotated microarray probes. Using a speed-optimized correlation algorism, the functions of smORFs were predicted by their correlated genes with known functional annotations. After applying our method to 5 known microproteins from literatures, our method successfully predicted their functions. Further validation from the UniProt database showed that at least one function of 202 out of 270 microproteins was predicted. Conclusions We developed a method, smORFunction, to provide function predictions of smORFs/microproteins in at most 265 models generated from 173 datasets, including 48 tissues/cells, 82 diseases (and normal). The tool can be available at http://www.cuilab.cn/smorfunction.


2020 ◽  
Vol 21 (1) ◽  
Author(s):  
Xiangwen Ji ◽  
Chunmei Cui ◽  
Qinghua Cui

Abstract Background Small open reading frame (smORF) is open reading frame with a length of less than 100 codons. Microproteins, translated from smORFs, have been found to participate in a variety of biological processes such as muscle formation and contraction, cell proliferation, and immune activation. Although previous studies have collected and annotated a large abundance of smORFs, functions of the vast majority of smORFs are still unknown. It is thus increasingly important to develop computational methods to annotate the functions of these smORFs. Results In this study, we collected 617,462 unique smORFs from three studies. The expression of smORF RNAs was estimated by reannotated microarray probes. Using a speed-optimized correlation algorism, the functions of smORFs were predicted by their correlated genes with known functional annotations. After applying our method to 5 known microproteins from literatures, our method successfully predicted their functions. Further validation from the UniProt database showed that at least one function of 202 out of 270 microproteins was predicted. Conclusions We developed a method, smORFunction, to provide function predictions of smORFs/microproteins in at most 265 models generated from 173 datasets, including 48 tissues/cells, 82 diseases (and normal). The tool can be available at https://www.cuilab.cn/smorfunction.


Sign in / Sign up

Export Citation Format

Share Document