Computational prediction of SEG (single exon gene) function in humans

10.2741/1627 ◽  
2005 ◽  
Vol 10 (1-3) ◽  
pp. 1382 ◽  
Author(s):  
Meena, K. Sakharkar
PLoS ONE ◽  
2012 ◽  
Vol 7 (9) ◽  
pp. e45768 ◽  
Author(s):  
Nikolaos Mandalos ◽  
Marannia Saridaki ◽  
Jessica Lea Harper ◽  
Anastasia Kotsoni ◽  
Peter Yang ◽  
...  

Science ◽  
2020 ◽  
Vol 370 (6512) ◽  
pp. 121-124 ◽  
Author(s):  
Shingo Miyawaki ◽  
Shunsuke Kuroki ◽  
Ryo Maeda ◽  
Naoki Okashita ◽  
Peter Koopman ◽  
...  

The mammalian sex-determining gene Sry induces male development. Since its discovery 30 years ago, Sry has been believed to be a single-exon gene. Here, we identified a cryptic second exon of mouse Sry and a corresponding two-exon type Sry (Sry-T) transcript. XY mice lacking Sry-T were sex-reversed, and ectopic expression of Sry-T in XX mice induced male development. Sry-T messenger RNA is expressed similarly to that of canonical single-exon type Sry (Sry-S), but SRY-T protein is expressed predominantly because of the absence of a degron in the C terminus of SRY-S. Sry exon2 appears to have evolved recently in mice through acquisition of a retrotransposon-derived coding sequence to replace the degron. Our findings suggest that in nature, SRY-T, not SRY-S, is the bona fide testis-determining factor.


2020 ◽  
Author(s):  
George G. Vega Yon ◽  
Duncan C. Thomas ◽  
John Morrison ◽  
Huaiyu Mi ◽  
Paul D. Thomas ◽  
...  

AbstractMotivationGene function annotation is important for a variety of downstream analyses of genetic data. Yet experimental characterization of function remains costly and slow, making computational prediction an important endeavor. In this paper we use a probabilistic evolutionary model built upon phylogenetic trees and experimental Gene Ontology functional annotations that allows automated prediction of function for unannotated genes.ResultsWe have developed a computationally efficient model of evolution of gene annotations using phylogenies based on a Bayesian framework using Markov Chain Monte Carlo for parameter estimation. Unlike previous approaches, our method is able to estimate parameters over many different phylogenetic trees and functions. The resulting parameters agree with biological intuition, such as the increased probability of function change following gene duplication. The method performs well on leave-one-out validation, and we further validated some of the predictions in the experimental scientific literature.AvailabilityOur method has been implemented as an R package and it is available online at https://github.com/USCBiostats/aphylo. Code needed to reproduce the tables and figures can be found in https://github.com/USCbiostats/aphylo-simulations.Author summaryUnderstanding the individual role that genes play in life is a key issue in biomedical-sciences. While information regarding gene functions is continuously growing, the number of genes with unknown biological purpose is yet greater. Because of this, scientists have dedicated much of their time to build and design tools that automatically infer gene functions. In this paper, we present yet another attempt to do such. While very simple, our model of gene-function evolution has some key features that have the potential to generate an impact in the field: (a) compared to other methods, ours is highly-scalable, which means that it is possible to simultaneously analyze hundreds of what are known as gene-families, compromising thousands of genes, (b) supports our biological intuition as our model’s data-driven results coherently agree with what theory dictates regarding how gene-functions evolved, (c) notwithstanding its simplicity, the model’s prediction accuracy is comparable to other more complex alternatives, and (d) perhaps most importantly, it can be used to both support new annotations and to suggest areas in which existing annotations show inconsistencies that may indicate errors or controversies.


2015 ◽  
Vol 27 (1-2) ◽  
pp. 62-69 ◽  
Author(s):  
Masaru Motojima ◽  
Sanae Ogiwara ◽  
Taiji Matsusaka ◽  
Sang Yong Kim ◽  
Nobuho Sagawa ◽  
...  

F1000Research ◽  
2013 ◽  
Vol 2 ◽  
pp. 230 ◽  
Author(s):  
Paul Pavlidis ◽  
Jesse Gillis

In an opinion published in 2012, we reviewed and discussed our studies of how gene network-based guilt-by-association (GBA) is impacted by confounds related to gene multifunctionality. We found such confounds account for a significant part of the GBA signal, and as a result meaningfully evaluating and applying computationally-guided GBA is more challenging than generally appreciated. We proposed that effort currently spent on incrementally improving algorithms would be better spent in identifying the features of data that do yield novel functional insights. We also suggested that part of the problem is the reliance by computational biologists on gold standard annotations such as the Gene Ontology. In the year since, there has been continued heavy activity in GBA-based research, including work that contributes to our understanding of the issues we raised. Here we provide a review of some of the most relevant recent work, or which point to new areas of progress and challenges.


2020 ◽  
Vol 20 (1) ◽  
Author(s):  
Manuel Seefelder ◽  
Vikram Alva ◽  
Bin Huang ◽  
Tatjana Engler ◽  
Wolfgang Baumeister ◽  
...  

Abstract Background The huntingtin-associated protein 40 (HAP40) abundantly interacts with huntingtin (HTT), the protein that is altered in Huntington’s disease (HD). Therefore, we analysed the evolution of HAP40 and its interaction with HTT. Results We found that in amniotes HAP40 is encoded by a single-exon gene, whereas in all other organisms it is expressed from multi-exon genes. HAP40 co-occurs with HTT in unikonts, including filastereans such as Capsaspora owczarzaki and the amoebozoan Dictyostelium discoideum, but both proteins are absent from fungi. Outside unikonts, a few species, such as the free-living amoeboflagellate Naegleria gruberi, contain putative HTT and HAP40 orthologs. Biochemically we show that the interaction between HTT and HAP40 extends to fish, and bioinformatic analyses provide evidence for evolutionary conservation of this interaction. The closest homologue of HAP40 in current protein databases is the family of soluble N-ethylmaleimide-sensitive factor attachment proteins (SNAPs). Conclusion Our results indicate that the transition from a multi-exon to a single-exon gene appears to have taken place by retroposition during the divergence of amphibians and amniotes, followed by the loss of the parental multi-exon gene. Furthermore, it appears that the two proteins probably originated at the root of eukaryotes. Conservation of the interaction between HAP40 and HTT and their likely coevolution strongly indicate functional importance of this interaction.


2006 ◽  
Vol 7 (1) ◽  
pp. 23-34 ◽  
Author(s):  
Pingzhao Hu ◽  
Gary Bader ◽  
Dennis A. Wigle ◽  
Andrew Emili

F1000Research ◽  
2012 ◽  
Vol 1 ◽  
pp. 14 ◽  
Author(s):  
Paul Pavlidis ◽  
Jesse Gillis

In this opinion piece, we attempt to unify recent arguments we have made that serious confounds affect the use of network data to predict and characterize gene function. The development of computational approaches to determine gene function is a major strand of computational genomics research. However, progress beyond using BLAST to transfer annotations has been surprisingly slow. We have previously argued that a large part of the reported success in using "guilt by association" in network data is due to the tendency of methods to simply assign new functions to already well-annotated genes. While such predictions will tend to be correct, they are generic; it is true, but not very helpful, that a gene with many functions is more likely to have any function. We have also presented evidence that much of the remaining performance in cross-validation cannot be usefully generalized to new predictions, making progressive improvement in analysis difficult to engineer. Here we summarize our findings about how these problems will affect network analysis, discuss some ongoing responses within the field to these issues, and consolidate some recommendations and speculation, which we hope will modestly increase the reliability and specificity of gene function prediction.


Sign in / Sign up

Export Citation Format

Share Document