scholarly journals Human genome annotation — a possible role for HUGO?

10.1038/896 ◽  
1998 ◽  
Vol 19 (3) ◽  
pp. 222-222 ◽  
Author(s):  
Peter Little
2012 ◽  
Vol 22 (9) ◽  
pp. 1760-1774 ◽  
Author(s):  
J. Harrow ◽  
A. Frankish ◽  
J. M. Gonzalez ◽  
E. Tapanari ◽  
M. Diekhans ◽  
...  

2013 ◽  
Vol 14 (Suppl 11) ◽  
pp. S8 ◽  
Author(s):  
Po-Yen Wu ◽  
John H Phan ◽  
May D Wang

2001 ◽  
Vol 155 (5) ◽  
pp. 703-704 ◽  
Author(s):  
Peter G. Gillespie ◽  
Joseph P. Albanesi ◽  
Martin Bähler ◽  
William M. Bement ◽  
Jonathan S. Berg ◽  
...  

We suggest that the vertebrate myosin-I field adopt a common nomenclature system based on the names adopted by the Human Genome Organization (HUGO). At present, the myosin-I nomenclature is very confusing; not only are several systems in use, but several different genes have been given the same name. Despite their faults, we believe that the names adopted by the HUGO nomenclature group for genome annotation are the best compromise, and we recommend universal adoption.


2014 ◽  
Author(s):  
Iakes Ezkurdia ◽  
David Juan ◽  
Jose Manuel Rodriguez ◽  
Adam Frankish ◽  
Mark Deikhans ◽  
...  

Determining the full complement of protein-coding genes is a key goal of genome annotation. The most powerful approach for confirming protein coding potential is the detection of cellular protein expression through peptide mass spectrometry experiments. Here we map the peptides detected in 7 large-scale proteomics studies to almost 60% of the protein coding genes in the GENCODE annotation the human genome. We find that conservation across vertebrate species and the age of the gene family are key indicators of whether a peptide will be detected in proteomics experiments. We find peptides for most highly conserved genes and for practically all genes that evolved before bilateria. At the same time there is almost no evidence of protein expression for genes that have appeared since primates, or for genes that do not have any protein-like features or cross-species conservation. We identify 19 non-protein-like features such as weak conservation, no protein features or ambiguous annotations in major databases that are indicators of low peptide detection rates. We use these features to describe a set of 2,001 genes that are potentially non-coding, and show that many of these genes behave more like non-coding genes than protein-coding genes. We detect peptides for just 3% of these genes. We suggest that many of these 2,001 genes do not code for proteins under normal circumstances and that they should not be included in the human protein coding gene catalogue. These potential non-coding genes will be revised as part of the ongoing human genome annotation effort.


2015 ◽  
Author(s):  
Qiongshi Lu ◽  
Yiming Hu ◽  
Jiehuan Sun ◽  
Yuwei Cheng ◽  
Kei-Hoi Cheung ◽  
...  

Identifying functional regions in the human genome is a major goal in human genetics. Great efforts have been made to functionally annotate the human genome either through computational predictions, such as genomic conservation, or high-throughput experiments, such as the ENCODE project. These efforts have resulted in a rich collection of functional annotation data of diverse types that need to be jointly analyzed for integrated interpretation and annotation. Here we present GenoCanyon, a whole-genome annotation method that performs unsupervised statistical learning using 22 computational and experimental annotations thereby inferring the functional potential of each position in the human genome. With GenoCanyon, we are able to predict many of the known functional regions. The ability of predicting functional regions as well as its generalizable statistical framework makes GenoCanyon a unique and powerful tool for whole-genome annotation. The GenoCanyon web server is available at http://genocanyon.med.yale.edu


2019 ◽  
Vol 63 (6) ◽  
pp. 757-771 ◽  
Author(s):  
Claire Francastel ◽  
Frédérique Magdinier

Abstract Despite the tremendous progress made in recent years in assembling the human genome, tandemly repeated DNA elements remain poorly characterized. These sequences account for the vast majority of methylated sites in the human genome and their methylated state is necessary for this repetitive DNA to function properly and to maintain genome integrity. Furthermore, recent advances highlight the emerging role of these sequences in regulating the functions of the human genome and its variability during evolution, among individuals, or in disease susceptibility. In addition, a number of inherited rare diseases are directly linked to the alteration of some of these repetitive DNA sequences, either through changes in the organization or size of the tandem repeat arrays or through mutations in genes encoding chromatin modifiers involved in the epigenetic regulation of these elements. Although largely overlooked so far in the functional annotation of the human genome, satellite elements play key roles in its architectural and topological organization. This includes functions as boundary elements delimitating functional domains or assembly of repressive nuclear compartments, with local or distal impact on gene expression. Thus, the consideration of satellite repeats organization and their associated epigenetic landmarks, including DNA methylation (DNAme), will become unavoidable in the near future to fully decipher human phenotypes and associated diseases.


Sign in / Sign up

Export Citation Format

Share Document