scholarly journals Probably Correct: Rescuing Repeats with Short and Long Reads

Genes ◽  
2020 ◽  
Vol 12 (1) ◽  
pp. 48
Author(s):  
Monika Cechova

Ever since the introduction of high-throughput sequencing following the human genome project, assembling short reads into a reference of sufficient quality posed a significant problem as a large portion of the human genome—estimated 50–69%—is repetitive. As a result, a sizable proportion of sequencing reads is multi-mapping, i.e., without a unique placement in the genome. The two key parameters for whether or not a read is multi-mapping are the read length and genome complexity. Long reads are now able to span difficult, heterochromatic regions, including full centromeres, and characterize chromosomes from “telomere to telomere”. Moreover, identical reads or repeat arrays can be differentiated based on their epigenetic marks, such as methylation patterns, aiding in the assembly process. This is despite the fact that long reads still contain a modest percentage of sequencing errors, disorienting the aligners and assemblers both in accuracy and speed. Here, I review the proposed and implemented solutions to the repeat resolution and the multi-mapping read problem, as well as the downstream consequences of reference choice, repeat masking, and proper representation of sex chromosomes. I also consider the forthcoming challenges and solutions with regards to long reads, where we expect the shift from the problem of repeat localization within a single individual to the problem of repeat positioning within pangenomes.

2020 ◽  
Vol 15 (7) ◽  
pp. 639-645 ◽  
Author(s):  
Ying Li ◽  
Jinglan Li ◽  
Leilei Chen ◽  
Liangliang Xu

The Human Genome Project (HGP) announced in 2001 that it had sequenced the entire human genome, yielding nearly complete human DNA. About 98.5 percent of the human genome has been found to be non-coding sequences. Long non-coding RNA (lncRNA) is a non-coding RNA with a length between 200 and 100,000 nucleotide units. Because of shallow research on lncRNA, it was believed that it had no biological functions, but exists as a by-product of the transcription process. With the development of high-throughput sequencing technology, studies have shown that lncRNA plays important roles in many processes by participating in epigenetics, transcription, translation and protein modification. Current researches have shown that lncRNA also has an important part in the pathogenesis of osteoporosis. Osteoporosis is a common disorder of bone metabolism, also a major medical and socioeconomic challenge worldwide. It is characterized by a systemic reduction in bone mass and microstructure changes, which increases the risk of brittle fractures. It is more common in postmenopausal women and elderly men. However, the roles of lncRNA and relevant mechanisms in osteoporosis remain unclear. Based on this background, we hereby review the roles of lncRNA in osteoporosis, and how it influences the functions of osteoblasts and osteoclasts, providing reference to clinical diagnosis, treatment and prognosis of osteoporosis.


Author(s):  
John Archibald

The initial phase of human genome sequencing is often referred to as ‘the’ Human Genome Project. But there were two different projects, one publicly funded, the other supported by a private company, Craig Venter’s Celera Genomics. ‘The human genome in biology and medicine’ explains that both projects used DNA samples from more than one person. It was not until 2007 that the first genome of a single individual was published. The structure of the nuclear DNA and mitochondrial genome is also described. Genomics research is having a profound impact on our understanding of the genetic, biochemical, and cell biological underpinnings of cancer, and how it can be detected and treated.


Biomolecules ◽  
2021 ◽  
Vol 11 (1) ◽  
pp. 90
Author(s):  
Ryuji Hamamoto

The Human Genome Project, completed in 2003 by an international consortium, is considered one of the most important achievements for mankind in the 21st century [...]


1993 ◽  
Vol 36 (3) ◽  
pp. 466-475 ◽  
Author(s):  
BELINDA J. F. ROSSITER ◽  
C THOMAS CASKEY

Sign in / Sign up

Export Citation Format

Share Document