scholarly journals De Novo Assembly and Genomic Structural Variation Analysis with Genome Sequencer FLX 3K Long-Tag Paired End Reads

BioTechniques ◽  
2008 ◽  
Vol 44 (6) ◽  
pp. 829-831 ◽  
Author(s):  
Thomas Jarvie ◽  
Timothy Harkins
2011 ◽  
Vol 29 (8) ◽  
pp. 723-730 ◽  
Author(s):  
Yingrui Li ◽  
Hancheng Zheng ◽  
Ruibang Luo ◽  
Honglong Wu ◽  
Hongmei Zhu ◽  
...  

2019 ◽  
Vol 4 (12) ◽  
pp. 1759-1762
Author(s):  
Yuka Sugawara ◽  
Hideki Kato ◽  
Yoko Yoshida ◽  
Madoka Fujisawa ◽  
Koichi Kokame ◽  
...  

2018 ◽  
Author(s):  
Yen-Lung Lin ◽  
Omer Gokcumen

AbstractGenomic structural variants (SVs) are distributed nonrandomly across the human genome. These “hotspots” have been implicated in critical evolutionary innovations, as well as serious medical conditions. However, the evolutionary and biomedical features of these hotspots remain incompletely understood. In this study, we analyzed data from 2,504 genomes from the 1000 Genomes Project Consortium and constructed a refined map of 1,148 SV hotspots in human genomes. By studying the genomic architecture of these hotspots, we found that both nonallelic homologous recombination and non-homologous mechanisms act as mechanistic drivers of SV formation. We found that the majority of SV hotspots are within gene-poor regions and evolve under relaxed negative selection or neutrality. However, we found that a small subset of SV hotspots harbor genes that are enriched for anthropologically crucial functions, including blood oxygen transport, olfaction, synapse assembly, and antigen binding. We provide evidence that balancing selection may have maintained these SV hotspots, which include two independent hotspots on different chromosomes affecting alpha and beta hemoglobin gene clusters. Biomedically, we found that the SV hotspots coincide with breakpoints of clinically relevant, large de novo SVs, significantly more often than genome-wide expectations. As an example, we showed that the breakpoints of multiple large de novo SVs, which lead to idiopathic short stature, coincide with SV hotspots. As such, the mutational instability in SV hotpots likely enables chromosomal breaks that lead to pathogenic structural variation formations. Our study contributes to a better understanding of the mutational landscape of the genome and implicates both mechanistic and adaptive forces in the formation and maintenance of SV hotspots.


2012 ◽  
Vol 24 (2) ◽  
pp. 660-675 ◽  
Author(s):  
Anna Stengel ◽  
Irene L. Gügel ◽  
Daniel Hilger ◽  
Birgit Rengstl ◽  
Heinrich Jung ◽  
...  

2021 ◽  
Vol 18 (2) ◽  
pp. 170-175 ◽  
Author(s):  
Haoyu Cheng ◽  
Gregory T. Concepcion ◽  
Xiaowen Feng ◽  
Haowen Zhang ◽  
Heng Li
Keyword(s):  

Author(s):  
Guangtu Gao ◽  
Susana Magadan ◽  
Geoffrey C Waldbieser ◽  
Ramey C Youngblood ◽  
Paul A Wheeler ◽  
...  

Abstract Currently, there is still a need to improve the contiguity of the rainbow trout reference genome and to use multiple genetic backgrounds that will represent the genetic diversity of this species. The Arlee doubled haploid line was originated from a domesticated hatchery strain that was originally collected from the northern California coast. The Canu pipeline was used to generate the Arlee line genome de-novo assembly from high coverage PacBio long-reads sequence data. The assembly was further improved with Bionano optical maps and Hi-C proximity ligation sequence data to generate 32 major scaffolds corresponding to the karyotype of the Arlee line (2 N = 64). It is composed of 938 scaffolds with N50 of 39.16 Mb and a total length of 2.33 Gb, of which ∼95% was in 32 chromosome sequences with only 438 gaps between contigs and scaffolds. In rainbow trout the haploid chromosome number can vary from 29 to 32. In the Arlee karyotype the haploid chromosome number is 32 because chromosomes Omy04, 14 and 25 are divided into six acrocentric chromosomes. Additional structural variations that were identified in the Arlee genome included the major inversions on chromosomes Omy05 and Omy20 and additional 15 smaller inversions that will require further validation. This is also the first rainbow trout genome assembly that includes a scaffold with the sex-determination gene (sdY) in the chromosome Y sequence. The utility of this genome assembly is demonstrated through the improved annotation of the duplicated genome loci that harbor the IGH genes on chromosomes Omy12 and Omy13.


Sign in / Sign up

Export Citation Format

Share Document