scholarly journals Six new reference-quality bat genomes illuminate the molecular basis and evolution of bat adaptations

2019 ◽  
Author(s):  
David Jebb ◽  
Zixia Huang ◽  
Martin Pippel ◽  
Graham M. Hughes ◽  
Ksenia Lavrichenko ◽  
...  

AbstractBats account for ~20% of all extant mammal species and are considered exceptional given their extraordinary adaptations, including biosonar, true flight, extreme longevity, and unparalleled immune systems. To understand these adaptations, we generated reference-quality genomes of six species representing the key divergent lineages. We assembled these genomes with a novel pipeline incorporating state-of-the-art long-read and long-range sequencing and assembly techniques. The genomes were annotated using a maximal evidence approach, de novo predictions, protein/mRNA alignments, Iso-seq long read and RNA-seq short read transcripts, and gene projections from our new TOGA pipeline, retrieving virtually all (>99%) mammalian BUSCO genes. Phylogenetic analyses of 12,931 protein coding-genes and 10,857 conserved non-coding elements identified across 48 mammalian genomes helped to resolve bats’ closest extant relatives within Laurasiatheria, supporting a basal position for bats within Scrotifera. Genome-wide screens along the bat ancestral branch revealed (a) selection on hearing-involved genes (e.g LRP2, SERPINB6, TJP2), which suggest that laryngeal echolocation is a shared ancestral trait of bats; (b) selection (e.g INAVA, CXCL13, NPSR1) and loss of immunity related proteins (e.g. LRRC70, IL36G), including pro-inflammatory NF-kB signalling; and (c) expansion of the APOBEC family, associated with restricting viral infection, transposon activity and interferon signalling. We also identified unique integrated viruses, indicating that bats have a history of tolerating viral pathogens, lethal to other mammal species. Non-coding RNA analyses identified variant and novel microRNAs, revealing regulatory relationships that may contribute to phenotypic diversity in bats. Together, our reference-quality genomes, high-quality annotations, genome-wide screens and in-vitro tests revealed previously unknown genomic adaptations in bats that may explain their extraordinary traits.

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Suzana Stjelja ◽  
Johan Fogelqvist ◽  
Christian Tellgren-Roth ◽  
Christina Dixelius

Abstract Plasmodiophora brassicae is a soil-borne pathogen that attacks roots of cruciferous plants causing clubroot disease. The pathogen belongs to the Plasmodiophorida order in Phytomyxea. Here we used long-read SMRT technology to clarify the P. brassicae e3 genomic constituents along with comparative and phylogenetic analyses. Twenty contigs representing the nuclear genome and one mitochondrial (mt) contig were generated, together comprising 25.1 Mbp. Thirteen of the 20 nuclear contigs represented chromosomes from telomere to telomere characterized by [TTTTAGGG] sequences. Seven active gene candidates encoding synaptonemal complex-associated and meiotic-related protein homologs were identified, a finding that argues for possible genetic recombination events. The circular mt genome is large (114,663 bp), gene dense and intron rich. It shares high synteny with the mt genome of Spongospora subterranea, except in a unique 12 kb region delimited by shifts in GC content and containing tandem minisatellite- and microsatellite repeats with partially palindromic sequences. De novo annotation identified 32 protein-coding genes, 28 structural RNA genes and 19 ORFs. ORFs predicted in the repeat-rich region showed similarities to diverse organisms suggesting possible evolutionary connections. The data generated here form a refined platform for the next step involving functional analysis, all to clarify the complex biology of P. brassicae.


2019 ◽  
Author(s):  
Ryan Bracewell ◽  
Anita Tran ◽  
Kamalakar Chatla ◽  
Doris Bachtrog

ABSTRACTThe Drosophila obscura species group is one of the most studied clades of Drosophila and harbors multiple distinct karyotypes. Here we present a de novo genome assembly and annotation of D. bifasciata, a species which represents an important subgroup for which no high-quality chromosome-level genome assembly currently exists. We combined long-read sequencing (Nanopore) and Hi-C scaffolding to achieve a highly contiguous genome assembly approximately 193Mb in size, with repetitive elements constituting 30.1% of the total length. Drosophila bifasciata harbors four large metacentric chromosomes and the small dot, and our assembly contains each chromosome in a single scaffold, including the highly repetitive pericentromere, which were largely composed of Jockey and Gypsy transposable elements. We annotated a total of 12,821 protein-coding genes and comparisons of synteny with D. athabasca orthologs show that the large metacentric pericentromeric regions of multiple chromosomes are conserved between these species. Importantly, Muller A (X chromosome) was found to be metacentric in D. bifasciata and the pericentromeric region appears homologous to the pericentromeric region of the fused Muller A-AD (XL and XR) of pseudoobscura/affinis subgroup species. Our finding suggests a metacentric ancestral X fused to a telocentric Muller D and created the large neo-X (Muller A-AD) chromosome ∼15 MYA. We also confirm the fusion of Muller C and D in D. bifasciata and show that it likely involved a centromere-centromere fusion.


PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e10364
Author(s):  
Natalia I. Abramson ◽  
Fedor N. Golenishchev ◽  
Semen Yu. Bodrov ◽  
Olga V. Bondareva ◽  
Evgeny A. Genelt-Yanovskiy ◽  
...  

In this article, we present the nearly complete mitochondrial genome of the Subalpine Kashmir vole Hyperacrius fertilis (Arvicolinae, Cricetidae, Rodentia), assembled using data from Illumina next-generation sequencing (NGS) of the DNA from a century-old museum specimen. De novo assembly consisted of 16,341 bp and included all mitogenome protein-coding genes as well as 12S and 16S RNAs, tRNAs and D-loop. Using the alignment of protein-coding genes of 14 previously published Arvicolini tribe mitogenomes, seven Clethrionomyini mitogenomes, and also Ondatra and Dicrostonyx outgroups, we conducted phylogenetic reconstructions based on a dataset of 13 protein-coding genes (PCGs) under maximum likelihood and Bayesian inference. Phylogenetic analyses robustly supported the phylogenetic position of this species within the tribe Arvicolini. Among the Arvicolini, Hyperacrius represents one of the early-diverged lineages. This result of phylogenetic analysis altered the conventional view on phylogenetic relatedness between Hyperacrius and Alticola and prompted the revision of morphological characters underlying the former assumption. Morphological analysis performed here confirmed molecular data and provided additional evidence for taxonomic replacement of the genus Hyperacrius from the tribe Clethrionomyini to the tribe Arvicolini.


2017 ◽  
Author(s):  
Mircea Cretu Stancu ◽  
Markus J. van Roosmalen ◽  
Ivo Renkens ◽  
Marleen Nieboer ◽  
Sjors Middelkamp ◽  
...  

AbstractStructural genomic variants form a common type of genetic alteration underlying human genetic disease and phenotypic variation. Despite major improvements in genome sequencing technology and data analysis, the detection of structural variants still poses challenges, particularly when variants are of high complexity. Emerging long-read single-molecule sequencing technologies provide new opportunities for detection of structural variants. Here, we demonstrate sequencing of the genomes of two patients with congenital abnormalities using the ONT MinION at 11x and 16x mean coverage, respectively. We developed a bioinformatic pipeline - NanoSV - to efficiently map genomic structural variants (SVs) from the long-read data. We demonstrate that the nanopore data are superior to corresponding short-read data with regard to detection of de novo rearrangements originating from complex chromothripsis events in the patients. Additionally, genome-wide surveillance of SVs, revealed 3,253 (33%) novel variants that were missed in short-read data of the same sample, the majority of which are duplications < 200bp in size. Long sequencing reads enabled efficient phasing of genetic variations, allowing the construction of genome-wide maps of phased SVs and SNVs. We employed read-based phasing to show that all de novo chromothripsis breakpoints occurred on paternal chromosomes and we resolved the long-range structure of the chromothripsis. This work demonstrates the value of long-read sequencing for screening whole genomes of patients for complex structural variants.


2016 ◽  
Vol 61 (2) ◽  
Author(s):  
Jees Sebastian ◽  
Sharmada Swaminath ◽  
Rashmi Ravindran Nair ◽  
Kishor Jakkala ◽  
Atul Pradhan ◽  
...  

ABSTRACT Bacterial persisters are a subpopulation of cells that can tolerate lethal concentrations of antibiotics. However, the possibility of the emergence of genetically resistant mutants from antibiotic persister cell populations, upon continued exposure to lethal concentrations of antibiotics, remained unexplored. In the present study, we found that Mycobacterium tuberculosis cells exposed continuously to lethal concentrations of rifampin (RIF) or moxifloxacin (MXF) for prolonged durations showed killing, RIF/MXF persistence, and regrowth phases. RIF-resistant or MXF-resistant mutants carrying clinically relevant mutations in the rpoB or gyrA gene, respectively, were found to emerge at high frequency from the RIF persistence phase population. A Luria-Delbruck fluctuation experiment using RIF-exposed M. tuberculosis cells showed that the rpoB mutants were not preexistent in the population but were formed de novo from the RIF persistence phase population. The RIF persistence phase M. tuberculosis cells carried elevated levels of hydroxyl radical that inflicted extensive genome-wide mutations, generating RIF-resistant mutants. Consistent with the elevated levels of hydroxyl radical-mediated genome-wide random mutagenesis, MXF-resistant M. tuberculosis gyrA de novo mutants could be selected from the RIF persistence phase cells. Thus, unlike previous studies, which showed emergence of genetically resistant mutants upon exposure of bacteria for short durations to sublethal concentrations of antibiotics, our study demonstrates that continuous prolonged exposure of M. tuberculosis cells to lethal concentrations of an antibiotic generates antibiotic persistence phase cells that form a reservoir for the generation of genetically resistant mutants to the same antibiotic or another antibiotic. These findings may have clinical significance in the emergence of drug-resistant tubercle bacilli.


2018 ◽  
Vol 115 (10) ◽  
pp. 2401-2406 ◽  
Author(s):  
H. S. Jeffrey Man ◽  
Aravin N. Sukumar ◽  
Gabrielle C. Lam ◽  
Paul J. Turgeon ◽  
Matthew S. Yan ◽  
...  

Endothelial cell (EC)-enriched protein coding genes, such as endothelial nitric oxide synthase (eNOS), define quintessential EC-specific physiologic functions. It is not clear whether long noncoding RNAs (lncRNAs) also define cardiovascular cell type-specific phenotypes, especially in the vascular endothelium. Here, we report the existence of a set of EC-enriched lncRNAs and define a role for spliced-transcript endothelial-enriched lncRNA (STEEL) in angiogenic potential, macrovascular/microvascular identity, and shear stress responsiveness. STEEL is expressed from the terminus of the HOXD locus and is transcribed antisense to HOXD transcription factors. STEEL RNA increases the number and integrity of de novo perfused microvessels in an in vivo model and augments angiogenesis in vitro. The STEEL RNA is polyadenylated, nuclear enriched, and has microvascular predominance. Functionally, STEEL regulates a number of genes in diverse ECs. Of interest, STEEL up-regulates both eNOS and the transcription factor Kruppel-like factor 2 (KLF2), and is subject to feedback inhibition by both eNOS and shear-augmented KLF2. Mechanistically, STEEL up-regulation of eNOS and KLF2 is transcriptionally mediated, in part, via interaction of chromatin-associated STEEL with the poly-ADP ribosylase, PARP1. For instance, STEEL recruits PARP1 to the KLF2 promoter. This work identifies a role for EC-enriched lncRNAs in the phenotypic adaptation of ECs to both body position and hemodynamic forces and establishes a newer role for lncRNAs in the transcriptional regulation of EC identity.


2019 ◽  
Vol 10 (1) ◽  
Author(s):  
Eugene J. Gardner ◽  
Elena Prigmore ◽  
Giuseppe Gallone ◽  
Petr Danecek ◽  
Kaitlin E. Samocha ◽  
...  

Abstract Mobile genetic Elements (MEs) are segments of DNA which can copy themselves and other transcribed sequences through the process of retrotransposition (RT). In humans several disorders have been attributed to RT, but the role of RT in severe developmental disorders (DD) has not yet been explored. Here we identify RT-derived events in 9738 exome sequenced trios with DD-affected probands. We ascertain 9 de novo MEs, 4 of which are likely causative of the patient’s symptoms (0.04%), as well as 2 de novo gene retroduplications. Beyond identifying likely diagnostic RT events, we estimate genome-wide germline ME mutation rate and selective constraint and demonstrate that coding RT events have signatures of purifying selection equivalent to those of truncating mutations. Overall, our analysis represents a comprehensive interrogation of the impact of retrotransposition on protein coding genes and a framework for future evolutionary and disease studies.


2020 ◽  
Vol 10 (3) ◽  
pp. 891-897 ◽  
Author(s):  
Ryan Bracewell ◽  
Anita Tran ◽  
Kamalakar Chatla ◽  
Doris Bachtrog

The Drosophila obscura species group is one of the most studied clades of Drosophila and harbors multiple distinct karyotypes. Here we present a de novo genome assembly and annotation of D. bifasciata, a species which represents an important subgroup for which no high-quality chromosome-level genome assembly currently exists. We combined long-read sequencing (Nanopore) and Hi-C scaffolding to achieve a highly contiguous genome assembly approximately 193 Mb in size, with repetitive elements constituting 30.1% of the total length. Drosophila bifasciata harbors four large metacentric chromosomes and the small dot, and our assembly contains each chromosome in a single scaffold, including the highly repetitive pericentromeres, which were largely composed of Jockey and Gypsy transposable elements. We annotated a total of 12,821 protein-coding genes and comparisons of synteny with D. athabasca orthologs show that the large metacentric pericentromeric regions of multiple chromosomes are conserved between these species. Importantly, Muller A (X chromosome) was found to be metacentric in D. bifasciata and the pericentromeric region appears homologous to the pericentromeric region of the fused Muller A-AD (XL and XR) of pseudoobscura/affinis subgroup species. Our finding suggests a metacentric ancestral X fused to a telocentric Muller D and created the large neo-X (Muller A-AD) chromosome ∼15 MYA. We also confirm the fusion of Muller C and D in D. bifasciata and show that it likely involved a centromere-centromere fusion.


2021 ◽  
Vol 53 (9) ◽  
pp. 1392-1402
Author(s):  
Lei Kang ◽  
Lunwen Qian ◽  
Ming Zheng ◽  
Liyang Chen ◽  
Hao Chen ◽  
...  

AbstractDespite early domestication around 3000 BC, the evolutionary history of the ancient allotetraploid species Brassica juncea (L.) Czern & Coss remains uncertain. Here, we report a chromosome-scale de novo assembly of a yellow-seeded B. juncea genome by integrating long-read and short-read sequencing, optical mapping and Hi-C technologies. Nuclear and organelle phylogenies of 480 accessions worldwide supported that B. juncea is most likely a single origin in West Asia, 8,000–14,000 years ago, via natural interspecific hybridization. Subsequently, new crop types evolved through spontaneous gene mutations and introgressions along three independent routes of eastward expansion. Selective sweeps, genome-wide trait associations and tissue-specific RNA-sequencing analysis shed light on the domestication history of flowering time and seed weight, and on human selection for morphological diversification in this versatile species. Our data provide a comprehensive insight into the origin and domestication and a foundation for genomics-based breeding of B. juncea.


Viruses ◽  
2020 ◽  
Vol 12 (7) ◽  
pp. 749 ◽  
Author(s):  
Melanie Hiltbrunner ◽  
Gerald Heckel

Research on the ecology and evolution of viruses is often hampered by the limitation of sequence information to short parts of the genomes or single genomes derived from cultures. In this study, we use hybrid sequence capture enrichment in combination with high-throughput sequencing to provide efficient access to full genomes of European hantaviruses from rodent samples obtained in the field. We applied this methodology to Tula (TULV) and Puumala (PUUV) orthohantaviruses for which analyses from natural host samples are typically restricted to partial sequences of their tri-segmented RNA genome. We assembled a total of ten novel hantavirus genomes de novo with very high coverage (on average >99%) and sequencing depth (average >247×). A comparison with partial Sanger sequences indicated an accuracy of >99.9% for the assemblies. An analysis of two common vole (Microtus arvalis) samples infected with two TULV strains each allowed for the de novo assembly of all four TULV genomes. Combining the novel sequences with all available TULV and PUUV genomes revealed very similar patterns of sequence diversity along the genomes, except for remarkably higher diversity in the non-coding region of the S-segment in PUUV. The genomic distribution of polymorphisms in the coding sequence was similar between the species, but differed between the segments with the highest sequence divergence of 0.274 for the M-segment, 0.265 for the S-segment, and 0.248 for the L-segment (overall 0.258). Phylogenetic analyses showed the clustering of genome sequences consistent with their geographic distribution within each species. Genome-wide data yielded extremely high node support values, despite the impact of strong mutational saturation that is expected for hantavirus sequences obtained over large spatial distances. We conclude that genome sequencing based on capture enrichment protocols provides an efficient means for ecological and evolutionary investigations of hantaviruses at an unprecedented completeness and depth.


Sign in / Sign up

Export Citation Format

Share Document