CNVmap: a method and software to detect and map copy number variants from segregation data

10.1101/778753 ◽

2019 ◽

Author(s):

Matthieu Falque ◽

Kamel Jebreen ◽

Etienne Paux ◽

Carsten Knaak ◽

Sofiane Mezmouk ◽

...

Keyword(s):

Copy Number ◽

Copy Number Variants ◽

Original Method ◽

Nucleotide Polymorphisms ◽

Sequencing Data ◽

Extra Copy ◽

Segregation Data ◽

The Past ◽

Duplicated Loci ◽

Additional Value

Single nucleotide polymorphisms (SNPs) are widely used for detecting quantitative trait loci or for searching for causal variants of diseases. Nevertheless, structural variations such as copy-number variants (CNVs) represent a large part of natural genetic diversity and contribute significantly to trait variation. Over the past decade, numerous methods and software tools have been developed to detect CNVs. Such approaches exploit sequencing data or SNP arrays, but they bypass a wealth of information such as genotyping data from segregating populations, produced e.g. for QTL mapping. Here we propose an original method to both detect and genetically map CNVs using mapping panels. Specifically, we exploit the apparent heterozygous state of duplicated loci: peaks in appropriately defined genome-wide allelic profiles provide highly specific signatures that identify the nature and position of the CNVs. Our original method and software can detect and map automatically up to 33 different predefined types of CNVs based on segregation data only. We validate this approach on simulated and experimental bi-parental mapping panels in two maize populations and one wheat population. Most of the events found correspond to having just one extra copy in one of the parental lines, but the corresponding allelic value can be that of either parent. We also find cases with two or more additional copies, especially in wheat, where these copies locate to homeologues. More generally, our computational tool can be used to give additional value, at no cost, to many datasets produced over the past decade from genetic mapping panels.

CNVmap: A Method and Software To Detect and Map Copy Number Variants from Segregation Data

Genetics ◽

10.1534/genetics.119.302881 ◽

2019 ◽

Vol 214 (3) ◽

pp. 561-576

Author(s):

Matthieu Falque ◽

Kamel Jebreen ◽

Etienne Paux ◽

Carsten Knaak ◽

Sofiane Mezmouk ◽

...

Keyword(s):

Copy Number ◽

Copy Number Variants ◽

Original Method ◽

Nucleotide Polymorphisms ◽

Extra Copy ◽

Segregation Data ◽

Wheat Population ◽

Causal Variants ◽

Duplicated Loci ◽

Additional Value

Single nucleotide polymorphisms (SNPs) are used widely for detecting quantitative trait loci, or for searching for causal variants of diseases. Nevertheless, structural variations such as copy-number variants (CNVs) represent a large part of natural genetic diversity, and contribute significantly to trait variation. Numerous methods and software tools based on different technologies (amplicons, CGH, tiling or SNP arrays, or sequencing) have already been developed to detect CNVs, but they bypass a wealth of information such as genotyping data from segregating populations, produced, e.g., for QTL mapping. Here, we propose an original method to both detect and genetically map CNVs using mapping panels. Specifically, we exploit the apparent heterozygous state of duplicated loci: peaks in appropriately defined genome-wide allelic profiles provide highly specific signatures that identify the nature and position of the CNVs. Our original method and software can detect and map automatically up to 33 different predefined types of CNVs based on segregation data only. We validate this approach on simulated and experimental biparental mapping panels in two maize populations and one wheat population. Most of the events found correspond to having just one extra copy in one of the parental lines, but the corresponding allelic value can be that of either parent. We also find cases with two or more additional copies, especially in wheat, where these copies locate to homeologues. More generally, our computational tool can be used to give additional value, at no cost, to many datasets produced over the past decade from genetic mapping panels.
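
The idea of an "apparent heterozygous state" at a duplicated locus can be illustrated with a toy simulation. This is a minimal sketch under strong assumptions, not the CNVmap algorithm: lines are fully homozygous, markers segregate independently (no linkage), parent A carries allele A at the assayed SNP, and parent B carries allele B plus one extra copy (also allele B) at a second, unlinked position. The function name and all parameters are hypothetical.

```python
import random

def simulate_apparent_het_profile(n_lines=400, n_markers=20,
                                  snp_idx=3, extra_idx=12, seed=1):
    """Toy model: frequency of parent-B origin at each marker,
    computed only among lines that look heterozygous at the SNP."""
    rng = random.Random(seed)
    # Each homozygous line records the parental origin ('A' or 'B')
    # at every marker; markers are assumed independent (no linkage).
    lines = [[rng.choice("AB") for _ in range(n_markers)]
             for _ in range(n_lines)]
    # A line looks heterozygous when it carries allele A at the original
    # locus (origin A) AND the extra B copy (origin B at extra_idx).
    hets = [g for g in lines if g[snp_idx] == "A" and g[extra_idx] == "B"]
    # Allelic profile: share of parent-B origin among apparent heterozygotes.
    profile = [sum(g[m] == "B" for g in hets) / len(hets)
               for m in range(n_markers)]
    return profile, len(hets)

profile, n_hets = simulate_apparent_het_profile()
peak = max(range(len(profile)), key=profile.__getitem__)
print(f"{n_hets} apparent heterozygotes; profile peaks at marker {peak}")
```

In this toy setting the profile reaches 1.0 at the position of the extra copy and 0.0 at the original locus, while unrelated markers stay near 0.5. With real linkage, neighbouring markers would also be enriched, which is what would give the peak its width.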

CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data

10.1101/2021.12.17.473150 ◽

2021 ◽

Author(s):

Arda Soylev ◽

Sevim Seda Cokoglu ◽

Dilek Koptekin ◽

Can Alkan ◽

Mehmet Somel

Keyword(s):

Copy Number ◽

Demographic History ◽

Copy Number Variants ◽

Purifying Selection ◽

Gene Pools ◽

Nucleotide Polymorphisms ◽

Sequencing Data ◽

Duplication Events ◽

Highly Correlated ◽

Genome Analyses

To date, ancient genome analyses have been largely confined to the study of single nucleotide polymorphisms (SNPs). Copy number variants (CNVs) are a major contributor to disease and evolutionary adaptation, but identifying CNVs in ancient shotgun-sequenced genomes is hampered by (i) most published genomes having <1x coverage and (ii) ancient DNA fragments typically being <80 bp long. These characteristics prevent state-of-the-art CNV detection software from being applied effectively to ancient genomes. Here we present CONGA, an algorithm tailored for genotyping deletion and duplication events in genomes with low depths of coverage. Simulations show that CONGA can genotype deletions and duplications >1 kbp with F-scores >0.77 and >0.82, respectively, at ≥0.5x coverage. Further, down-sampling experiments using published ancient BAM files reveal that >1 kbp deletions could be genotyped at F-score >0.75 at ≥1x coverage. Using CONGA, we analyse deletion events at 10,018 loci in 56 ancient human genomes spanning the last 50,000 years, with coverages of 0.4x-26x. We find inter-individual genetic diversity measured using deletions and SNPs to be highly correlated, suggesting that deletion frequencies broadly reflect demographic history. We also identify signatures of purifying selection on deletions, such as an excess of singletons compared to those in SNPs. CONGA paves the way for systematic studies of drift, mutation load, and adaptation in ancient and modern-day gene pools through the lens of CNVs.
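
For readers unfamiliar with the F-scores quoted above, the following sketch shows how such a score is commonly computed from a set of true events and a set of called events. It assumes the balanced F1 variant (the abstract does not say which F-score is used) and exact matching of event identifiers; the function name and the example identifiers are hypothetical and are not taken from CONGA.

```python
def f_score(true_events, called_events):
    """Return (precision, recall, F1) for two sets of event identifiers.

    Assumes the balanced F1 score and exact identifier matching; real CNV
    benchmarks often match events by reciprocal overlap instead.
    """
    true_events, called_events = set(true_events), set(called_events)
    true_positives = len(true_events & called_events)
    precision = true_positives / len(called_events) if called_events else 0.0
    recall = true_positives / len(true_events) if true_events else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Hypothetical example: 4 true events, 4 calls, 3 of them correct.
truth = {"del1", "del2", "del3", "dup1"}
calls = {"del1", "del2", "dup1", "dup2"}
precision, recall, f1 = f_score(truth, calls)
print(precision, recall, f1)
```

Here three of the four calls are correct and three of the four true events are recovered, so precision, recall, and F1 all equal 0.75.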

PocaCNV: A Tool to Detect Copy Number Variants from Population-Scale Genome Sequencing Data

10.1109/bibm52615.2021.9669405 ◽

2021 ◽

Author(s):

Zhendong Zhang ◽

Yongzhuang Liu ◽

Gaoyang Li ◽

Yadong Wang

Keyword(s):

Genome Sequencing ◽

Copy Number ◽

Copy Number Variants ◽

Sequencing Data ◽

Population Scale

Clinically significant exome-based copy number variants detected by re-evaluation of exome sequencing data

Dokuz Eylül Üniversitesi Tıp Fakültesi Dergisi ◽

10.5505/deutfd.2021.29053 ◽

2021 ◽

Vol 35 (1) ◽

pp. 1-11

Author(s):

Fatma Kurt Çolak

Keyword(s):

Exome Sequencing ◽

Copy Number ◽

Copy Number Variants ◽

Sequencing Data ◽

Exome Sequencing Data ◽

Clinically Significant

Genetics of Schizophrenia and Bipolar Disorder

10.1093/med/9780190681425.003.0013 ◽

2017 ◽

Author(s):

Alexander Charney ◽

Pamela Sklar

Keyword(s):

Bipolar Disorder ◽

Copy Number ◽

Psychotic Disorders ◽

Copy Number Variants ◽

Nucleotide Polymorphisms ◽

Common Variants ◽

Single Nucleotide Variants ◽

Single Nucleotide ◽

Number Variation

Schizophrenia and bipolar disorder are the classic psychotic disorders. Both diseases are strongly familial, but have proven recalcitrant to genetic methodologies for identifying the etiology until recently. There is now convincing genetic evidence that indicates a contribution of many DNA changes to the risk of becoming ill. For schizophrenia, there are large contributions of rare copy number variants and common single nucleotide variants, with an overall highly polygenic genetic architecture. For bipolar disorder, the role of copy number variation appears to be much less pronounced. Specific common single nucleotide polymorphisms are associated, and there is evidence for polygenicity. Several surprises have emerged from the genetic data that indicate there is significantly more molecular overlap in copy number variants between autism and schizophrenia, and in common variants between schizophrenia and bipolar disorder.

nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data

BMC Bioinformatics ◽

10.1186/s12859-016-1239-7 ◽

2016 ◽

Vol 17 (1) ◽

Cited By ~ 9

Author(s):

Changsheng Zhang ◽

Hongmin Cai ◽

Jingying Huang ◽

Yan Song

Keyword(s):

Constrained Optimization ◽

Single Cell ◽

Optimization Model ◽

Copy Number ◽

Copy Number Variants ◽

Sequencing Data ◽

Single Cell Sequencing

Genome-wide association of early-onset myocardial infarction with single nucleotide polymorphisms and copy number variants

Nature Genetics ◽

10.1038/ng.327 ◽

2009 ◽

Vol 41 (3) ◽

pp. 334-341 ◽

Cited By ~ 720

Author(s):

Keyword(s):

Myocardial Infarction ◽

Single Nucleotide Polymorphisms ◽

Early Onset ◽

Copy Number ◽

Copy Number Variants ◽

Genome Wide Association ◽

Nucleotide Polymorphisms ◽

Single Nucleotide ◽

Genome Wide

SECNVs: A Simulator of Copy Number Variants and Whole-Exome Sequences from Reference Genomes

10.1101/824128 ◽

2019 ◽

Cited By ~ 1

Author(s):

Yue Xing ◽

Alan R. Dabney ◽

Xiao Li ◽

Guosong Wang ◽

Clare A. Gill ◽

...

Keyword(s):

Whole Genome Sequencing ◽

Genome Sequencing ◽

Copy Number ◽

Copy Number Variants ◽

Whole Genome ◽

Sequencing Data ◽

Software Applications ◽

Exome Sequencing Data ◽

Whole Exome ◽

Whole Exome Sequencing Data

Copy number variants are insertions and deletions of 1 kb or larger in a genome that play an important role in phenotypic changes and human disease. Many software applications have been developed to detect copy number variants using either whole-genome sequencing or whole-exome sequencing data. However, there is poor agreement in the results from these applications. Simulated datasets containing copy number variants allow comprehensive comparisons of the operating characteristics of existing and novel copy number variant detection methods. Several software applications have been developed to simulate copy number variants and other structural variants in whole-genome sequencing data. However, none of the applications reliably simulate copy number variants in whole-exome sequencing data. We have developed and tested SECNVs (Simulator of Exome Copy Number Variants), a fast, robust and customizable software application for simulating copy number variants and whole-exome sequences from a reference genome. SECNVs is easy to install, implements a wide range of commands to customize simulations, can output multiple samples at once, and incorporates a pipeline to output rearranged genomes, short reads and BAM files in a single command. Variants generated by SECNVs are detected with high sensitivity and precision by tools commonly used to detect copy number variants. SECNVs is publicly available at https://github.com/YJulyXing/SECNVs.

SA55 GENOME-WIDE ASSOCIATION OF COPY NUMBER VARIANTS FROM WHOLE-EXOME SEQUENCING DATA REVEALS AN ASSOCIATION BETWEEN EXTREMES IN WORKING MEMORY PERFORMANCE AND RARE CNVS

European Neuropsychopharmacology ◽

10.1016/j.euroneuro.2018.08.277 ◽

2019 ◽

Vol 29 ◽

pp. S1218

Author(s):

Angela Heck ◽

Annette Milnik ◽

Vanja Vukojevic ◽

Jana Petrovska ◽

Virginie Freytag ◽

...

Keyword(s):

Working Memory ◽

Exome Sequencing ◽

Copy Number ◽

Memory Performance ◽

Copy Number Variants ◽

Sequencing Data ◽

Rare Cnvs ◽

Exome Sequencing Data ◽

Whole Exome ◽

Whole Exome Sequencing Data

Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm

Nucleic Acids Research ◽

10.1093/nar/gkr068 ◽

2011 ◽

Vol 39 (10) ◽

pp. e65-e65 ◽

Cited By ~ 51

Author(s):

Alberto Magi ◽

Matteo Benelli ◽

Seungtai Yoon ◽

Franco Roviello ◽

Francesca Torricelli

Keyword(s):

High Throughput ◽

Copy Number ◽

High Throughput Sequencing ◽

Copy Number Variants ◽

Sequencing Data ◽

High Throughput Sequencing Data
