A Diploid Assembly-based Benchmark for Variants in the Major Histocompatibility Complex

Mapping Intimacies ◽

10.1101/831792 ◽

2019 ◽

Cited By ~ 4

Author(s):

Chen-Shan Chin ◽

Justin Wagner ◽

Qiandong Zeng ◽

Erik Garrison ◽

Shilpa Garg ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

De Novo ◽

Genome Project ◽

Personal Genome ◽

Major Histocompatibility ◽

Base Level ◽

Histocompatibility Complex ◽

Human Genomes ◽

Long Reads ◽

Complex Variation

Abstract: We develop the first human benchmark derived from a diploid assembly for the openly-consented Genome in a Bottle/Personal Genome Project Ashkenazi son (HG002). As a proof-of-principle, we focus on a medically important, highly variable, 5 million base-pair region - the Major Histocompatibility Complex (MHC). Most human genomes are characterized by aligning individual reads to the reference genome, but accurate long reads and linked reads now enable us to construct base-level accurate, phased de novo assemblies from the reads. We assemble a single haplotig (haplotype-specific contig) for each haplotype, and align reads back to each assembled haplotig to identify two regions of lower confidence. We align the haplotigs to the reference, call phased small and structural variants, and define the first small variant benchmark for the MHC, covering 21496 small variants in 4.58 million base-pairs (92% of the MHC). The assembly-based benchmark is 99.95% concordant with a draft mapping-based benchmark from the same long and linked reads within both benchmark regions, but covers 50% more variants outside the mapping-based benchmark regions. The haplotigs and variant calls are completely concordant with phased clinical HLA types for HG002. This benchmark reliably identifies false positives and false negatives from mapping-based callsets, and enables performance assessment in regions with much denser, complex variation than regions covered by previous benchmarks. These methods demonstrate a path towards future diploid assembly-based benchmarks for other complex regions of the genome.

A diploid assembly-based benchmark for variants in the major histocompatibility complex

Nature Communications ◽

10.1038/s41467-020-18564-9 ◽

2020 ◽

Vol 11 (1) ◽

Cited By ~ 2

Author(s):

Chen-Shan Chin ◽

Justin Wagner ◽

Qiandong Zeng ◽

Erik Garrison ◽

Shilpa Garg ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Performance Assessment ◽

Reference Genome ◽

De Novo ◽

Major Histocompatibility ◽

Structural Variants ◽

Histocompatibility Complex ◽

Human Genomes ◽

Long Reads ◽

Complex Variation

Abstract: Most human genomes are characterized by aligning individual reads to the reference genome, but accurate long reads and linked reads now enable us to construct accurate, phased de novo assemblies. We focus on a medically important, highly variable, 5 million base-pair (bp) region where diploid assembly is particularly useful - the Major Histocompatibility Complex (MHC). Here, we develop a human genome benchmark derived from a diploid assembly for the openly-consented Genome in a Bottle sample HG002. We assemble a single contig for each haplotype, align them to the reference, call phased small and structural variants, and define a small variant benchmark for the MHC, covering 94% of the MHC and 22368 variants smaller than 50 bp, 49% more variants than a mapping-based benchmark. This benchmark reliably identifies errors in mapping-based callsets, and enables performance assessment in regions with much denser, complex variation than regions covered by previous benchmarks.

De novo genotyping of the major histocompatibility complex in an Australian dragon lizard, Ctenophorus decresii

Transactions of the Royal Society of South Australia ◽

10.1080/03721426.2018.1542259 ◽

2018 ◽

Vol 143 (1) ◽

pp. 97-117

Author(s):

Jessica Hacking ◽

Tessa Bradford ◽

Kelly Pierce ◽

Michael Gardner

Keyword(s):

Major Histocompatibility Complex ◽

De Novo ◽

Major Histocompatibility ◽

Histocompatibility Complex

Major histocompatibility complex variation and evolution at a single, expressed DQA locus in two genera of elephants

Immunogenetics ◽

10.1007/s00251-009-0413-8 ◽

2010 ◽

Vol 62 (2) ◽

pp. 85-100 ◽

Cited By ~ 19

Author(s):

Elizabeth A. Archie ◽

Tammy Henry ◽

Jesus E. Maldonado ◽

Cynthia J. Moss ◽

Joyce H. Poole ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Major Histocompatibility ◽

Histocompatibility Complex ◽

Complex Variation

Stock Identification of Fraser River Sockeye Salmon Using Microsatellites and Major Histocompatibility Complex Variation

Transactions of the American Fisheries Society ◽

10.1577/t04-001.1 ◽

2004 ◽

Vol 133 (5) ◽

pp. 1117-1137 ◽

Cited By ~ 84

Author(s):

Terry D. Beacham ◽

Michael Lapointe ◽

John R. Candy ◽

Brenda McIntosh ◽

Cathy MacConnachie ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Sockeye Salmon ◽

Stock Identification ◽

Fraser River ◽

Major Histocompatibility ◽

Histocompatibility Complex ◽

Complex Variation

Major histocompatibility complex variation in insular populations of the Egyptian vulture: inferences about the roles of genetic drift and selection

Molecular Ecology ◽

10.1111/j.1365-294x.2011.05107.x ◽

2011 ◽

Vol 20 (11) ◽

pp. 2329-2340 ◽

Cited By ~ 28

Author(s):

Rosa Agudo ◽

Miguel Alcaide ◽

Ciro Rico ◽

Jesus A. Lemus ◽

Guillermo Blanco ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Genetic Drift ◽

Major Histocompatibility ◽

Egyptian Vulture ◽

Histocompatibility Complex ◽

Complex Variation ◽

Insular Populations

Immunocontraceptive vaccines and major histocompatibility complex variation in the brushtail possum

Journal of Reproductive Immunology ◽

10.1016/j.jri.2010.06.067 ◽

2010 ◽

Vol 86 (1) ◽

pp. 36

Author(s):

O.J. Holland ◽

P.E. Cowan ◽

D.M. Gleeson ◽

J.A. Duckworth ◽

L.W. Chamley

Keyword(s):

Major Histocompatibility Complex ◽

Brushtail Possum ◽

Major Histocompatibility ◽

Histocompatibility Complex ◽

Complex Variation

Initial description of Major Histocompatibility Complex variation at two Class II loci (DQA-DQB) in Sotalia fluviatilis and Sotalia guianensis

Latin American Journal of Aquatic Mammals ◽

10.5597/lajam00156 ◽

2010 ◽

Vol 8 (1-2) ◽

Cited By ~ 1

Author(s):

S. Caballero ◽

D. Heimeier ◽

F. Trujillo ◽

J. A. Vianna ◽

H. Barrios-Garrido ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Class II ◽

Major Histocompatibility ◽

Sotalia Guianensis ◽

Histocompatibility Complex ◽

Sotalia Fluviatilis ◽

Complex Variation ◽

Initial Description

Extensive sequencing of seven human genomes to characterize benchmark reference materials

10.1101/026468 ◽

2015 ◽

Cited By ~ 9

Author(s):

Justin M Zook ◽

David Catoe ◽

Jennifer McDaniel ◽

Lindsay Vang ◽

Noah Spies ◽

...

Keyword(s):

Human Genome ◽

Reference Materials ◽

De Novo ◽

Variant Calling ◽

Genome Project ◽

Genome Comparison ◽

Personal Genome ◽

Sequencing Data ◽

Sequencing Technologies ◽

Human Genomes

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST), is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode™ WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.

Estimation of Stock Composition and Individual Identification of Sockeye Salmon on a Pacific Rim Basis Using Microsatellite and Major Histocompatibility Complex Variation

Transactions of the American Fisheries Society ◽

10.1577/t05-005.1 ◽

2005 ◽

Vol 134 (5) ◽

pp. 1124-1146 ◽

Cited By ~ 111

Author(s):

Terry D. Beacham ◽

John R. Candy ◽

Brenda McIntosh ◽

Cathy MacConnachie ◽

Amy Tabata ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Sockeye Salmon ◽

Pacific Rim ◽

Individual Identification ◽

Major Histocompatibility ◽

Histocompatibility Complex ◽

Complex Variation

Microsatellite and major histocompatibility complex variation in an endangered rattlesnake, the Eastern Massasauga (Sistrurus catenatus)

Ecology and Evolution ◽

10.1002/ece3.2159 ◽

2016 ◽

Vol 6 (12) ◽

pp. 3991-4003 ◽

Cited By ~ 9

Author(s):

Collin P. Jaeger ◽

Melvin R. Duvall ◽

Bradley J. Swanson ◽

Christopher A. Phillips ◽

Michael J. Dreslik ◽

...

Keyword(s):

Major Histocompatibility Complex ◽

Major Histocompatibility ◽

Histocompatibility Complex ◽

Sistrurus Catenatus ◽

Complex Variation
