Forensic use of the genomic relationship matrix to validate and discover livestock pedigrees

Kirsty Lee Moore; Conrad Vilela; Karolina Kaseja; Raphael Mrode; Mike Coffey

doi:10.1093/jas/sky407

A recursive algorithm for decomposition and creation of the inverse of the genomic relationship matrix

Journal of Dairy Science ◽

10.3168/jds.2011-5249 ◽

2012 ◽

Vol 95 (10) ◽

pp. 6093-6102 ◽

Cited By ~ 6

Author(s):

P. Faux ◽

N. Gengler ◽

I. Misztal

Keyword(s):

Recursive Algorithm ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship

Download Full-text

Correction to: Validation of genomic predictions for body weight in broilers using crossbred information and considering breed-of-origin of alleles

Genetics Selection Evolution ◽

10.1186/s12711-019-0507-1 ◽

2019 ◽

Vol 51 (1) ◽

Author(s):

Pascal Duenk ◽

Mario P. L. Calus ◽

Yvonne C. J. Wientjes ◽

Vivian P. Breen ◽

John M. Henshall ◽

...

Keyword(s):

Body Weight ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Genomic Predictions

Following publication of original article [1], we noticed that there was an error: Eq. (3) on page 5 is the genomic relationship matrix that

Download Full-text

Using the genomic relationship matrix to predict the accuracy of genomic selection

Journal of Animal Breeding and Genetics ◽

10.1111/j.1439-0388.2011.00964.x ◽

2011 ◽

Vol 128 (6) ◽

pp. 409-421 ◽

Cited By ~ 170

Author(s):

M.E. Goddard ◽

B.J. Hayes ◽

T.H.E. Meuwissen

Keyword(s):

Genomic Selection ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship

Download Full-text

Accuracy of genomic selection predictions for hip height in Brahman cattle using different relationship matrices

Pesquisa Agropecuária Brasileira ◽

10.1590/s0100-204x2018000600008 ◽

2018 ◽

Vol 53 (6) ◽

pp. 717-726 ◽

Cited By ~ 1

Author(s):

Michel Marques Farah ◽

Marina Rufino Salinas Fortes ◽

Matthew Kelly ◽

Laercio Ribeiro Porto-Neto ◽

Camila Tangari Meira ◽

...

Keyword(s):

Allele Frequency ◽

High Density ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Genomic Information ◽

Pedigree Data ◽

Snp Chip ◽

Numerator Relationship Matrix ◽

Brahman Cattle

Abstract: The objective of this work was to evaluate the effects of genomic information on the genetic evaluation of hip height in Brahman cattle using different matrices built from genomic and pedigree data. Hip height measurements from 1,695 animals, genotyped with high-density SNP chip or imputed from 50 K high-density SNP chip, were used. The numerator relationship matrix (NRM) was compared with the H matrix, which incorporated the NRM and genomic relationship (G) matrix simultaneously. The genotypes were used to estimate three versions of G: observed allele frequency (HGOF), average minor allele frequency (HGMF), and frequency of 0.5 for all markers (HG50). For matrix comparisons, animal data were either used in full or divided into calibration (80% older animals) and validation (20% younger animals) datasets. The accuracy values for the NRM, HGOF, and HG50 were 0.776, 0.813, and 0.594, respectively. The NRM and HGOF showed similar minor variances for diagonal and off-diagonal elements, as well as for estimated breeding values. The use of genomic information resulted in relationship estimates similar to those obtained based on pedigree; however, HGOF is the best option for estimating the genomic relationship matrix and results in a higher prediction accuracy. The ranking of the top 20% animals was very similar for all matrices, but the ranking within them varies depending on the method used.

Download Full-text

Implementation of the Realized Genomic Relationship Matrix to Open-Pollinated White Spruce Family Testing for Disentangling Additive from Nonadditive Genetic Effects

G3 Genes|Genome|Genetics ◽

10.1534/g3.115.025957 ◽

2016 ◽

Vol 6 (3) ◽

pp. 743-753 ◽

Cited By ~ 32

Author(s):

Omnia Gamal El-Dien ◽

Blaise Ratcliffe ◽

Jaroslav Klápště ◽

Ilga Porth ◽

Charles Chen ◽

...

Keyword(s):

White Spruce ◽

Genetic Effects ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Family Testing

Download Full-text

Application of Low Coverage Genotyping by Sequencing in Selectively Bred Arctic Charr (Salvelinus alpinus)

G3 Genes|Genome|Genetics ◽

10.1534/g3.120.401295 ◽

2020 ◽

Vol 10 (6) ◽

pp. 2069-2078 ◽

Cited By ~ 2

Author(s):

Christos Palaiokostas ◽

Shannon M. Clarke ◽

Henrik Jeuthe ◽

Rudiger Brauning ◽

Timothy P. Bilton ◽

...

Keyword(s):

Principal Components ◽

Salvelinus Alpinus ◽

Arctic Charr ◽

Genotyping By Sequencing ◽

Breeding Value ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Value Estimation ◽

Low Coverage

Arctic charr (Salvelinus alpinus) is a species of high economic value for the aquaculture industry, and of high ecological value due to its Holarctic distribution in both marine and freshwater environments. Novel genome sequencing approaches enable the study of population and quantitative genetic parameters even on species with limited or no prior genomic resources. Low coverage genotyping by sequencing (GBS) was applied in a selected strain of Arctic charr in Sweden originating from a landlocked freshwater population. For the needs of the current study, animals from year classes 2013 (171 animals, parental population) and 2017 (759 animals; 13 full sib families) were used as a template for identifying genome wide single nucleotide polymorphisms (SNPs). GBS libraries were constructed using the PstI and MspI restriction enzymes. Approximately 14.5K SNPs passed quality control and were used for estimating a genomic relationship matrix. Thereafter a wide range of analyses were conducted in order to gain insights regarding genetic diversity and investigate the efficiency of the genomic information for parentage assignment and breeding value estimation. Heterozygosity estimates for both year classes suggested a slight excess of heterozygotes. Furthermore, FST estimates among the families of year class 2017 ranged between 0.009 – 0.066. Principal components analysis (PCA) and discriminant analysis of principal components (DAPC) were applied aiming to identify the existence of genetic clusters among the studied population. Results obtained were in accordance with pedigree records allowing the identification of individual families. Additionally, DNA parentage verification was performed, with results in accordance with the pedigree records with the exception of a putative dam where full sib genotypes suggested a potential recording error. Breeding value estimation for juvenile growth through the usage of the estimated genomic relationship matrix clearly outperformed the pedigree equivalent in terms of prediction accuracy (0.51 opposed to 0.31). Overall, low coverage GBS has proven to be a cost-effective genotyping platform that is expected to boost the selection efficiency of the Arctic charr breeding program.

Download Full-text

Efficient Algorithms for Calculating Epistatic Genomic Relationship Matrices

Genetics ◽

10.1534/genetics.120.303459 ◽

2020 ◽

Vol 216 (3) ◽

pp. 651-669

Author(s):

Yong Jiang ◽

Jochen C. Reif

Keyword(s):

Hadamard Product ◽

Association Studies ◽

Complete Solution ◽

Natural Generalization ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genome Wide Association Studies ◽

Genomic Relationship ◽

Genome Wide ◽

High Degree

The genomic relationship matrix plays a key role in the analysis of genetic diversity, genomic prediction, and genome-wide association studies. The epistatic genomic relationship matrix is a natural generalization of the classic genomic relationship matrix in the sense that it implicitly models the epistatic effects among all markers. Calculating the exact form of the epistatic relationship matrix requires high computational load, and is hence not feasible when the number of markers is large, or when high-degree of epistasis is in consideration. Currently, many studies use the Hadamard product of the classic genomic relationship matrix as an approximation. However, the quality of the approximation is difficult to investigate in the strict mathematical sense. In this study, we derived iterative formulas for the precise form of the epistatic genomic relationship matrix for arbitrary degree of epistasis including both additive and dominance interactions. The key to our theoretical results is the observation of an interesting link between the elements in the genomic relationship matrix and symmetric polynomials, which motivated the application of the corresponding mathematical theory. Based on the iterative formulas, efficient recursive algorithms were implemented. Compared with the approximation by the Hadamard product, our algorithms provided a complete solution to the problem of calculating the exact epistatic genomic relationship matrix. As an application, we showed that our new algorithms easily relieved the computational burden in a previous study on the approximation behavior of two limit models.

Download Full-text

Shrinkage estimation of the genomic relationship matrix can improve genomic estimated breeding values in the training set

Theoretical and Applied Genetics ◽

10.1007/s00122-015-2464-6 ◽

2015 ◽

Vol 128 (4) ◽

pp. 693-703 ◽

Cited By ~ 14

Author(s):

Dominik Müller ◽

Frank Technow ◽

Albrecht E. Melchinger

Keyword(s):

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Shrinkage Estimation ◽

Genomic Relationship ◽

Training Set ◽

Breeding Values ◽

Genomic Estimated Breeding Values ◽

Estimated Breeding Values

Download Full-text

A Weighted Genomic Relationship Matrix Based on Fixation Index (FST) Prioritized SNPs for Genomic Selection

Genes ◽

10.3390/genes10110922 ◽

2019 ◽

Vol 10 (11) ◽

pp. 922

Author(s):

Ling-Yun Chang ◽

Sajjad Toghiani ◽

El Hamidi Hay ◽

Samuel E. Aggrey ◽

Romdhane Rekaya

Keyword(s):

Genomic Selection ◽

Statistical Power ◽

Fixation Index ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Nucleotide Polymorphisms ◽

Genomic Relationship ◽

Single Nucleotide ◽

Relative Contribution ◽

Estimation Of Variance

A dramatic increase in the density of marker panels has been expected to increase the accuracy of genomic selection (GS), unfortunately, little to no improvement has been observed. By including all variants in the association model, the dimensionality of the problem should be dramatically increased, and it could undoubtedly reduce the statistical power. Using all Single nucleotide polymorphisms (SNPs) to compute the genomic relationship matrix (G) does not necessarily increase accuracy as the additive relationships can be accurately estimated using a much smaller number of markers. Due to these limitations, variant prioritization has become a necessity to improve accuracy. The fixation index (FST) as a measure of population differentiation has been used to identify genome segments and variants under selection pressure. Using prioritized variants has increased the accuracy of GS. Additionally, FST can be used to weight the relative contribution of prioritized SNPs in computing G. In this study, relative weights based on FST scores were developed and incorporated into the calculation of G and their impact on the estimation of variance components and accuracy was assessed. The results showed that prioritizing SNPs based on their FST scores resulted in an increase in the genetic similarity between training and validation animals and improved the accuracy of GS by more than 5%.

Download Full-text

335 Genomic predictions with a multi-breed genomic relationship matrix

Journal of Animal Science ◽

10.1093/jas/skz258.099 ◽

2019 ◽

Vol 97 (Supplement_3) ◽

pp. 49-50

Author(s):

Yvette Steyn ◽

Daniela Lourenco ◽

Ignacy Misztal

Keyword(s):

Prediction Accuracy ◽

Negative Impact ◽

Reference Population ◽

Single Step ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Effective Population ◽

Specific Allele ◽

Missing Genotypes

Abstract Multi-breed evaluations have the advantage of increasing the size of the reference population for genomic evaluations and are quite simple; however, combining breeds usually have a negative impact on prediction accuracy. The aim of this study was to evaluate the use of a multi-breed genomic relationship matrix (G), where SNP for each breed are non-shared. The multi-breed G is set assuming known genotypes for one breed and missing genotypes for the remaining breeds. This setup may avoid spurious IBS relationships between breeds and considers breed-specific allele frequencies. This scenario was contrasted to multi-breed evaluations where all SNP are shared, i.e., the same SNP, and to single-breed evaluations. Different SNP densities, namely 9k and 45k, and different effective population sizes (Ne) were tested. Five breeds mimicking recent beef cattle populations that diverged from the same historical population were simulated using different selection criteria. It was assumed that QTL effects were the same over all breeds. For the recent population, generations 1 to 9 had approximately half of the animals genotyped, whereas all 1200 animals were genotyped in generation 10. Genotyped animals in generation 10 were set as validation; therefore, each breed had a validation set. Analysis were performed using single-step GBLUP (ssGBLUP). Prediction accuracy was calculated as correlation between true (T) and genomic estimated (GE) BV. Accuracies of GEBV were lower for the larger Ne and low SNP density. All three scenarios using 45K resulted in similar accuracies, suggesting that the marker density is high enough to account for relationships and linkage disequilibrium with QTL. A shared multi-breed evaluation using 9K resulted in a decrease of accuracy of 0.08 for a smaller Ne and 0.11 for a larger Ne. This loss was mostly avoided when markers were treated as non-shared within the same genomic relationship matrix.

Download Full-text