Integration of Dominance and Marker×Environment Interactions into Maize Genomic Prediction Models

Mapping Intimacies ◽

10.1101/362608 ◽

2018 ◽

Author(s):

Luis Felipe Ventorim Ferrão ◽

Caillet Dornelles Marinho ◽

Patricio R. Munoz ◽

Marcio F. R. Resende

Keyword(s):

Genomic Selection ◽

Prediction Models ◽

Predictive Ability ◽

Additive Models ◽

Hybrid Breeding ◽

Maize Breeding ◽

Prediction Ability ◽

Heterotic Groups ◽

Breeding Programs ◽

Grain Moisture

Abstract Hybrid breeding programs are driven by the potential to exploit heterosis in traits with non-additive inheritance. Traditionally, progress has been achieved by crossing lines from different heterotic groups and measuring phenotypic performance of hybrids in multiple environment trials. With the reduction in genotyping prices, genomic selection has become a reality for phenotype prediction and a promising tool to predict hybrid performance. However, its prediction ability is directly associated with models that represent the trait and breeding scheme under investigation. Herein, we assess modelling approaches where dominance effects and multi-environment statistical models are considered for genomic selection in maize hybrids. To this end, we evaluated the predictive ability for grain yield and grain moisture collected over three production cycles in different locations. Hybrid genotypes were inferred in silico based on their parental inbred lines using single-nucleotide polymorphism markers obtained via a 500k SNP chip. We considered the importance of decomposing additive and dominance marker effects into components that are constant across environments and deviations that are group-specific. Predictions within and across environments were tested. The incorporation of dominance effects increased the predictive ability for grain production by up to 30% in some scenarios. In contrast, additive models yielded better results for grain moisture. For multi-environment modelling, the inclusion of interaction effects increased the predictive ability overall. More generally, we demonstrate that including dominance and genotype by environment interactions resulted in gains in accuracy and hence could be considered for genomic selection implementation in maize breeding programs.
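
The in silico step described in this abstract can be illustrated with a minimal sketch. It assumes fully homozygous inbred parents coded 0 or 2 (copies of a reference allele), additive coding as the dosage 0/1/2, and dominance coding as a heterozygote indicator. These codings are a common convention and are an assumption here, not necessarily the parameterization the authors used; the function names are hypothetical.

```python
def hybrid_genotype(parent_a, parent_b):
    """Infer a hybrid's marker dosages from two homozygous inbred parents.

    Each parent is a sequence of dosages coded 0 or 2. The hybrid receives
    one allele from each parent, so its dosage is the parental mean.
    """
    if len(parent_a) != len(parent_b):
        raise ValueError("parents must be genotyped at the same markers")
    for dosage in list(parent_a) + list(parent_b):
        if dosage not in (0, 2):
            raise ValueError("inbred parents are assumed homozygous (0 or 2)")
    return [(a + b) // 2 for a, b in zip(parent_a, parent_b)]


def additive_code(dosages):
    """Additive covariate: the allele dosage itself (0, 1 or 2)."""
    return list(dosages)


def dominance_code(dosages):
    """Dominance covariate (assumed coding): 1 for heterozygotes, else 0."""
    return [1 if d == 1 else 0 for d in dosages]


if __name__ == "__main__":
    line_a = [0, 2, 2, 0]  # hypothetical inbred line from one heterotic group
    line_b = [2, 2, 0, 0]  # hypothetical inbred line from another group
    hybrid = hybrid_genotype(line_a, line_b)
    print(hybrid, additive_code(hybrid), dominance_code(hybrid))
```

Only markers at which the two parents differ produce a non-zero dominance covariate, which is one way to see why crosses between divergent heterotic groups carry the dominance signal.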

Genomic Selection in Winter Wheat Breeding Using a Recommender Approach

Genes ◽

10.3390/genes11070779 ◽

2020 ◽

Vol 11 (7) ◽

pp. 779

Author(s):

Dennis N. Lozada ◽

Arron H. Carter

Keyword(s):

Winter Wheat ◽

Genomic Selection ◽

Prediction Models ◽

Heading Date ◽

Predictive Ability ◽

Wheat Breeding ◽

Snp Markers ◽

Bayesian Regression ◽

Phenotypic Trait ◽

Breeding Programs

Achieving optimal predictive ability is key to increasing the relevance of implementing genomic selection (GS) approaches in plant breeding programs. The potential of an item-based collaborative filtering (IBCF) recommender system in the context of multi-trait, multi-environment GS has been explored. Different GS scenarios for IBCF were evaluated for a diverse population of winter wheat lines adapted to the Pacific Northwest region of the US. Predictions across years through cross-validations resulted in improved predictive ability when there was a high correlation between environments. Using multiple spectral traits collected from high-throughput phenotyping resulted in better GS accuracies for grain yield (GY) compared to using only single traits for predictions. Trait adjustments through various Bayesian regression models using genomic information from SNP markers were the most effective in achieving improved accuracies for GY, heading date, and plant height among the GS scenarios evaluated. Bayesian LASSO had the highest predictive ability compared to other models for phenotypic trait adjustments. IBCF gave competitive accuracies compared to a genomic best linear unbiased predictor (GBLUP) model for predicting different traits. Overall, an IBCF approach could be used as an alternative to traditional prediction models for important target traits in wheat breeding programs.

Financial Compass for Slovak Enterprises: Modeling Economic Stability of Agricultural Entities

Journal of Risk and Financial Management ◽

10.3390/jrfm13050092 ◽

2020 ◽

Vol 13 (5) ◽

pp. 92

Author(s):

Katarina Valaskova ◽

Pavol Durana ◽

Peter Adamko ◽

Jaroslav Jaros

Keyword(s):

Prediction Models ◽

Predictive Accuracy ◽

Characteristic Curve ◽

Confusion Matrix ◽

Predictive Ability ◽

Early Warning Systems ◽

Emerging Countries ◽

Bankruptcy Prediction ◽

Financial Health ◽

Prediction Ability

The risk of corporate financial distress negatively affects the operation of the enterprise itself and can change the financial performance of all other partners that come into close or wider contact. To identify these risks, business entities use early warning systems, prediction models, which help identify the level of corporate financial health. Despite the fact that the relevant financial analyses and financial health predictions are crucial to mitigate or eliminate the potential risks of bankruptcy, the modeling of financial health in emerging countries is mostly based on models which were developed in different economic sectors and countries. However, several prediction models have been introduced in emerging countries (also in Slovakia) in the last few years. Thus, the main purpose of the paper is to verify the predictive ability of the bankruptcy models formed in conditions of the Slovak economy in the sector of agriculture. To compare their predictive accuracy the confusion matrix (cross tables) and the receiver operating characteristic curve are used, which allow more detailed analysis than the mere proportion of correct classifications (predictive accuracy). The results indicate that the models developed in the specific economic sector highly outperform the prediction ability of other models either developed in the same country or abroad, usage of which is then questionable considering the issue of prediction accuracy. The research findings confirm that the highest predictive ability of the bankruptcy prediction models is achieved provided that they are used in the same economic conditions and industrial sector in which they were primarily developed.
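
The two evaluation tools named in this abstract can be sketched as follows. The code assumes binary labels (1 = bankrupt, 0 = healthy) and a model that outputs a distress score; the labels, scores and the 0.5 threshold are made-up illustrative values, not data from the paper. The AUC is computed as the share of positive/negative pairs ranked correctly, which is equivalent to the area under the ROC curve.

```python
def confusion_matrix(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1/0)."""
    counts = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            counts["tp"] += 1
        elif truth == 0 and pred == 1:
            counts["fp"] += 1
        elif truth == 1 and pred == 0:
            counts["fn"] += 1
        else:
            counts["tn"] += 1
    return counts


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outscores a random
    negative, with ties counted as one half."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


if __name__ == "__main__":
    labels = [1, 1, 0, 0]           # illustrative only
    scores = [0.9, 0.4, 0.6, 0.1]   # illustrative only
    predicted = [1 if s >= 0.5 else 0 for s in scores]
    print(confusion_matrix(labels, predicted), roc_auc(labels, scores))
```

Unlike the plain proportion of correct classifications, the confusion matrix separates the two error types, and the AUC does not depend on a single classification threshold.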

Maximizing efficiency of genomic selection in CIMMYT’s tropical maize breeding program

Theoretical and Applied Genetics ◽

10.1007/s00122-020-03696-9 ◽

2020 ◽

Author(s):

Sikiru Adeniyi Atanda ◽

Michael Olsen ◽

Juan Burgueño ◽

Jose Crossa ◽

Daniel Dzidzienyo ◽

...

Keyword(s):

Genomic Selection ◽

Prediction Accuracy ◽

Large Scale ◽

Primary Objective ◽

Breeding Program ◽

Breeding Cycle ◽

Training Set ◽

Maize Breeding ◽

Phenotypic Data ◽

Breeding Programs

Key message: Historical data from breeding programs can be efficiently used to improve genomic selection accuracy, especially when the training set is optimized to subset individuals most informative of the target testing set. Abstract: The current strategy for large-scale implementation of genomic selection (GS) at the International Maize and Wheat Improvement Center (CIMMYT) global maize breeding program has been to train models using information from full-sibs in a “test-half-predict-half” approach. Although effective, this approach has limitations, as it requires large full-sib populations and limits the ability to shorten variety testing and breeding cycle times. The primary objective of this study was to identify optimal experimental and training set designs to maximize prediction accuracy of GS in CIMMYT’s maize breeding programs. Training set (TS) design strategies were evaluated to determine the most efficient use of phenotypic data collected on relatives for genomic prediction (GP) using datasets containing 849 (DS1) and 1389 (DS2) DH-lines evaluated as testcrosses in 2017 and 2018, respectively. Our results show there is merit in the use of multiple bi-parental populations as TS when selected using algorithms to maximize relatedness between the training and prediction sets. In a breeding program where relevant past breeding information is not readily available, the phenotyping expenditure can be spread across connected bi-parental populations by phenotyping only a small number of lines from each population. This significantly improves prediction accuracy compared to within-population prediction, especially when the TS for within full-sib prediction is small. Finally, we demonstrate that prediction accuracy in either sparse testing or “test-half-predict-half” can further be improved by optimizing which lines are planted for phenotyping and which lines are to be only genotyped for advancement based on GP.

Combining genetic resources and elite material populations to improve the accuracy of genomic prediction in apple

G3 Genes|Genome|Genetics ◽

10.1093/g3journal/jkab420 ◽

2021 ◽

Author(s):

Xabi Cazenave ◽

Bernard Petit ◽

Marc Lateur ◽

Hilde Nybom ◽

Jiri Sedlak ◽

...

Keyword(s):

Genetic Resources ◽

Genomic Selection ◽

Predictive Ability ◽

Practical Implementation ◽

Specific Marker ◽

Training Set ◽

High Genetic Diversity ◽

Breeding Programs ◽

Breeding Cycles ◽

Two Populations

Abstract Genomic selection is an attractive strategy for apple breeding that could reduce the length of breeding cycles. A possible limitation to the practical implementation of this approach lies in the creation of a training set large and diverse enough to ensure accurate predictions. In this study, we investigated the potential of combining two available populations, i.e. genetic resources and elite material, in order to obtain a large training set with a high genetic diversity. We compared the predictive ability of genomic predictions within-population, across-population or when combining both populations, and tested a model accounting for population-specific marker effects in this last case. The obtained predictive abilities were moderate to high according to the studied trait and small increases in predictive ability could be obtained for some traits when the two populations were combined into a unique training set. We also investigated the potential of such a training set to predict hybrids resulting from crosses between the two populations, with a focus on the method to design the training set and the best proportion of each population to optimize predictions. The measured predictive abilities were very similar for all the proportions, except for the extreme cases where only one of the two populations was used in the training set, in which case predictive abilities could be lower than when using both populations. Using an optimization algorithm to choose the genotypes in the training set also led to higher predictive abilities than when the genotypes were chosen at random. Our results provide guidelines to initiate breeding programs that use genomic selection when the implementation of the training set is a limitation.

Combining ability and response to CMS in reverse diploid maize lines developed at VIR

PLANT BIOTECHNOLOGY AND BREEDING ◽

10.30901/2658-6266-2019-4-o2 ◽

2020 ◽

Vol 2 (4) ◽

pp. 15-23

Author(s):

E. B. Khatefov ◽

B. R. Shomakhov ◽

R. S. Kushkhova ◽

R. A. Kudaev ◽

Z. T. Khashirova ◽

...

Keyword(s):

Combining Ability ◽

Selection Index ◽

Inbred Lines ◽

Breeding Value ◽

Hybrid Breeding ◽

Maize Breeding ◽

Tetraploid Population ◽

Grain Moisture ◽

Hybrid Maize ◽

Period Duration

Abstract. Hybrid maize breeding requires constant renewal of the source material. In this regard, broadening of genetic variation in parental lines is one of the primary tasks in heterotic hybrid breeding programs. The use of reverse diploid inbred lines derived from a tetraploid population is considered as an innovative approach to achieve this goal. Results. The investigated material comprised 106 reverse diploid (rediploid) inbred lines originating from diploid plants selected in segregating selfed progenies of triploid populations and consequently subjected to inbreeding, while triploid populations resulted from a cross between plants of a tetraploid population with a broad genetic basis and a diploid line. The use of a system of crosses with 37 sterile testers belonging to different FAO maize maturity groups allowed the evaluation of the rediploid lines’ combining ability and the response to M and C types of CMS. Field tests were conducted in 2019 in the steppe zone of Kabardino-Balkaria. Forty-six lines (43.3%) with the combining ability ranging from ultra-high to good, and 78 lines (73.6%) maintaining the CMS character were identified. Among them, 59 lines (55.7%) were maintainers for the M type CMS, 15 lines (14.1%) for C type CMS, and 4 lines maintained sterility for both CMS types. Sixteen lines (15.1%) restored pollen fertility of the forms with M type CMS, 11 lines (10.4%) were restorers for the C type and one line turned out to be a universal restorer for both CMS types. Ranking by the “sprout - flowering of ears” interstage period duration showed that most of the lines (66.0%) with the ability to maintain sterility or restore male fertility of M and C CMS types, as well as with the combining ability from ultra-high to good (32.6%) fell into the group with the flowering period duration of 51-55 days. According to the results of the harvested grain moisture assessment, the hybrids ♀(РГС246с × OL213) × ♂92с5986·2·3, ♀714М × ♂1/67-1 and ♀714М × ♂92н136-4, with the values of 13.6%, 13.9% and 14.0%, respectively, were identified. The hybrids ♀714М × ♂1/67-1 and ♀(OL563С × KL1392) × ♂92с0653 2 1 2 were characterized by the maximum value of the selection index, i.e. 5.03 and 5.13, respectively. Conclusions. The results of the studies showed the breeding value of rediploid lines as an initial material for hybrid maize breeding.

Genetic diversity among tropical maize inbred lines as revealed by SSR markers

Australian Journal of Crop Science ◽

10.21475/ajcs.20.14.12.1589.pdf ◽

2020 ◽

pp. 2010-2019

Author(s):

Maizura Abu Sin ◽

Ghizan Saleh ◽

Nur Ashikin Psyquay Abdullah ◽

Pedram Kashiani

Keyword(s):

Genetic Diversity ◽

Ssr Markers ◽

Gene Diversity ◽

Polymorphic Information Content ◽

Similarity Index ◽

Inbred Lines ◽

Hybrid Breeding ◽

Heterotic Groups ◽

Breeding Programs ◽

Maize Inbred Lines

Genetic diversity and phenotypic superiority are important attributes of parental inbred lines for use in hybrid breeding programs. In this study, genetic diversity among 30 maize (Zea mays L.) inbred lines, comprising 28 introductions from the International Maize and Wheat Improvement Center (CIMMYT), one from Indonesia and one locally developed line, was evaluated using 100 simple sequence repeat (SSR) markers, as early screening for potential parents of hybrid varieties. All markers were polymorphic, with a total of 550 unique alleles detected on the 100 loci from the 30 inbred lines. Allelic richness ranged from 2 to 13 per locus, with an average of 5.50 alleles (na). The number of effective alleles (ne) was 3.75 per locus, indicating their high effectiveness in revealing diversity among inbred lines. Average polymorphic information content (PIC) was 0.624, with values ranging from 0.178 to 0.874, indicating high informativeness of the markers. High gene diversity was observed on Chromosomes 8 and 4, with a high number of effective alleles, indicating their potential usefulness for QTL analysis. The UPGMA dendrogram constructed identified four heterotic groups within a similarity index of 0.350, indicating that these markers were able to group the inbred lines. The three-dimensional PCoA plot also supports the dendrogram grouping, indicating that these two methods complement each other. Inbred lines in different heterotic groups have originated from different backgrounds and population sources. Information on genetic diversity among the maize inbred lines is useful in developing strategies exploiting heterosis in breeding programs.
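
The per-locus statistics reported in this abstract (gene diversity, number of effective alleles ne, and PIC) can be computed from allele frequencies as sketched below. The sketch assumes the standard definitions: gene diversity as 1 − Σp², ne as 1/Σp², and PIC in the form of Botstein et al. (1980). The abstract does not state which formulas the authors used, and the frequencies in the example are invented for illustration.

```python
from itertools import combinations


def gene_diversity(freqs):
    """Expected heterozygosity at one locus: 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def effective_alleles(freqs):
    """Number of effective alleles at one locus: 1 / sum(p_i^2)."""
    return 1.0 / sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphic information content:
    1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2.0 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    locus = [0.5, 0.3, 0.2]  # illustrative allele frequencies at one SSR locus
    assert abs(sum(locus) - 1.0) < 1e-9
    print(gene_diversity(locus), effective_alleles(locus), pic(locus))
```

PIC is never larger than gene diversity, because it subtracts a further term for matings that are uninformative despite heterozygosity; the gap shrinks as the number of alleles grows.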

Combining genetic resources and elite material populations to improve the accuracy of genomic prediction in apple

10.1101/2021.08.27.457920 ◽

2021 ◽

Author(s):

Xabi Cazenave ◽

Bernard Petit ◽

Francois Laurens ◽

Charles-Eric Durel ◽

Helene Muranty

Keyword(s):

Genetic Resources ◽

Genomic Selection ◽

Predictive Ability ◽

Practical Implementation ◽

Specific Marker ◽

Training Set ◽

High Genetic Diversity ◽

Breeding Programs ◽

Breeding Cycles ◽

Two Populations

Genomic selection is an attractive strategy for apple breeding that could reduce the length of breeding cycles. A possible limitation to the practical implementation of this approach lies in the creation of a training set large and diverse enough to ensure accurate predictions. In this study, we investigated the potential of combining two available populations, i.e. genetic resources and elite material, in order to obtain a large training set with a high genetic diversity. We compared the predictive ability of genomic predictions within-population, across-population or when combining both populations, and tested a model accounting for population-specific marker effects in this last case. The obtained predictive abilities were moderate to high according to the studied trait and were always highest when the two populations were combined into a unique training set. We also investigated the potential of such a training set to predict hybrids resulting from crosses between the two populations, with a focus on the method to design the training set and the best proportion of each population to optimize predictions. The measured predictive abilities were very similar for all the proportions, except for the extreme cases where only one of the two populations was used in the training set, in which case predictive abilities could be lower than when using both populations. Using an optimization algorithm to choose the genotypes in the training set also led to higher predictive abilities than when the genotypes were chosen at random. Our results provide guidelines to initiate breeding programs that use genomic selection when the implementation of the training set is a limitation.

Genomic predictive ability for foliar nutritive traits in perennial ryegrass

10.1101/727958 ◽

2019 ◽

Author(s):

Sai Krishna Arojju ◽

Mingshu Cao ◽

M. Z. Zulfi Jahufer ◽

Brent A Barrett ◽

Marty J Faville

Keyword(s):

Genomic Selection ◽

Genomic Prediction ◽

Nutritive Value ◽

Prediction Models ◽

Genotypic Variation ◽

Genetic Correlations ◽

Predictive Ability ◽

Water Soluble ◽

Training Set ◽

Sib Families

Abstract Forage nutritive value impacts animal nutrition, which underpins livestock productivity, reproduction and health. Genetic improvement for nutritive traits has been limited, as they are typically expensive and time-consuming to measure through conventional methods. Genomic selection is appropriate for such complex and expensive traits, enabling cost-effective prediction of breeding values using genome-wide markers. The aims of the present study were to assess the potential of genomic selection for a range of nutritive traits in a multi-population training set, and to quantify contributions of genotypic, environmental and genotype-by-environment (G × E) variance components to trait variation and heritability for nutritive traits. The training set consisted of a total of 517 half-sibling (half-sib) families, from five advanced breeding populations, evaluated in two distinct New Zealand grazing environments. Autumn-harvested samples were analyzed for 18 nutritive traits and maternal parents of the half-sib families were genotyped using genotyping-by-sequencing. Significant (P<0.05) genotypic variation was detected for all nutritive traits and genomic heritability (h2g) was moderate to high (0.20 to 0.74). G × E interactions were significant and particularly large for water soluble carbohydrate (WSC), crude fat, phosphorus (P) and crude protein. GBLUP, KGD-GBLUP and BayesC genomic prediction models displayed similar predictive ability, estimated by 10-fold cross validation, for all nutritive traits with values ranging from r = 0.16 to 0.45 using phenotypes from across two environments. High predictive ability was observed for the mineral traits sulphur (0.44), sodium (0.45) and magnesium (0.45) and the lowest values were observed for P (0.16), digestibility (0.22) and high molecular weight WSC (0.23). Predictive ability estimates for most nutritive traits were retained when marker number was reduced from 1 million to as few as 50,000. The moderate to high predictive abilities observed suggest that implementation of genomic selection is feasible for most of the nutritive traits examined. For traits with lower predictive ability, multi-trait genomic prediction approaches that exploit the strong genetic correlations observed amongst some nutritive traits may be useful. This appears to be particularly important for WSC, considered one of the primary constituents of nutritive value for forages.
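
Predictive ability estimated by k-fold cross validation, as mentioned in this abstract, can be sketched as the Pearson correlation between held-out predictions and observed phenotypes. In the sketch below, a single-covariate least-squares regression stands in for the GBLUP and BayesC models, which are not reproduced here. The function names, the pooling of predictions across folds before computing r, and the toy data are all assumptions made for illustration.

```python
import random
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simple_regression(x_train, y_train, x_test):
    """Placeholder predictor: one-covariate least squares (not GBLUP)."""
    mx, my = mean(x_train), mean(y_train)
    slope = (sum((a - mx) * (b - my) for a, b in zip(x_train, y_train))
             / sum((a - mx) ** 2 for a in x_train))
    return [my + slope * (v - mx) for v in x_test]


def k_fold_predictive_ability(x, y, fit_predict, k=10, seed=0):
    """Predict every record once from a model trained without its fold,
    then correlate pooled predictions with observed values."""
    indices = list(range(len(y)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    predicted = [None] * len(y)
    for fold in folds:
        held_out = set(fold)
        train = [i for i in indices if i not in held_out]
        preds = fit_predict([x[i] for i in train],
                            [y[i] for i in train],
                            [x[i] for i in fold])
        for i, p in zip(fold, preds):
            predicted[i] = p
    return pearson_r(predicted, y)


if __name__ == "__main__":
    rng = random.Random(1)
    x = [rng.gauss(0, 1) for _ in range(200)]      # toy covariate
    y = [0.5 * v + rng.gauss(0, 1) for v in x]     # toy phenotype with noise
    print(round(k_fold_predictive_ability(x, y, simple_regression), 2))
```

Some studies compute the correlation within each fold and average the results instead of pooling; the two conventions can give slightly different values, so the choice should be stated when predictive abilities are compared.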

Sequencing depth and genotype quality: Accuracy and breeding operation considerations for genomic selection applications in autopolyploid crops

10.1101/2020.02.23.961383 ◽

2020 ◽

Cited By ~ 1

Author(s):

Dorcus C Gemenet ◽

Hannele Lindqvist-Kreuze ◽

Bode A Olukolu ◽

Bert De Boeck ◽

Guilherme da Silva Pereira ◽

...

Keyword(s):

Genomic Selection ◽

Service Providers ◽

Predictive Ability ◽

Genetic Effects ◽

Additive Models ◽

Sequencing Depth ◽

Nucleotide Polymorphisms ◽

Breeding Populations ◽

Wide Range ◽

Additive Genetic Effects

Abstract The autopolyploid nature of potato and sweetpotato ensures a wide range of meiotic configurations and linkage phases, leading to complex gene action and posing problems for genotype data quality and genomic selection analyses. We used a 315-progeny biparental population of hexaploid sweetpotato and a diversity panel of 380 tetraploid potato, genotyped using different platforms, to answer the following questions: i) do polyploid crop breeders need to invest more for additional sequencing depth? ii) how many markers are required to make selection decisions? iii) does considering non-additive genetic effects improve predictive ability (PA)? iv) does considering dosage or quantitative trait loci (QTL) offer significant improvement to PA? Our results show that only a small number of highly informative single nucleotide polymorphisms (SNPs; ≤ 1000) are adequate for prediction, hence it is possible to get this number at the current sequencing depth from most service providers. We also show that considering dosage information and additive-effects only models had the best PA for most traits, while the comparative advantage of considering non-additive genetic effects and including known QTL in the predictive model depended on trait architecture. We conclude that genomic selection can help accelerate the rate of genetic gains in potato and sweetpotato. However, application of genomic selection should be considered as part of optimizing the entire breeding program. Additionally, since the predictions in the current study are based on single populations, further studies on the effects of haplotype structure and inheritance on PA should be conducted in actual multi-generation breeding populations. Key message: Polyploid crop breeders do not need more investment for sequencing depth, dosage information and fewer highly informative SNPs recommended, non-additive models and QTL advantages on prediction dependent on trait architecture.

BWGS: a R package for genomic selection and its application to a wheat breeding programme

10.1101/763037 ◽

2019 ◽

Author(s):

Gilles Charmet ◽

Louis Gautier Tran ◽

Jérôme Auzanneau ◽

Renaud Rincent ◽

Sophie Bouchet

Keyword(s):

Missing Data ◽

Genomic Selection ◽

Prediction Models ◽

Predictive Accuracy ◽

Predictive Ability ◽

Breeding Programme ◽

Training Set ◽

Desktop Computer ◽

Marker Selection ◽

Breeding Programmes

Abstract We developed an integrated R library called BWGS to enable easy computation of Genomic Estimates of Breeding values (GEBV) for genomic selection. BWGS relies on existing R libraries, all freely available from CRAN servers. The two main functions make it possible to run 1) replicated random cross validations within a training set of genotyped and phenotyped lines and 2) GEBV prediction for a set of genotyped-only lines. Options are available for 1) missing data imputation, 2) marker and training set selection and 3) genomic prediction with 15 different methods, either parametric or semi-parametric. The usefulness and efficiency of BWGS are illustrated using a population of wheat lines from a real breeding programme. Adjusted yield data from historical trials (highly unbalanced design) were used for testing the options of BWGS. In total, 760 candidate lines with adjusted phenotypes and genotypes for 47 839 robust SNPs were used. With a simple desktop computer, we obtained results comparable with previously published results on wheat genomic selection. As predicted by theory, the factors that most influence predictive ability, for a given trait of moderate heritability, are the size of the training population and a minimum number of markers for capturing every QTL information. Missing data up to 40%, if randomly distributed, do not degrade predictive ability once imputed, and up to 80% randomly distributed missing data are still acceptable once imputed with the Expectation-Maximization method of package rrBLUP. It is worth noting that selecting the markers most associated with the trait does improve predictive ability, compared with the whole set of markers, but only when marker selection is made on the whole population. When marker selection is made only on the sampled training set, this advantage nearly disappears, since it was clearly due to overfitting. Few differences are observed between the 15 prediction models with this dataset. Although non-parametric methods that are supposed to capture non-additive effects have slightly better predictive accuracy, differences remain small. Finally, the GEBV from the 15 prediction models are all highly correlated with each other. These results are encouraging for an efficient use of genomic selection in applied breeding programmes and BWGS is a simple and powerful toolbox to apply in breeding programmes or training activities.
