Deciphering the genic basis of environmental adaptations by simultaneous forward and reverse genetics in Saccharomyces cerevisiae

Mapping Intimacies ◽

10.1101/087510 ◽

2016 ◽

Cited By ~ 2

Author(s):

Calum J. Maclean ◽

Brian P.H. Metzger ◽

Jian-Rong Yang ◽

Wei-Chin Ho ◽

Bryan Moyers ◽

...

Keyword(s):

Saccharomyces Cerevisiae ◽

Population Structure ◽

Association Mapping ◽

Cell Biology ◽

Reverse Genetics ◽

Dna Barcode ◽

Genome Wide ◽

Environmental Adaptations ◽

Yeast Genes ◽

Genomic Regions

ABSTRACTThe budding yeast Saccharomyces cerevisiae is the best studied eukaryote in molecular and cell biology, but its utility for understanding the genetic basis of natural phenotypic variation is limited by the inefficiency of association mapping owing to strong and complex population structure. To facilitate association mapping, we analyzed 190 high-quality genomes of diverse strains, including 85 newly sequenced ones, to uncover yeast’s population structure that varies substantially among genomic regions. We identified 181 yeast genes that are absent from the reference genome and demonstrated their expression and role in important functions such as drug resistance. We then simultaneously measured the growth rates of over 4500 lab strains each deficient of a nonessential gene and 81 natural strains across multiple environments using unique DNA barcode present in each strain. We combined the genome-wide reverse genetic information with genome-wide association analysis to determine potential genomic regions of importance to environmental adaptations, and for a subset experimentally validated their role by reciprocal hemizygosity tests. The resources provided permit efficient and reliable association mapping in yeast and significantly enhances its value as a model for understanding the genetic mechanisms of phenotypic polymorphism and evolution.

Download Full-text

Genome-wide association mapping of genomic regions associated with phenotypic traits in Canadian western spring wheat

Molecular Breeding ◽

10.1007/s11032-017-0741-6 ◽

2017 ◽

Vol 37 (11) ◽

Cited By ~ 7

Author(s):

Hua Chen ◽

Kassa Semagn ◽

Muhammad Iqbal ◽

Neshat Pazooki Moakhar ◽

Teketel Haile ◽

...

Keyword(s):

Association Mapping ◽

Spring Wheat ◽

Genome Wide Association ◽

Phenotypic Traits ◽

Genome Wide ◽

Genomic Regions

Download Full-text

Multiplex Confounding Factor Correction for Genomic Association Mapping with Squared Sparse Linear Mixed Model

10.1101/228114 ◽

2017 ◽

Author(s):

Haohan Wang ◽

Xiang Liu ◽

Yunpeng Xiao ◽

Ming Xu ◽

Eric P. Xing

Keyword(s):

Population Structure ◽

Association Mapping ◽

Complex Traits ◽

Association Studies ◽

Phenotypic Variability ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Confounding Factors ◽

Genetic Loci ◽

Genome Wide

AbstractGenome-wide Association Study has presented a promising way to understand the association between human genomes and complex traits. Many simple polymorphic loci have been shown to explain a significant fraction of phenotypic variability. However, challenges remain in the non-triviality of explaining complex traits associated with multifactorial genetic loci, especially considering the confounding factors caused by population structure, family structure, and cryptic relatedness. In this paper, we propose a Squared-LMM (LMM2) model, aiming to jointly correct population and genetic confounding factors. We offer two strategies of utilizing LMM2 for association mapping: 1) It serves as an extension of univariate LMM, which could effectively correct population structure, but consider each SNP in isolation. 2) It is integrated with the multivariate regression model to discover association relationship between complex traits and multifactorial genetic loci. We refer to this second model as sparse Squared-LMM (sLMM2). Further, we extend LMM2/sLMM2 by raising the power of our squared model to the LMMn/sLMMn model. We demonstrate the practical use of our model with synthetic phenotypic variants generated from genetic loci of Arabidopsis Thaliana. The experiment shows that our method achieves a more accurate and significant prediction on the association relationship between traits and loci. We also evaluate our models on collected phenotypes and genotypes with the number of candidate genes that the models could discover. The results suggest the potential and promising usage of our method in genome-wide association studies.

Download Full-text

Genome-wide association study (GWAS) for morphological and yield-related traits in an oil palm hybrid (Elaeis oleifera x Elaeis guineensis) population

10.21203/rs.2.13122/v3 ◽

2019 ◽

Author(s):

Jaime Osorio ◽

Gina Garzón ◽

Paola Delgadillo ◽

Silvio Bastidas ◽

Leidy Moreno ◽

...

Keyword(s):

Association Mapping ◽

Candidate Genes ◽

Oil Palm ◽

Plant Architecture ◽

Elaeis Guineensis ◽

Genotyping By Sequencing ◽

Genome Wide Association ◽

Elaeis Oleifera ◽

Genome Wide ◽

Genomic Regions

Abstract Background The genus Elaeis has two species of economic importance for the oil palm agroindustry: Elaeis oleifera (O), native to the Americas, and Elaeis guineensis (G), native to Africa. This work provides to our knowledge, the first association mapping study in an interspecific OxG oil palm population, which shows tolerance to pests and diseases, high oil quality, and acceptable fruit bunch production. Results Using genotyping-by-sequencing (GBS), we identified a total of 3,776 single nucleotide polymorphisms (SNPs) that were used to perform a genome-wide association analysis (GWAS) in 378 OxG hybrid population for 10 agronomic traits. Twelve genomic regions (SNPs) were located near candidate genes implicated in multiple functional categories, such as tissue growth, cellular trafficking, and physiological processes. Conclusions We provide new insights on genomic regions that mapped on candidate genes involved in plant architecture and yield. These potential candidate genes need to be confirmed for future targeted functional analyses. Associated markers to the traits of interest may be valuable resources for the development of marker-assisted selection in oil palm breeding. Keywords: Association mapping, Elaeis guineensis , Elaeis oleifera , genotyping-by-sequencing, plant architecture, yield.

Download Full-text

Genome-wide DNA Variants Identify Genetic Diversity and Population Structure of Leptosphaeria maculans in Western Canada

10.21203/rs.3.rs-24766/v2 ◽

2020 ◽

Author(s):

Qilin Chen ◽

Gary Peng ◽

Randy Kutcher ◽

Fengqun Yu

Keyword(s):

Genetic Diversity ◽

Population Structure ◽

Genome Wide Association Study ◽

Leptosphaeria Maculans ◽

Western Canada ◽

Effective Management ◽

Genome Wide ◽

Population Structure Analysis ◽

Dna Variants ◽

Genomic Regions

Abstract Background: Leptosphaeria maculans is a serious concern for canola production in Canada. For effective management, knowledge of the pathogen’s genetic variability and population structure is a prerequisite. Despite some information on race dynamics of the western Canadian L. maculans population in recent years, genetic diversity based on a large number of genome-wide DNA variants has not been investigated.Results: From 1,590 L. maculans isolates collected from 23 field sites in three provinces: Manitoba, Saskatchewan and Alberta, Canada, in the years 2007-2008 and 2012-2014, 150 representative isolates were selected and whole-genome sequenced, and 31,870 polymorphic DNA variants (SNPs and InDels) were used to study L. maculans genetic diversity and population structure. Cluster analysis showed that the genetic diversity levels and isolate groupings varied with the number and genomic regions of the variants involved; isolates collected in 2012-2014 were more genetically diverse than those collected in 2007-2008 when genome-wide variants were considered. The genome wide association study (GWAS) detected variants in egn4_Lema_T86290 (AvrLm4-7), egn4_Lema_T86300 and egn4_Lema_T86310 associated with the year of collection, but no variants was found to be associated with the province or specific location from which the isolates were collected. Population structure analysis indicated the presence of three distinct sub-populations in western Canada. While isolates from Saskatchewan were mainly of one sub-population (sub-pop1), the Alberta isolates comprised two sub-populations (sub-pop1 and sub-pop2), and all the 3 subpopulations were found in Manitoba.Conclusion: The genetic diversity of the western Canadian L. maculans population varied among provinces. It was highly admixed in Manitoba, followed by that in Alberta. The Saskatchewan population had the lowest genetic diversity. Significant genome variation between 2007-2008 and 2012-2014 occurred in the genes egn4_Lema_T86290 (AvrLm4-7), egn4_Lema_T86300 and egn4_Lema_T86310), with AvrLm4-7 becoming much more common in the L. maculans population in the later period.

Download Full-text

Permutation methods for assessing significance in binary trait association mapping with structured samples

10.1101/451377 ◽

2018 ◽

Author(s):

Joelle Mbatchou ◽

Mark Abney ◽

Mary Sara McPeek

Keyword(s):

Population Structure ◽

Association Mapping ◽

Binary Data ◽

Binary Trait ◽

Domestic Dog ◽

Idiopathic Epilepsy ◽

Type 1 Error ◽

Genome Wide ◽

Sample Structure

AbstractIn genetic association analysis of complex traits, permutation testing can be a valuable tool for assessing significance when the asymptotic distribution of the test statistic is unknown or not well-approximated. This commonly arises when the association test statistic is itself a function of multiple correlated statistics. e.g, in tests of gene-set, pathway or genome-wide significance, as well as omnibus tests that combine test statistics that perform well in different scenarios. For genetic association testing in samples with population structure and/or relatedness, use of naive permutation can lead to inflated type 1 error. To address this in quantitative traits, the MVNpermute method was developed. However, for association mapping of a binary trait, the relationship between the mean and variance makes both naive permutation and the MVNpermute method invalid. We propose BRASS, a permutation method for binary trait association mapping in samples that have related individuals and/or population structure. BRASS allows for covariates ascertainment andsimultaneous testing of multiple markers, and it accommodates a wide range of test statistics. We use an estimating equation approach that can be viewed as a hybrid of logistic regression and linear mixed-effects model methods, and we use a combination of principal components and a genetic relatedness matrix to account for sample structure. We show in simulation studies that BRASS maintains correct control of type 1 error in a range of scenarios that include population structure, familial relatedness, ascertainment and phenotype model misspecification. In these settings, only BRASS maintains correct control of type 1 error, performing far better than all other methods. We apply BRASS to two genome-wide analyses in domestic dog, one for elbow dysplasia (ED) in 82 breeds and another for idiopathic epilepsy (IE) in the Irish Wolfhound breed. We detect significant association of IE with SNPs in a previously-identified chromosome 4 region that contains multiple candidate genes.Author summaryPermutation testing is commonly used when distributional assumptions cannot be made or do not apply, or when performing a multiple testing correction, e.g., to assess region-wide or genome-wide significance in association mapping studies. Naively permuting the data is only valid under the assumption of exchangeability, which, in the presence of sample structure and polygenicity, typically does not hold. Linear mixed-model based approaches have been proposed for permutation-based tests with continuous traits that can also adjust for sample structure; however, these may not remain valid when applied to binary traits, as key features of binary data are not well accounted for. We propose BRASS, a permutation-based testing method for binary data that incorporates important characteristics of binary data in the trait model, can accommodate relevant covariates and ascertainment, and adjusts for the presence of structure in the sample. We demonstrate the use of this approach in the context of correcting for multiple testing in two genome-wide association studies in domestic dog: one for elbow dysplasia and one for idiopathic epilepsy.

Download Full-text

Local Ancestry Corrects for Population Structure in Saccharomyces cerevisiae Genome-Wide Association Studies

Genetics ◽

10.1534/genetics.112.144790 ◽

2012 ◽

Vol 192 (4) ◽

pp. 1503-1511 ◽

Cited By ~ 17

Author(s):

Liyang Diao ◽

Kevin C. Chen

Keyword(s):

Saccharomyces Cerevisiae ◽

Population Structure ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Local Ancestry ◽

Genome Wide

Download Full-text

Genome-Wide Screen for Oxalate-Sensitive Mutants of Saccharomyces cerevisiae

Applied and Environmental Microbiology ◽

10.1128/aem.02843-06 ◽

2007 ◽

Vol 73 (18) ◽

pp. 5919-5927 ◽

Cited By ~ 6

Author(s):

V. Cheng ◽

H. U. Stotz ◽

K. Hippchen ◽

A. T. Bakalinsky

Keyword(s):

Saccharomyces Cerevisiae ◽

Oxalic Acid ◽

Indirect Evidence ◽

Open Reading Frames ◽

Monocarboxylate Transporter ◽

Genome Wide ◽

Self Tolerance ◽

Important Virulence Factor ◽

Endosomal Transport ◽

Yeast Genes

ABSTRACT Oxalic acid is an important virulence factor produced by phytopathogenic filamentous fungi. In order to discover yeast genes whose orthologs in the pathogen may confer self-tolerance and whose plant orthologs may protect the host, a Saccharomyces cerevisiae deletion library consisting of 4,827 haploid mutants harboring deletions in nonessential genes was screened for growth inhibition and survival in a rich medium containing 30 mM oxalic acid at pH 3. A total of 31 mutants were identified that had significantly lower cell yields in oxalate medium than in an oxalate-free medium. About 35% of these mutants had not previously been detected in published screens for sensitivity to sorbic or citric acid. Mutants impaired in endosomal transport, the rgp1Δ, ric1Δ, snf7Δ, vps16Δ, vps20Δ, and vps51Δ mutants, were significantly overrepresented relative to their frequency among all verified yeast open reading frames. Oxalate exposure to a subset of five mutants, the drs2Δ, vps16Δ, vps51Δ, ric1Δ, and rib4Δ mutants, was lethal. With the exception of the rib4Δ mutant, all of these mutants are impaired in vesicle-mediated transport. Indirect evidence is provided suggesting that the sensitivity of the rib4Δ mutant, a riboflavin auxotroph, is due to oxalate-mediated interference with riboflavin uptake by the putative monocarboxylate transporter Mch5.

Download Full-text

Genomic Regions Associated with Virulence in Pyrenophora teres f. teres Identified by Genome-Wide Association Analysis and Biparental Mapping

Phytopathology ◽

10.1094/phyto-10-19-0372-r ◽

2020 ◽

Vol 110 (4) ◽

pp. 881-891 ◽

Cited By ~ 5

Author(s):

Anke Martin ◽

Paula Moolhuijzen ◽

Yongfu Tao ◽

Judy McIlroy ◽

Simon R. Ellwood ◽

...

Keyword(s):

Association Mapping ◽

Population Level ◽

Genome Wide Association ◽

Pyrenophora Teres ◽

Barley Cultivars ◽

Genome Wide ◽

A Genome ◽

Sexual Recombination ◽

Parastagonospora Nodorum ◽

Genomic Regions

Net form net blotch (NFNB), caused by the fungal pathogen Pyrenophora teres f. teres, is an important foliar disease present in all barley-producing regions of the world. This fungus is a hemibiotrophic and heterothallic ascomycete, where sexual recombination can lead to changes in disease expression in the host. Knowledge of the genetic architecture and genes involved in virulence is vital to increase the durability of NFNB resistance in barley cultivars. We used a genome-wide association mapping approach to characterize P. teres f. teres genomic regions associated with virulence in Australian barley cultivars. One hundred eighty-eight P. teres f. teres isolates collected across five Australian states were genotyped using Diversity Arrays Technology sequence markers and phenotyped across 20 different barley genotypes. Association mapping identified 14 different genomic regions associated with virulence, with the majority located on P. teres f. teres chromosomes 3 and 5 and one each present on chromosomes 1, 6, and 9. Four of the regions identified were confirmed by quantitative trait loci (QTL) mapping. The QTL regions are discussed in the context of their genomic architecture together with examination of their gene contents, which identified 20 predicted effectors. The number of QTL shown in this study at the population level clearly illustrates a complex genetic basis of P. teres f. teres virulence compared with pure necrotrophs, such as the wheat pathogens Parastagonospora nodorum and Parastagonospora tritici-repentis.

Download Full-text

Genome-wide association analyses of common wheat (Triticum aestivum L.) germplasm identifies multiple loci for aluminium resistanceThis article is one of a selection of papers from the conference “Exploiting Genome-wide Association in Oilseed Brassicas: a model for genetic improvement of major OECD crops for sustainable farming”

Genome ◽

10.1139/g10-058 ◽

2010 ◽

Vol 53 (11) ◽

pp. 957-966 ◽

Cited By ~ 45

Author(s):

Harsh Raman ◽

Benjamin Stodart ◽

Peter R. Ryan ◽

Emmanuel Delhaize ◽

Livinus Emebiri ◽

...

Keyword(s):

Triticum Aestivum ◽

Association Mapping ◽

Common Wheat ◽

Acid Soils ◽

Triticum Aestivum L ◽

Genome Wide Association ◽

Phenotypic Variance ◽

Association Analyses ◽

Genome Wide ◽

Genomic Regions

Aluminium (Al3+) toxicity restricts productivity and profitability of wheat ( Triticum aestivum L.) crops grown on acid soils worldwide. Continued gains will be obtained by identifying superior alleles and novel Al3+ resistance loci that can be incorporated into breeding programs. We used association mapping to identify genomic regions associated with Al3+ resistance using 1055 accessions of common wheat from different geographic regions of the world and 178 polymorphic diversity arrays technology (DArT) markers. Bayesian analyses based on genetic distance matrices classified these accessions into 12 subgroups. Genome-wide association analyses detected markers that were significantly associated with Al3+ resistance on chromosomes 1A, 1B, 2A, 2B, 2D, 3A, 3B, 4A, 4B, 4D, 5B, 6A, 6B, 7A, and 7B. Some of these genomic regions correspond to previously identified loci for Al3+ resistance, whereas others appear to be novel. Among the markers targeting TaALMT1 (the major Al3+-resistance gene located on chromosome 4D), those that detected alleles in the promoter explained most of the phenotypic variance for Al3+ resistance, which is consistent with this region controlling the level of TaALMT1 expression. These results demonstrate that genome-wide association mapping cannot only confirm known Al3+-resistance loci, such as those on chromsomes 4D and 4B, but they also highlight the utility of this technique in identifying novel resistance loci.

Download Full-text

Evaluating the use of ABBA-BABA statistics to locate introgressed loci

10.1101/001347 ◽

2013 ◽

Cited By ~ 17

Author(s):

Simon H. Martin ◽

John W. Davey ◽

Chris D. Jiggins

Keyword(s):

Population Structure ◽

Population Size ◽

Effective Population Size ◽

Ancestral Population ◽

Whole Genome ◽

Effective Population ◽

Genome Data ◽

Genome Wide ◽

A Genome ◽

Genomic Regions

Several methods have been proposed to test for introgression across genomes. One method tests for a genome-wide excess of shared derived alleles between taxa using Patterson?s D statistic, but does not establish which loci show such an excess or whether the excess is due to introgression or ancestral population structure. Several recent studies have extended the use of D by applying the statistic to small genomic regions, rather than genome-wide. Here, we use simulations and whole genome data from Heliconius butterflies to investigate the behavior of D in small genomic regions. We find that D is unreliable in this situation as it gives inflated values when effective population size is low, causing D outliers to cluster in genomic regions of reduced diversity. As an alternative, we propose a related statistic f̂d, a modified version of a statistic originally developed to estimate the genome-wide fraction of admixture. f̂d is not subject to the same biases as D, and is better at identifying introgressed loci. Finally, we show that both D and f̂d outliers tend to cluster in regions of low absolute divergence (dXY), which can confound a recently proposed test for differentiating introgression from shared ancestral variation at individual loci.

Download Full-text