scholarly journals Deciphering the genic basis of environmental adaptations by simultaneous forward and reverse genetics in Saccharomyces cerevisiae

2016 ◽  
Author(s):  
Calum J. Maclean ◽  
Brian P.H. Metzger ◽  
Jian-Rong Yang ◽  
Wei-Chin Ho ◽  
Bryan Moyers ◽  
...  

ABSTRACTThe budding yeast Saccharomyces cerevisiae is the best studied eukaryote in molecular and cell biology, but its utility for understanding the genetic basis of natural phenotypic variation is limited by the inefficiency of association mapping owing to strong and complex population structure. To facilitate association mapping, we analyzed 190 high-quality genomes of diverse strains, including 85 newly sequenced ones, to uncover yeast’s population structure that varies substantially among genomic regions. We identified 181 yeast genes that are absent from the reference genome and demonstrated their expression and role in important functions such as drug resistance. We then simultaneously measured the growth rates of over 4500 lab strains each deficient of a nonessential gene and 81 natural strains across multiple environments using unique DNA barcode present in each strain. We combined the genome-wide reverse genetic information with genome-wide association analysis to determine potential genomic regions of importance to environmental adaptations, and for a subset experimentally validated their role by reciprocal hemizygosity tests. The resources provided permit efficient and reliable association mapping in yeast and significantly enhances its value as a model for understanding the genetic mechanisms of phenotypic polymorphism and evolution.

2017 ◽  
Vol 37 (11) ◽  
Author(s):  
Hua Chen ◽  
Kassa Semagn ◽  
Muhammad Iqbal ◽  
Neshat Pazooki Moakhar ◽  
Teketel Haile ◽  
...  

2017 ◽  
Author(s):  
Haohan Wang ◽  
Xiang Liu ◽  
Yunpeng Xiao ◽  
Ming Xu ◽  
Eric P. Xing

AbstractGenome-wide Association Study has presented a promising way to understand the association between human genomes and complex traits. Many simple polymorphic loci have been shown to explain a significant fraction of phenotypic variability. However, challenges remain in the non-triviality of explaining complex traits associated with multifactorial genetic loci, especially considering the confounding factors caused by population structure, family structure, and cryptic relatedness. In this paper, we propose a Squared-LMM (LMM2) model, aiming to jointly correct population and genetic confounding factors. We offer two strategies of utilizing LMM2 for association mapping: 1) It serves as an extension of univariate LMM, which could effectively correct population structure, but consider each SNP in isolation. 2) It is integrated with the multivariate regression model to discover association relationship between complex traits and multifactorial genetic loci. We refer to this second model as sparse Squared-LMM (sLMM2). Further, we extend LMM2/sLMM2 by raising the power of our squared model to the LMMn/sLMMn model. We demonstrate the practical use of our model with synthetic phenotypic variants generated from genetic loci of Arabidopsis Thaliana. The experiment shows that our method achieves a more accurate and significant prediction on the association relationship between traits and loci. We also evaluate our models on collected phenotypes and genotypes with the number of candidate genes that the models could discover. The results suggest the potential and promising usage of our method in genome-wide association studies.


2019 ◽  
Author(s):  
Jaime Osorio ◽  
Gina Garzón ◽  
Paola Delgadillo ◽  
Silvio Bastidas ◽  
Leidy Moreno ◽  
...  

Abstract Background The genus Elaeis has two species of economic importance for the oil palm agroindustry: Elaeis oleifera (O), native to the Americas, and Elaeis guineensis (G), native to Africa. This work provides to our knowledge, the first association mapping study in an interspecific OxG oil palm population, which shows tolerance to pests and diseases, high oil quality, and acceptable fruit bunch production. Results Using genotyping-by-sequencing (GBS), we identified a total of 3,776 single nucleotide polymorphisms (SNPs) that were used to perform a genome-wide association analysis (GWAS) in 378 OxG hybrid population for 10 agronomic traits. Twelve genomic regions (SNPs) were located near candidate genes implicated in multiple functional categories, such as tissue growth, cellular trafficking, and physiological processes. Conclusions We provide new insights on genomic regions that mapped on candidate genes involved in plant architecture and yield. These potential candidate genes need to be confirmed for future targeted functional analyses. Associated markers to the traits of interest may be valuable resources for the development of marker-assisted selection in oil palm breeding. Keywords: Association mapping, Elaeis guineensis , Elaeis oleifera , genotyping-by-sequencing, plant architecture, yield.


2020 ◽  
Author(s):  
Qilin Chen ◽  
Gary Peng ◽  
Randy Kutcher ◽  
Fengqun Yu

Abstract Background: Leptosphaeria maculans is a serious concern for canola production in Canada. For effective management, knowledge of the pathogen’s genetic variability and population structure is a prerequisite. Despite some information on race dynamics of the western Canadian L. maculans population in recent years, genetic diversity based on a large number of genome-wide DNA variants has not been investigated.Results: From 1,590 L. maculans isolates collected from 23 field sites in three provinces: Manitoba, Saskatchewan and Alberta, Canada, in the years 2007-2008 and 2012-2014, 150 representative isolates were selected and whole-genome sequenced, and 31,870 polymorphic DNA variants (SNPs and InDels) were used to study L. maculans genetic diversity and population structure. Cluster analysis showed that the genetic diversity levels and isolate groupings varied with the number and genomic regions of the variants involved; isolates collected in 2012-2014 were more genetically diverse than those collected in 2007-2008 when genome-wide variants were considered. The genome wide association study (GWAS) detected variants in egn4_Lema_T86290 (AvrLm4-7), egn4_Lema_T86300 and egn4_Lema_T86310 associated with the year of collection, but no variants was found to be associated with the province or specific location from which the isolates were collected. Population structure analysis indicated the presence of three distinct sub-populations in western Canada. While isolates from Saskatchewan were mainly of one sub-population (sub-pop1), the Alberta isolates comprised two sub-populations (sub-pop1 and sub-pop2), and all the 3 subpopulations were found in Manitoba.Conclusion: The genetic diversity of the western Canadian L. maculans population varied among provinces. It was highly admixed in Manitoba, followed by that in Alberta. The Saskatchewan population had the lowest genetic diversity. Significant genome variation between 2007-2008 and 2012-2014 occurred in the genes egn4_Lema_T86290 (AvrLm4-7), egn4_Lema_T86300 and egn4_Lema_T86310), with AvrLm4-7 becoming much more common in the L. maculans population in the later period.


2018 ◽  
Author(s):  
Joelle Mbatchou ◽  
Mark Abney ◽  
Mary Sara McPeek

AbstractIn genetic association analysis of complex traits, permutation testing can be a valuable tool for assessing significance when the asymptotic distribution of the test statistic is unknown or not well-approximated. This commonly arises when the association test statistic is itself a function of multiple correlated statistics. e.g, in tests of gene-set, pathway or genome-wide significance, as well as omnibus tests that combine test statistics that perform well in different scenarios. For genetic association testing in samples with population structure and/or relatedness, use of naive permutation can lead to inflated type 1 error. To address this in quantitative traits, the MVNpermute method was developed. However, for association mapping of a binary trait, the relationship between the mean and variance makes both naive permutation and the MVNpermute method invalid. We propose BRASS, a permutation method for binary trait association mapping in samples that have related individuals and/or population structure. BRASS allows for covariates ascertainment andsimultaneous testing of multiple markers, and it accommodates a wide range of test statistics. We use an estimating equation approach that can be viewed as a hybrid of logistic regression and linear mixed-effects model methods, and we use a combination of principal components and a genetic relatedness matrix to account for sample structure. We show in simulation studies that BRASS maintains correct control of type 1 error in a range of scenarios that include population structure, familial relatedness, ascertainment and phenotype model misspecification. In these settings, only BRASS maintains correct control of type 1 error, performing far better than all other methods. We apply BRASS to two genome-wide analyses in domestic dog, one for elbow dysplasia (ED) in 82 breeds and another for idiopathic epilepsy (IE) in the Irish Wolfhound breed. We detect significant association of IE with SNPs in a previously-identified chromosome 4 region that contains multiple candidate genes.Author summaryPermutation testing is commonly used when distributional assumptions cannot be made or do not apply, or when performing a multiple testing correction, e.g., to assess region-wide or genome-wide significance in association mapping studies. Naively permuting the data is only valid under the assumption of exchangeability, which, in the presence of sample structure and polygenicity, typically does not hold. Linear mixed-model based approaches have been proposed for permutation-based tests with continuous traits that can also adjust for sample structure; however, these may not remain valid when applied to binary traits, as key features of binary data are not well accounted for. We propose BRASS, a permutation-based testing method for binary data that incorporates important characteristics of binary data in the trait model, can accommodate relevant covariates and ascertainment, and adjusts for the presence of structure in the sample. We demonstrate the use of this approach in the context of correcting for multiple testing in two genome-wide association studies in domestic dog: one for elbow dysplasia and one for idiopathic epilepsy.


2007 ◽  
Vol 73 (18) ◽  
pp. 5919-5927 ◽  
Author(s):  
V. Cheng ◽  
H. U. Stotz ◽  
K. Hippchen ◽  
A. T. Bakalinsky

ABSTRACT Oxalic acid is an important virulence factor produced by phytopathogenic filamentous fungi. In order to discover yeast genes whose orthologs in the pathogen may confer self-tolerance and whose plant orthologs may protect the host, a Saccharomyces cerevisiae deletion library consisting of 4,827 haploid mutants harboring deletions in nonessential genes was screened for growth inhibition and survival in a rich medium containing 30 mM oxalic acid at pH 3. A total of 31 mutants were identified that had significantly lower cell yields in oxalate medium than in an oxalate-free medium. About 35% of these mutants had not previously been detected in published screens for sensitivity to sorbic or citric acid. Mutants impaired in endosomal transport, the rgp1Δ, ric1Δ, snf7Δ, vps16Δ, vps20Δ, and vps51Δ mutants, were significantly overrepresented relative to their frequency among all verified yeast open reading frames. Oxalate exposure to a subset of five mutants, the drs2Δ, vps16Δ, vps51Δ, ric1Δ, and rib4Δ mutants, was lethal. With the exception of the rib4Δ mutant, all of these mutants are impaired in vesicle-mediated transport. Indirect evidence is provided suggesting that the sensitivity of the rib4Δ mutant, a riboflavin auxotroph, is due to oxalate-mediated interference with riboflavin uptake by the putative monocarboxylate transporter Mch5.


2020 ◽  
Vol 110 (4) ◽  
pp. 881-891 ◽  
Author(s):  
Anke Martin ◽  
Paula Moolhuijzen ◽  
Yongfu Tao ◽  
Judy McIlroy ◽  
Simon R. Ellwood ◽  
...  

Net form net blotch (NFNB), caused by the fungal pathogen Pyrenophora teres f. teres, is an important foliar disease present in all barley-producing regions of the world. This fungus is a hemibiotrophic and heterothallic ascomycete, where sexual recombination can lead to changes in disease expression in the host. Knowledge of the genetic architecture and genes involved in virulence is vital to increase the durability of NFNB resistance in barley cultivars. We used a genome-wide association mapping approach to characterize P. teres f. teres genomic regions associated with virulence in Australian barley cultivars. One hundred eighty-eight P. teres f. teres isolates collected across five Australian states were genotyped using Diversity Arrays Technology sequence markers and phenotyped across 20 different barley genotypes. Association mapping identified 14 different genomic regions associated with virulence, with the majority located on P. teres f. teres chromosomes 3 and 5 and one each present on chromosomes 1, 6, and 9. Four of the regions identified were confirmed by quantitative trait loci (QTL) mapping. The QTL regions are discussed in the context of their genomic architecture together with examination of their gene contents, which identified 20 predicted effectors. The number of QTL shown in this study at the population level clearly illustrates a complex genetic basis of P. teres f. teres virulence compared with pure necrotrophs, such as the wheat pathogens Parastagonospora nodorum and Parastagonospora tritici-repentis.


Genome ◽  
2010 ◽  
Vol 53 (11) ◽  
pp. 957-966 ◽  
Author(s):  
Harsh Raman ◽  
Benjamin Stodart ◽  
Peter R. Ryan ◽  
Emmanuel Delhaize ◽  
Livinus Emebiri ◽  
...  

Aluminium (Al3+) toxicity restricts productivity and profitability of wheat ( Triticum aestivum L.) crops grown on acid soils worldwide. Continued gains will be obtained by identifying superior alleles and novel Al3+ resistance loci that can be incorporated into breeding programs. We used association mapping to identify genomic regions associated with Al3+ resistance using 1055 accessions of common wheat from different geographic regions of the world and 178 polymorphic diversity arrays technology (DArT) markers. Bayesian analyses based on genetic distance matrices classified these accessions into 12 subgroups. Genome-wide association analyses detected markers that were significantly associated with Al3+ resistance on chromosomes 1A, 1B, 2A, 2B, 2D, 3A, 3B, 4A, 4B, 4D, 5B, 6A, 6B, 7A, and 7B. Some of these genomic regions correspond to previously identified loci for Al3+ resistance, whereas others appear to be novel. Among the markers targeting TaALMT1 (the major Al3+-resistance gene located on chromosome 4D), those that detected alleles in the promoter explained most of the phenotypic variance for Al3+ resistance, which is consistent with this region controlling the level of TaALMT1 expression. These results demonstrate that genome-wide association mapping cannot only confirm known Al3+-resistance loci, such as those on chromsomes 4D and 4B, but they also highlight the utility of this technique in identifying novel resistance loci.


2013 ◽  
Author(s):  
Simon H. Martin ◽  
John W. Davey ◽  
Chris D. Jiggins

Several methods have been proposed to test for introgression across genomes. One method tests for a genome-wide excess of shared derived alleles between taxa using Patterson?s D statistic, but does not establish which loci show such an excess or whether the excess is due to introgression or ancestral population structure. Several recent studies have extended the use of D by applying the statistic to small genomic regions, rather than genome-wide. Here, we use simulations and whole genome data from Heliconius butterflies to investigate the behavior of D in small genomic regions. We find that D is unreliable in this situation as it gives inflated values when effective population size is low, causing D outliers to cluster in genomic regions of reduced diversity. As an alternative, we propose a related statistic f̂d, a modified version of a statistic originally developed to estimate the genome-wide fraction of admixture. f̂d is not subject to the same biases as D, and is better at identifying introgressed loci. Finally, we show that both D and f̂d outliers tend to cluster in regions of low absolute divergence (dXY), which can confound a recently proposed test for differentiating introgression from shared ancestral variation at individual loci.


Sign in / Sign up

Export Citation Format

Share Document