scholarly journals Pooled population resequencing of clam shrimp (Eulimnadia texana) from different vernal pools reveals signatures of local adaptation

2017 ◽  
Author(s):  
James G. Baldwin-Brown ◽  
Anthony D. Long

AbstractVernal pool clam shrimp (Eulimnadia texana) are a promising model due to ease of culturing, short generation time, modest genome size, and obligate desiccated diapaused eggs. We collected Illumina data (Poolseq) from eleven pooled wild vernal pool clam shrimp populations. We hypothesized that restricted gene flow between vernal pools, separated by distances of 0.36 to 253 km, in concert with Poolseq data from each population, could be used to identify genes important in local adaptation. We adapted Bayenv2 to genome-wide Poolseq data and detected thirteen genomic regions showing a strong excess of population subdivision relative to a genome-wide background. We identified a set of regions that appear to be significantly diverged in allele frequency, above what is expected based on the relationships amongst the populations. Regions identified as significant were on average 9.5 kb in size and harbored 3.8 genes. We attempted to identify correlations between allele frequencies at each genomic region and environmental variables that may influence local adaptation in the sequences populations, but found that there were too many confounding environmental variables to draw strong conclusions. One such genomic region harbored an ortholog of Drosophila melanogaster CG10413, a gene predicted to have sodium/potassium/chloride activity. Finally, we demonstrate that the identified regions could not have been found with less powerful statistics, i.e. FST, or with a less contiguous genome assembly.

2020 ◽  
Vol 12 (7) ◽  
pp. 1194-1206
Author(s):  
James G Baldwin-Brown ◽  
Anthony D Long

Abstract Vernal pools are unique in their isolation and the strong selection acting on their resident species. Vernal pool clam shrimp (Eulimnadia texana) are a promising model due to ease of culturing, short generation time, small genomes, and obligate desiccated diapaused eggs. Clam shrimp are also androdioecious (sexes include males and hermaphrodites), and here we use population-scaled recombination rates to support the hypothesis that the heterogametic sex is recombination free in these shrimp. We collected short-read sequence data from pooled samples from different vernal pools to gain insights into local adaptation. We identify genomic regions in which some populations have allele frequencies that differ significantly from the metapopulation. BayPass (Gautier M. 2015. Genome-wide scan for adaptive divergence and association with population-specific covariates. Genetics 201(4):1555–1579.) detected 19 such genomic regions showing an excess of population subdivision. These regions on average are 550 bp in size and had 2.5 genes within 5 kb of them. Genes located near these regions are involved in Malpighian tubule function and osmoregulation, an essential function in vernal pools. It is likely that salinity profiles vary between pools and over time, and variants at these genes are adapted to local salinity conditions.


Agronomy ◽  
2020 ◽  
Vol 11 (1) ◽  
pp. 27
Author(s):  
Archana Khadgi ◽  
Courtney A. Weber

Red raspberry (Rubus idaeus L.) is an expanding high-value berry crop worldwide. The presence of prickles, outgrowths of epidermal tissues lacking vasculature, on the canes, petioles, and undersides of leaves complicates both field management and harvest. The utilization of cultivars with fewer prickles or prickle-free canes simplifies production. A previously generated population segregating for prickles utilizing the s locus between the prickle-free cultivar Joan J (ss) and the prickled cultivar Caroline (Ss) was analyzed to identify the genomic region associated with prickle development in red raspberry. Genotype by sequencing (GBS) was combined with a genome-wide association study (GWAS) using fixed and random model circulating probability unification (FarmCPU) to analyze 8474 single nucleotide polymorphisms (SNPs) and identify significant markers associated with the prickle-free trait. A total of four SNPs were identified on chromosome 4 that were associated with the phenotype and were located near or in annotated genes. This study demonstrates how association genetics can be used to decipher the genetic control of important horticultural traits in Rubus, and provides valuable information about the genomic region and potential genes underlying the prickle-free trait.


Genes ◽  
2021 ◽  
Vol 12 (7) ◽  
pp. 1065
Author(s):  
Reinhard Mischke ◽  
Julia Metzger ◽  
Ottmar Distl

Congenital fibrinogen disorders are very rare in dogs. Cases of afibrinogenemia have been reported in Bernese Mountain, Bichon Frise, Cocker Spaniel, Collie, Lhasa Apso, Viszla, and St. Bernard dogs. In the present study, we examined four miniature wire-haired Dachshunds with afibrinogenemia and ascertained their pedigree. Homozygosity mapping and a genome-wide association study identified a candidate genomic region at 50,188,932–64,187,680 bp on CFA15 harboring FGB (fibrinogen beta chain), FGA (fibrinogen alpha chain), and FGG (fibrinogen gamma-B chain). Sanger sequencing of all three fibrinogen genes in two cases and validation of the FGA-associated mutation (FGA:g.6296delT, NC_006597.3:g.52240694delA, rs1152388481) in pedigree members showed a perfect co-segregation with afibrinogenemia-affected phenotypes, obligate carriers, and healthy animals. In addition, the rs1152388481 variant was validated in 393 Dachshunds and samples from 33 other dog breeds. The rs1152388481 variant is predicted to modify the protein sequence of both FGA transcripts (FGA201:p.Ile486Met and FGA-202:p.Ile555Met) leading to proteins truncated by 306 amino acids. The present data provide evidence for a novel FGA truncating frameshift mutation that is very likely to explain the cases of severe bleeding due to afibrinogenemia in a Dachshund family. This mutation has already been spread in Dachshunds through carriers before cases were ascertained. Genetic testing allows selective breeding to prevent afibrinogenemia-affected puppies in the future.


2022 ◽  
Author(s):  
Tiago da Silva Ribeiro ◽  
José A Galván ◽  
John E Pool

Local adaptation can lead to elevated genetic differentiation at the targeted genetic variant and nearby sites. Selective sweeps come in different forms, and depending on the initial and final frequencies of a favored variant, very different patterns of genetic variation may be produced. If local selection favors an existing variant that had already recombined onto multiple genetic backgrounds, then the width of elevated genetic differentiation (high FST) may be too narrow to detect using a typical windowed genome scan, even if the targeted variant becomes highly differentiated. We therefore used a simulation approach to investigate the power of SNP-level FST (specifically, the maximum SNP FST value within a window) to detect diverse scenarios of local adaptation, and compared it against whole-window FST and the Comparative Haplotype Identity statistic. We found that SNP FST had superior power to detect complete or mostly complete soft sweeps, but lesser power than window-wide statistics to detect partial hard sweeps. To investigate the relative enrichment and nature of SNP FST outliers from real data, we applied the two FST statistics to a panel of Drosophila melanogaster populations. We found that SNP FST had a genome-wide enrichment of outliers compared to demographic expectations, and though it yielded a lesser enrichment than window FST, it detected mostly unique outlier genes and functional categories. Our results suggest that SNP FST is highly complementary to typical window-based approaches for detecting local adaptation, and merits inclusion in future genome scans and methodologies.


2014 ◽  
Vol 23 (03) ◽  
pp. 1460008
Author(s):  
Kevin Byron ◽  
Jason T. L. Wang ◽  
Dongrong Wen

Developing effective artificial intelligence tools to find motifs in DNA, RNA and proteins poses a challenging yet important problem in life science research. In this paper, we present a computational approach for finding RNA tertiary motifs in genomic sequences. Specifically, we predict genomic coordinate locations for coaxial helical stackings in 3-way RNA junctions. These predictions are provided by our tertiary motif search package, named CSminer, which utilizes two versatile methodologies: random forests and covariance models. A coaxial helical stacking tertiary motif occurs in a 3-way RNA junction where two separate helical elements form a pseudocontiguous helix and provide thermodynamic stability to the RNA molecule as a whole. Our CSminer tool first uses a genome-wide search method based on covariance models to find a genomic region that may potentially contain a coaxial helical stacking tertiary motif. CSminer then uses a random forests classifier to predict whether the genomic region indeed contains the tertiary motif. Experimental results demonstrate the effectiveness of our approach.


mSphere ◽  
2019 ◽  
Vol 4 (2) ◽  
Author(s):  
Alanna Cera ◽  
Maria K. Holganza ◽  
Ahmad Abu Hardan ◽  
Irvin Gamarra ◽  
Reem S. Eldabagh ◽  
...  

ABSTRACT Balancing gene expression is a fundamental challenge of all cell types. To properly regulate transcription on a genome-wide level, there are myriad mechanisms employed by the cell. One layer to this regulation is through spatial positioning, with particular chromosomal loci exerting an influence on transcription throughout a region. Many coregulated gene families utilize spatial positioning to coordinate transcription, with functionally related genes clustering together which can allow coordinated expression via adjacent gene coregulation. The mechanisms underlying this process have not been elucidated, though there are many coregulated gene families that exhibit this genomic distribution. In the present study, we tested for a role for the enhancer-promoter (EP) hypothesis, which demonstrates that regulatory elements can exert transcriptional effects over a broad distance, in coordinating transcriptional coregulation using budding yeast, Saccharomyces cerevisiae. We empirically validated the EP model, finding that the genomic distance a promoter can affect varies by locus, which can profoundly affect levels of transcription, phenotype, and the extent of transcriptional disruption throughout a genomic region. Using the nitrogen metabolism, ribosomal protein, toxin response, and heat shock gene families as our test case, we report functionally clustered genes localize to genomic loci that are more conducive to transcriptional regulation at a distance compared to the unpaired members of the same families. Furthermore, we report that the coregulation of functional clusters is dependent, in part, on chromatin maintenance and remodeling, providing one mechanism underlying adjacent gene coregulation. IMPORTANCE The two-dimensional, physical positioning of genes along a chromosome can impact proper transcriptional regulation throughout a genomic region. The transcription of neighboring genes is correlated in a genome-wide manner, which is a characteristic of eukaryotes. Many coregulated gene families can be found clustered with another member of the same set—which can result in adjacent gene coregulation of the pair. Due to the myriad gene families that exhibit a nonrandom genomic distribution, there are likely multiple mechanisms working in concert to properly regulate transcriptional coordination of functionally clustered genes. In this study, we utilized budding yeast in an attempt to elucidate mechanisms that underlie this coregulation: testing and empirically validating the enhancer-promoter hypothesis in this species and reporting that functionally related genes cluster to genomic regions that are more conducive to transcriptional regulation at a distance. These clusters rely, in part, on chromatin maintenance and remodelers to maintain proper transcriptional coordination. Our work provides insight into the mechanisms underlying adjacent gene coregulation.


2016 ◽  
Vol 6 (1) ◽  
Author(s):  
Xue Ren ◽  
Guang-Li Yang ◽  
Wei-Feng Peng ◽  
Yong-Xin Zhao ◽  
Min Zhang ◽  
...  

Abstract Horns are a cranial appendage found exclusively in Bovidae, and play important roles in accessing resources and mates. In sheep (Ovies aries), horns vary from polled to six-horned, and human have been selecting polled animals in farming and breeding. Here, we conducted a genome-wide association study on 24 two-horned versus 22 four-horned phenotypes in a native Chinese breed of Sishui Fur sheep. Together with linkage disequilibrium (LD) analyses and haplotype-based association tests, we identified a genomic region comprising 132.0–133.1 Mb on chromosome 2 that contained the top 10 SNPs (including 4 significant SNPs) and 5 most significant haplotypes associated with the polycerate phenotype. In humans and mice, this genomic region contains the HOXD gene cluster and adjacent functional genes EVX2 and KIAA1715, which have a close association with the formation of limbs and genital buds. Our results provide new insights into the genetic basis underlying variable numbers of horns and represent a new resource for use in sheep genetics and breeding.


BMC Genomics ◽  
2019 ◽  
Vol 20 (1) ◽  
Author(s):  
Mário Vicente ◽  
Edita Priehodová ◽  
Issa Diallo ◽  
Eliška Podgorná ◽  
Estella S. Poloni ◽  
...  

Abstract Background Human population history in the Holocene was profoundly impacted by changes in lifestyle following the invention and adoption of food-production practices. These changes triggered significant increases in population sizes and expansions over large distances. Here we investigate the population history of the Fulani, a pastoral population extending throughout the African Sahel/Savannah belt. Results Based on genome-wide analyses we propose that ancestors of the Fulani population experienced admixture between a West African group and a group carrying both European and North African ancestries. This admixture was likely coupled with newly adopted herding practices, as it resulted in signatures of genetic adaptation in contemporary Fulani genomes, including the control element of the LCT gene enabling carriers to digest lactose throughout their lives. The lactase persistence (LP) trait in the Fulani is conferred by the presence of the allele T-13910, which is also present at high frequencies in Europe. We establish that the T-13910 LP allele in Fulani individuals analysed in this study lies on a European haplotype background thus excluding parallel convergent evolution. We furthermore directly link the T-13910 haplotype with the Lactase Persistence phenotype through a Genome Wide Association study (GWAS) and identify another genomic region in the vicinity of the SPRY2 gene associated with glycaemic measurements after lactose intake. Conclusions Our findings suggest that Eurasian admixture and the European LP allele was introduced into the Fulani through contact with a North African population/s. We furthermore confirm the link between the lactose digestion phenotype in the Fulani to the MCM6/LCT locus by reporting the first GWAS of the lactase persistence trait. We also explored other signals of recent adaptation in the Fulani and identified additional candidates for selection to adapt to herding life-styles.


2021 ◽  
Vol 11 (4) ◽  
Author(s):  
Laura Duran-Lozano ◽  
Gudmar Thorleifsson ◽  
Aitzkoa Lopez de Lapuente Portilla ◽  
Abhishek Niroula ◽  
Molly Went ◽  
...  

AbstractMultiple myeloma (MM) is caused by the uncontrolled, clonal expansion of plasma cells. While there is epidemiological evidence for inherited susceptibility, the molecular basis remains incompletely understood. We report a genome-wide association study totalling 5,320 cases and 422,289 controls from four Nordic populations, and find a novel MM risk variant at SOHLH2 at 13q13.3 (risk allele frequency = 3.5%; odds ratio = 1.38; P = 2.2 × 10−14). This gene encodes a transcription factor involved in gametogenesis that is normally only weakly expressed in plasma cells. The association is represented by 14 variants in linkage disequilibrium. Among these, rs75712673 maps to a genomic region with open chromatin in plasma cells, and upregulates SOHLH2 in this cell type. Moreover, rs75712673 influences transcriptional activity in luciferase assays, and shows a chromatin looping interaction with the SOHLH2 promoter. Our work provides novel insight into MM susceptibility.


Author(s):  
Hamid Alinejad-Rokny ◽  
Rassa Ghavami ◽  
Hamid R. Rabiee ◽  
Narges Rezaei ◽  
Kin Tung Tam ◽  
...  

AbstractHi-C is a genome-wide chromosome conformation capture technology that detects interactions between pairs of genomic regions, and exploits higher order chromatin structures. Conceptually Hi-C data counts interaction frequencies between every position in the genome and every other position. Biologically functional interactions are expected to occur more frequently than random (background) interactions. To identify biologically relevant interactions, several background models that take biases such as distance, GC content and mappability into account have been proposed. Here we introduce MaxHiC, a background correction tool that deals with these complex biases and robustly identifies statistically significant interactions in both Hi-C and capture Hi-C experiments. MaxHiC uses a negative binomial distribution model and a maximum likelihood technique to correct biases in both Hi-C and capture Hi-C libraries. We systematically benchmark MaxHiC against major Hi-C background correction tools and demonstrate using published Hi-C and capture Hi-C datasets that 1) Interacting regions identified by MaxHiC have significantly greater levels of overlap with known regulatory features (e.g. active chromatin histone marks, CTCF binding sites, DNase sensitivity) and also disease-associated genome-wide association SNPs than those identified by currently existing models, and 2) the pairs of interacting regions are more likely to be linked by eQTL pairs and more likely to identify known enhancer-promoter pairs than any of the existing methods. We also demonstrate that interactions between different genomic region types have distinct distance distribution only revealed by MaxHiC. MaxHiC is publicly available as a python package for the analysis of Hi-C and capture Hi-C data.


Sign in / Sign up

Export Citation Format

Share Document