scholarly journals Population Stratification and Underrepresentation of Indian Subcontinent Genetic Diversity in the 1000 Genomes Project Dataset

2016 ◽  
Vol 8 (11) ◽  
pp. 3460-3470 ◽  
Author(s):  
Dhriti Sengupta ◽  
Ananyo Choudhury ◽  
Analabha Basu ◽  
Michèle Ramsay
PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e12294
Author(s):  
Neeraj Bharti ◽  
Ruma Banerjee ◽  
Archana Achalere ◽  
Sunitha Manjari Kasibhatla ◽  
Rajendra Joshi

Objectives Reliable identification of population-specific variants is important for building the single nucleotide polymorphism (SNP) profile. In this study, genomic variation using allele frequency differences of pharmacologically important genes for Gujarati Indians in Houston (GIH) and Indian Telugu in the U.K. (ITU) from the 1000 Genomes Project vis-à-vis global population data was studied to understand its role in drug response. Methods Joint genotyping approach was used to derive variants of GIH and ITU independently. SNPs of both these populations with significant allele frequency variation (minor allele frequency ≥ 0.05) with super-populations from the 1000 Genomes Project and gnomAD based on Chi-square distribution with p-value of ≤ 0.05 and Bonferroni’s multiple adjustment tests were identified. Population stratification and fixation index analysis was carried out to understand genetic differentiation. Functional annotation of variants was carried out using SnpEff, VEP and CADD score. Results Population stratification of VIP genes revealed four clusters viz., single cluster of GIH and ITU, one cluster each of East Asian, European, African populations and Admixed American was found to be admixed. A total of 13 SNPs belonging to ten pharmacogenes were identified to have significant allele frequency variation in both GIH and ITU populations as compared to one or more super-populations. These SNPs belong to VKORC1 (rs17708472, rs2359612, rs8050894) involved in Vitamin K cycle, cytochrome P450 isoforms CYP2C9 (rs1057910), CYP2B6 (rs3211371), CYP2A2 (rs4646425) and CYP2A4 (rs4646440); ATP-binding cassette (ABC) transporter ABCB1 (rs12720067), DPYD1 (rs12119882, rs56160474) involved in pyrimidine metabolism, methyltransferase COMT (rs9332377) and transcriptional factor NR1I2 (rs6785049). SNPs rs1544410 (VDR), rs2725264 (ABCG2), rs5215 and rs5219 (KCNJ11) share high fixation index (≥ 0.5) with either EAS/AFR populations. Missense variants rs1057910 (CYP2C9), rs1801028 (DRD2) and rs1138272 (GSTP1), rs116855232 (NUDT15); intronic variants rs1131341 (NQO1) and rs115349832 (DPYD) are identified to be ‘deleterious’. Conclusions Analysis of SNPs pertaining to pharmacogenes in GIH and ITU populations using population structure, fixation index and allele frequency variation provides a premise for understanding the role of genetic diversity in drug response in Asian Indians.


2015 ◽  
Vol 32 (9) ◽  
pp. 1366-1372 ◽  
Author(s):  
Dmitry Prokopenko ◽  
Julian Hecker ◽  
Edwin K. Silverman ◽  
Marcello Pagano ◽  
Markus M. Nöthen ◽  
...  

2019 ◽  
Vol 37 (1) ◽  
pp. 2-10 ◽  
Author(s):  
Luke Anderson-Trocmé ◽  
Rick Farouni ◽  
Mathieu Bourgey ◽  
Yoichiro Kamatani ◽  
Koichiro Higasa ◽  
...  

Abstract Recent reports have identified differences in the mutational spectra across human populations. Although some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data are used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower quality data from the early phases of the 1kGP thus continue to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.


Science ◽  
2010 ◽  
Vol 330 (6004) ◽  
pp. 574-575 ◽  
Author(s):  
Elizabeth Pennisi

2019 ◽  
Author(s):  
Luke Anderson-Trocmé ◽  
Rick Farouni ◽  
Mathieu Bourgey ◽  
Yoichiro Kamatani ◽  
Koichiro Higasa ◽  
...  

AbstractRecent reports have identified differences in the mutational spectra across human populations. While some of these reports have been replicated in other cohorts, most have been reported only in the 1000 Genomes Project (1kGP) data. While investigating an intriguing putative population stratification within the Japanese population, we identified a previously unreported batch effect leading to spurious mutation calls in the 1kGP data and to the apparent population stratification. Because the 1kGP data is used extensively, we find that the batch effects also lead to incorrect imputation by leading imputation servers and a small number of suspicious GWAS associations. Lower-quality data from the early phases of the 1kGP thus continues to contaminate modern studies in hidden ways. It may be time to retire or upgrade such legacy sequencing data.


PLoS Genetics ◽  
2013 ◽  
Vol 9 (12) ◽  
pp. e1003959 ◽  
Author(s):  
Carrie B. Moore ◽  
John R. Wallace ◽  
Daniel J. Wolfe ◽  
Alex T. Frase ◽  
Sarah A. Pendergrass ◽  
...  

2021 ◽  
Vol 11 (3) ◽  
pp. 231
Author(s):  
Faven Butler ◽  
Ali Alghubayshi ◽  
Youssef Roman

Gout is an inflammatory condition caused by elevated serum urate (SU), a condition known as hyperuricemia (HU). Genetic variations, including single nucleotide polymorphisms (SNPs), can alter the function of urate transporters, leading to differential HU and gout prevalence across different populations. In the United States (U.S.), gout prevalence differentially affects certain racial groups. The objective of this proposed analysis is to compare the frequency of urate-related genetic risk alleles between Europeans (EUR) and the following major racial groups: Africans in Southwest U.S. (ASW), Han-Chinese (CHS), Japanese (JPT), and Mexican (MXL) from the 1000 Genomes Project. The Ensembl genome browser of the 1000 Genomes Project was used to conduct cross-population allele frequency comparisons of 11 SNPs across 11 genes, physiologically involved and significantly associated with SU levels and gout risk. Gene/SNP pairs included: ABCG2 (rs2231142), SLC2A9 (rs734553), SLC17A1 (rs1183201), SLC16A9 (rs1171614), GCKR (rs1260326), SLC22A11 (rs2078267), SLC22A12 (rs505802), INHBC (rs3741414), RREB1 (rs675209), PDZK1 (rs12129861), and NRXN2 (rs478607). Allele frequencies were compared to EUR using Chi-Square or Fisher’s Exact test, when appropriate. Bonferroni correction for multiple comparisons was used, with p < 0.0045 for statistical significance. Risk alleles were defined as the allele that is associated with baseline or higher HU and gout risks. The cumulative HU or gout risk allele index of the 11 SNPs was estimated for each population. The prevalence of HU and gout in U.S. and non-US populations was evaluated using published epidemiological data and literature review. Compared with EUR, the SNP frequencies of 7/11 in ASW, 9/11 in MXL, 9/11 JPT, and 11/11 CHS were significantly different. HU or gout risk allele indices were 5, 6, 9, and 11 in ASW, MXL, CHS, and JPT, respectively. Out of the 11 SNPs, the percentage of risk alleles in CHS and JPT was 100%. Compared to non-US populations, the prevalence of HU and gout appear to be higher in western world countries. Compared with EUR, CHS and JPT populations had the highest HU or gout risk allele frequencies, followed by MXL and ASW. These results suggest that individuals of Asian descent are at higher HU and gout risk, which may partly explain the nearly three-fold higher gout prevalence among Asians versus Caucasians in ambulatory care settings. Furthermore, gout remains a disease of developed countries with a marked global rising.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Surya Prasad Sharma ◽  
Mirza Ghazanfarullah Ghazi ◽  
Suyash Katdare ◽  
Niladri Dasgupta ◽  
Samrat Mondol ◽  
...  

AbstractThe gharial (Gavialis gangeticus) is a critically endangered crocodylian, endemic to the Indian subcontinent. The species has experienced severe population decline during the twentieth century owing to habitat loss, poaching, and mortalities in passive fishing. Its extant populations have largely recovered through translocation programmes initiated in 1975. Understanding the genetic status of these populations is crucial for evaluating the effectiveness of the ongoing conservation efforts. This study assessed the genetic diversity, population structure, and evidence of genetic bottlenecks of the two managed populations inhabiting the Chambal and Girwa Rivers, which hold nearly 80% of the global gharial populations. We used seven polymorphic nuclear microsatellite loci and a 520 bp partial fragment of the mitochondrial control region (CR). The overall mean allelic richness (Ar) was 2.80 ± 0.40, and the observed (Ho) and expected (He) heterozygosities were 0.40 ± 0.05 and 0.39 ± 0.05, respectively. We observed low levels of genetic differentiation between populations (FST = 0.039, P < 0.05; G’ST = 0.058, P < 0.05 Jost’s D = 0.016, P < 0.05). The bottleneck analysis using the M ratio (Chambal = 0.31 ± 0.06; Girwa = 0.41 ± 0.12) suggested the presence of a genetic bottleneck in both populations. The mitochondrial CR also showed a low level of variation, with two haplotypes observed in the Girwa population. This study highlights the low level of genetic diversity in the two largest managed gharial populations in the wild. Hence, it is recommended to assess the genetic status of extant wild and captive gharial populations for planning future translocation programmes to ensure long-term survival in the wild.


Sign in / Sign up

Export Citation Format

Share Document