scholarly journals A phylogenetic transform enhances analysis of compositional microbiota data

2016 ◽  
Author(s):  
Justin D Silverman ◽  
Alex Washburne ◽  
Sayan Mukherjee ◽  
Lawrence A David

ABSTRACTHigh-throughput DNA sequencing technologies have revolutionized the study of microbial communities (microbiota) and have revealed their importance in both human health and disease. However, due to technical limitations, data from microbiota surveys reflect the relative abundance of bacterial taxa and not their absolute levels. It is well known that applying common statistical methods, such as correlation or hypothesis testing, to relative abundance data can lead to spurious results. Here, we introduce the PhILR transform, a data transform that utilizes microbial phylogenetic information. This transform enables off-the-shelf statistical tools to be applied to microbiota surveys free from artifacts usually associated with analysis of relative abundance data. Using environmental and human-associated microbial community datasets as benchmarks, we find that the PhILR transform significantly improves the performance of distance-based and machine learning-based statistics, boosting the accuracy of widely used algorithms on reference benchmarks by 90%. Because the PhILR transform relies on bacterial phylogenies, statistics applied in the PhILR coordinate system are also framed within an evolutionary perspective. Regression on PhILR transformed human microbiota data identified evolutionarily neighboring bacterial clades that may have differentiated to adapt to distinct body sites. Variance statistics showed that the degree of covariation of bacterial clades across human body sites tended to increase with phylogenetic relatedness between clades. These findings support the hypothesis that environmental selection, not competition between bacteria, plays a dominant role in structuring human-associated microbial communities.

eLife ◽  
2017 ◽  
Vol 6 ◽  
Author(s):  
Justin D Silverman ◽  
Alex D Washburne ◽  
Sayan Mukherjee ◽  
Lawrence A David

Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities.


Microbiome ◽  
2021 ◽  
Vol 9 (1) ◽  
Author(s):  
Antonio Reverter ◽  
Maria Ballester ◽  
Pamela A. Alexandre ◽  
Emilio Mármol-Sánchez ◽  
Antoni Dalmau ◽  
...  

Abstract Background Analyses of gut microbiome composition in livestock species have shown its potential to contribute to the regulation of complex phenotypes. However, little is known about the host genetic control over the gut microbial communities. In pigs, previous studies are based on classical “single-gene-single-trait” approaches and have evaluated the role of host genome controlling gut prokaryote and eukaryote communities separately. Results In order to determine the ability of the host genome to control the diversity and composition of microbial communities in healthy pigs, we undertook genome-wide association studies (GWAS) for 39 microbial phenotypes that included 2 diversity indexes, and the relative abundance of 31 bacterial and six commensal protist genera in 390 pigs genotyped for 70 K SNPs. The GWAS results were processed through a 3-step analytical pipeline comprised of (1) association weight matrix; (2) regulatory impact factor; and (3) partial correlation and information theory. The inferred gene regulatory network comprised 3561 genes (within a 5 kb distance from a relevant SNP–P < 0.05) and 738,913 connections (SNP-to-SNP co-associations). Our findings highlight the complexity and polygenic nature of the pig gut microbial ecosystem. Prominent within the network were 5 regulators, PRDM15, STAT1, ssc-mir-371, SOX9 and RUNX2 which gathered 942, 607, 588, 284 and 273 connections, respectively. PRDM15 modulates the transcription of upstream regulators of WNT and MAPK-ERK signaling to safeguard naive pluripotency and regulates the production of Th1- and Th2-type immune response. The signal transducer STAT1 has long been associated with immune processes and was recently identified as a potential regulator of vaccine response to porcine reproductive and respiratory syndrome. The list of regulators was enriched for immune-related pathways, and the list of predicted targets includes candidate genes previously reported as associated with microbiota profile in pigs, mice and human, such as SLIT3, SLC39A8, NOS1, IL1R2, DAB1, TOX3, SPP1, THSD7B, ELF2, PIANP, A2ML1, and IFNAR1. Moreover, we show the existence of host-genetic variants jointly associated with the relative abundance of butyrate producer bacteria and host performance. Conclusions Taken together, our results identified regulators, candidate genes, and mechanisms linked with microbiome modulation by the host. They further highlight the value of the proposed analytical pipeline to exploit pleiotropy and the crosstalk between bacteria and protists as significant contributors to host-microbiome interactions and identify genetic markers and candidate genes that can be incorporated in breeding program to improve host-performance and microbial traits.


Genes ◽  
2021 ◽  
Vol 12 (1) ◽  
pp. 65
Author(s):  
Patricia Haug ◽  
Samuel Koller ◽  
Jordi Maggi ◽  
Elena Lang ◽  
Silke Feil ◽  
...  

Coloboma and microphthalmia (C/M) are related congenital eye malformations, which can cause significant visual impairment. Molecular diagnosis is challenging as the genes associated to date with C/M account for only a small percentage of cases. Overall, the genetic cause remains unknown in up to 80% of patients. High throughput DNA sequencing technologies, including whole-exome sequencing (WES), are therefore a useful and efficient tool for genetic screening and identification of new mutations and novel genes in C/M. In this study, we analyzed the DNA of 19 patients with C/M from 15 unrelated families using singleton WES and data analysis for 307 genes of interest. We identified seven novel and one recurrent potentially disease-causing variants in CRIM1, CHD7, FAT1, PTCH1, PUF60, BRPF1, and TGFB2 in 47% of our families, three of which occurred de novo. The detection rate in patients with ocular and extraocular manifestations (67%) was higher than in patients with an isolated ocular phenotype (46%). Our study highlights the significant genetic heterogeneity in C/M cohorts and emphasizes the diagnostic power of WES for the screening of patients and families with C/M.


2012 ◽  
Vol 2012 ◽  
pp. 1-18 ◽  
Author(s):  
Silvio Garofalo ◽  
Marisa Cornacchione ◽  
Alfonso Di Costanzo

The introduction of DNA microarrays and DNA sequencing technologies in medical genetics and diagnostics has been a challenge that has significantly transformed medical practice and patient management. Because of the great advancements in molecular genetics and the development of simple laboratory technology to identify the mutations in the causative genes, also the diagnostic approach to epilepsy has significantly changed. However, the clinical use of molecular cytogenetics and high-throughput DNA sequencing technologies, which are able to test an entire genome for genetic variants that are associated with the disease, is preparing a further revolution in the near future. Molecular Karyotype and Next-Generation Sequencing have the potential to identify causative genes or loci also in sporadic or non-familial epilepsy cases and may well represent the transition from a genetic to a genomic approach to epilepsy.


2014 ◽  
Vol 1051 ◽  
pp. 311-316 ◽  
Author(s):  
Xi Mei Luo ◽  
Zhi Lei Gao ◽  
Hui Min Zhang ◽  
An Jun Li ◽  
Hong Kui He ◽  
...  

In recent years, despite the significant improvement of sequencing technologies such as the pyrosequencing, rapid evaluation of microbial community structures remains very difficult because of the abundance and complexity of organisms in almost all natural microbial communities. In this paper, a group of phylum-specific primers were elaborately designed based on a single nucleotide discrimination technology to quantify the main microbial community structure from GuJingGong pit mud samples using the real-time quantitative PCR (qPCR). Specific PCR (polymerase chain reaction) primers targeting a particular group would provide promising sensitivity and more in-depth assessment of microbial communities.


2021 ◽  
Vol 111 (1) ◽  
pp. 8-11
Author(s):  
Remco Stam ◽  
Pierre Gladieux ◽  
Boris A. Vinatzer ◽  
Erica M. Goss ◽  
Neha Potnis ◽  
...  

Population genetics has been a key discipline in phytopathology for many years. The recent rise in cost-effective, high-throughput DNA sequencing technologies, allows sequencing of dozens, if not hundreds of specimens, turning population genetics into population genomics and opening up new, exciting opportunities as described in this Focus Issue . Without the limitations of genetic markers and the availability of whole or near whole-genome data, population genomics can give new insights into the biology, evolution and adaptation, and dissemination patterns of plant-associated microbes.


2018 ◽  
Author(s):  
Adrian Fritz ◽  
Peter Hofmann ◽  
Stephan Majda ◽  
Eik Dahms ◽  
Johannes Dröge ◽  
...  

Shotgun metagenome data sets of microbial communities are highly diverse, not only due to the natural variation of the underlying biological systems, but also due to differences in laboratory protocols, replicate numbers, and sequencing technologies. Accordingly, to effectively assess the performance of metagenomic analysis software, a wide range of benchmark data sets are required. Here, we describe the CAMISIM microbial community and metagenome simulator. The software can model different microbial abundance profiles, multi-sample time series and differential abundance studies, includes real and simulated strain-level diversity, and generates second and third generation sequencing data from taxonomic profiles or de novo. Gold standards are created for sequence assembly, genome binning, taxonomic binning, and taxonomic profiling. CAMSIM generated the benchmark data sets of the first CAMI challenge. For two simulated multi-sample data sets of the human and mouse gut microbiomes we observed high functional congruence to the real data. As further applications, we investigated the effect of varying evolutionary genome divergence, sequencing depth, and read error profiles on two popular metagenome assemblers, MEGAHIT and metaSPAdes, on several thousand small data sets generated with CAMISIM. CAMISIM can simulate a wide variety of microbial communities and metagenome data sets together with truth standards for method evaluation. All data sets and the software are freely available at: https://github.com/CAMI-challenge/CAMISIM


Author(s):  
Kar-Yan Su ◽  
Wai-Leng Lee ◽  
Vinod Balasubramaniam

One in eight women will be diagnosed with breast cancer (BC) in their lifetime, resulting in over 2 million cases annually. BC is the most common cancer among women. Unfortunately, the etiology of majority of cases remains unknown. Recently, evidence has shown that the human microbiota plays an important role in health and disease. Intriguingly, studies have revealed the presence of microorganisms in human breast tissue, which was previously presumed to be sterile. Next-generation sequencing technologies have paved way for the investigation of breast microbiota, uncovering bacterial signatures that are associated with BC. Some of the bacterial species were found to possess pro-carcinogenic and/or anti-carcinogenic properties, suggesting that the breast microbiota has potentially crucial roles in maintenance of breast health. In this review, we summarize the recent findings on breast tissue microbiota and its interplay with BC. Bacterial signatures identified via next-generation sequencing as well as their impact on breast carcinogenesis and cancer therapies are reviewed. Correlation of breast tissue microbiota and other factors, such as geographical and racial differences, in BC is discussed. Additionally, we discuss the future directions of research on breast microbiota as well as its potential role in prevention, diagnosis and treatment of BC.


2019 ◽  
Author(s):  
Myrto Tsiknia ◽  
Stilianos Fodelianakis ◽  
Nikolaos P. Nikolaidis ◽  
Nikolaos V. Paranychianakis

AbstractThere is a renewed interest in recent years on the ecological processes (stochastic vs selective) driving the assembly of microbial communities. Such information could potentially improve our understanding on ecosystem functioning and resilience to disturbances, ecosystem response to environmental shifts, and adoption of sustainable soil management practices. Herein, employing a suite of existing methodologies, we show that stochastic processes have an important role on the assembly of soil bacterial communities at a Mediterranean watershed. Moreover, we document that the relative contribution of assembly processes varies over the years. The observed intensification of stochastic processes was accompanied by a decrease in the contribution of variable selection in favor of homogeneous selection and dispersal and this trend was only marginally affected by land use (natural vs agricultural lands) or soil depth. Our study also revealed a high inter-annual turnover of soil microbial communities that was likely stimulated by the weak environmental selection and the prevailing environmental conditions (drying-wetting cycles) in Mediterranean landscapes, implying potential impacts on ecosystem functioning and our ability to predict soil response to environmental shifts. Using nitrogen mineralization rate (NMR) as a representative function we document highly variable NMR over the sampling years, land uses and soil depths and lack of significant associations with the monitored environmental variables and individual taxa. In summary, our study provides novel insights on the organization and functioning of microbial communities at Mediterranean ecosystems and sets directions towards a more advanced understanding of the relationships among environmental factors, microbial community structure, and ecosystem functioning that could contribute to sustainable management of these severely degraded ecosystems.


Forests ◽  
2019 ◽  
Vol 10 (7) ◽  
pp. 550 ◽  
Author(s):  
Huili Feng ◽  
Jiahuan Guo ◽  
Weifeng Wang ◽  
Xinzhang Song ◽  
Shuiqiang Yu

Understanding the composition and diversity of soil microorganisms that typically mediate the soil biogeochemical cycle is crucial for estimating greenhouse gas flux and mitigating global changes in plantation forests. Therefore, the objectives of this study were to investigate changes in diversity and relative abundance of bacteria and archaea with soil profiles and the potential factors influencing the vertical differentiation of microbial communities in a poplar plantation. We investigated soil bacterial and archaeal community compositions and diversities by 16S rRNA gene Illumina MiSeq sequencing at different depths of a poplar plantation forest in Chenwei forest farm, Sihong County, Jiangsu, China. More than 882,422 quality-filtered 16S rRNA gene sequences were obtained from 15 samples, corresponding to 34 classified phyla and 68 known classes. Ten major bacterial phyla and two archaeal phyla were found. The diversity of bacterial and archaeal communities decreased with depth of the plantation soil. Analysis of variance (ANOVA) of relative abundance of microbial communities exhibited that Nitrospirae, Verrucomicrobia, Latescibacteria, GAL15, SBR1093, and Euryarchaeota had significant differences at different depths. The transition zone of the community composition between the surface and subsurface occurred at 10–20 cm. Overall, our findings highlighted the importance of depth with regard to the complexity and diversity of microbial community composition in plantation forest soils.


Sign in / Sign up

Export Citation Format

Share Document