A curated transcriptome dataset collection to investigate inborn errors of immunity

F1000Research ◽

10.12688/f1000research.18048.1 ◽

2019 ◽

Vol 8 ◽

pp. 188

Author(s):

Salim Bougarn ◽

Sabri Boughorbel ◽

Damien Chaussabel ◽

Nico Marr

Keyword(s):

Web Application ◽

Large Scale ◽

Mononuclear Cells ◽

Ex Vivo ◽

Clinical Manifestations ◽

Inherited Disorders ◽

Transcriptional Responses ◽

Control Subjects ◽

A Genome ◽

Wide Range

Primary immunodeficiencies (PIDs) are a heterogeneous group of inherited disorders, frequently caused by loss-of-function and less commonly by gain-of-function mutations, which can result in susceptibility to a broad or a very narrow range of infections but also in inflammatory, allergic or malignant diseases. Owing to the wide range in clinical manifestations and variability in penetrance and expressivity, there is an urgent need to better understand the underlying molecular, cellular and immunological phenotypes in PID patients in order to improve clinical diagnosis and management. Here we have compiled a manually curated collection of public transcriptome datasets mainly obtained from human whole blood, peripheral blood mononuclear cells (PBMCs) or fibroblasts of patients with PIDs and of control subjects for subsequent meta-analysis, query and interpretation. A total of nineteen (19) datasets derived from studies of PID patients were identified and retrieved from the NCBI Gene Expression Omnibus (GEO) database and loaded in GXB, a custom web application designed for interactive query and visualization of integrated large-scale data. The dataset collection includes samples from well characterized PID patients that were stimulated ex vivo under a variety of conditions to assess the molecular consequences of the underlying, naturally occurring gene defects on a genome-wide scale. Multiple sample groupings and rank lists were generated to facilitate comparisons of the transcriptional responses between different PID patients and control subjects. The GXB tool enables browsing of a single transcript across studies, thereby providing new perspectives on the role of a given molecule across biological systems and PID patients. This dataset collection is available at http://pid.gxbsidra.org/dm3/geneBrowser/list.

Download Full-text

A curated transcriptome dataset collection to investigate inborn errors of immunity

F1000Research ◽

10.12688/f1000research.18048.2 ◽

2019 ◽

Vol 8 ◽

pp. 188

Author(s):

Salim Bougarn ◽

Sabri Boughorbel ◽

Damien Chaussabel ◽

Nico Marr

Keyword(s):

Web Application ◽

Large Scale ◽

Mononuclear Cells ◽

Ex Vivo ◽

Clinical Manifestations ◽

Inherited Disorders ◽

Transcriptional Responses ◽

Control Subjects ◽

A Genome ◽

Wide Range

Primary immunodeficiencies (PIDs) are a heterogeneous group of inherited disorders, frequently caused by loss-of-function and less commonly by gain-of-function mutations, which can result in susceptibility to a broad or a very narrow range of infections but also in inflammatory, allergic or malignant diseases. Owing to the wide range in clinical manifestations and variability in penetrance and expressivity, there is an urgent need to better understand the underlying molecular, cellular and immunological phenotypes in PID patients in order to improve clinical diagnosis and management. Here we have compiled a manually curated collection of public transcriptome datasets mainly obtained from human whole blood, peripheral blood mononuclear cells (PBMCs) or fibroblasts of patients with PIDs and of control subjects for subsequent meta-analysis, query and interpretation. A total of eighteen (18) datasets derived from studies of PID patients were identified and retrieved from the NCBI Gene Expression Omnibus (GEO) database and loaded in GXB, a custom web application designed for interactive query and visualization of integrated large-scale data. The dataset collection includes samples from well characterized PID patients that were stimulated ex vivo under a variety of conditions to assess the molecular consequences of the underlying, naturally occurring gene defects on a genome-wide scale. Multiple sample groupings and rank lists were generated to facilitate comparisons of the transcriptional responses between different PID patients and control subjects. The GXB tool enables browsing of a single transcript across studies, thereby providing new perspectives on the role of a given molecule across biological systems and PID patients. This dataset collection is available at http://pid.gxbsidra.org/dm3/geneBrowser/list.

Download Full-text

NLRP3 Inflammosome Polymorphisms Are Enriched in Myelodysplastic Syndrome Patients with Autoimmune Disorders

Blood ◽

10.1182/blood.v126.23.1659.1659 ◽

2015 ◽

Vol 126 (23) ◽

pp. 1659-1659 ◽

Cited By ~ 2

Author(s):

Syed A Mian ◽

Austin G. Kulasekararaj ◽

Aytug Kizilors ◽

Kathy L McGraw ◽

Shahram Kordasti ◽

...

Keyword(s):

Large Scale ◽

Mononuclear Cells ◽

Clinical Manifestations ◽

Amplicon Sequencing ◽

Autoimmune Disorders ◽

European Population ◽

Mrna Levels ◽

Serum Cytokine ◽

Available N ◽

Nlrp3 Gene

Abstract Autoimmune disorders (AIDs) are commonly observed in up to 30% of myelodysplastic syndromes (MDS) patients. Clinical manifestations such as acute neutrophilic febrile dermatosis, rheumatoid arthritis, inflammatory bowel disease, pulmonary infiltrates and peripheral polyneuropathy, precede or accompany the diagnosis of MDS. In fact, large-scale epidemiologic studies have suggested that patients with AIDs have an elevated risk of developing MDS. To gain more insight into the association of MDS and AIDs, bone marrow mononuclear cells available from 202 patients were sequenced for 'inflammsome' pathway genes NLRP3 and MYD88, which have been shown to carry mutations in various AIDs. According to WHO classification, the patient cohort included 49% RA/RARS/RCMD, 24% RAEB, 6% 5q syndrome, 10% AML, 6% MPD/MDS and 5% tMDS. Median age of the cohort was 62 years (IQR 54.8-62.3). Autoimmune data was available on 129 patients and 49/129 patients had evidence of AID. 31 patients were diagnosed with MDS-associated 'Sweets Syndrome' AID. Myeloid neoplasm-related 22 gene panel mutation data was available for 171 patients. Initially all protein coding exons of NLRP3 and MYD88 genes were sequenced in 31 cases diagnosed with MDS-Sweets syndrome. Targeted sequencing (Miseq, Illumina) revealed NLRP3 variants in 6/31 cases. NLRP3 heterozygous variants detected were V200M (n=1/31, 3%), Q705K (n=4/31, 13%) and T954M (n=1/31, 3%). Two of these variants (V200M and Q705K) have been previously described as pathogenic and linked with various autoimmune disorders1,2. Variant T954M is reported in COSMIC v73 database as a confirmed somatic mutation. The frequency of these variants in general European population (NHLBI ESP 2015) varies in-between 0.6% to 5% (V200, rs121908147 MAF= 0.7%; Q705, rs35829419 MAF= 5% and T954, MAF= <0.1%). None of the patients sequenced in our study had MYD88 gene mutations. Therefore, we decided to perform targeted amplicon sequencing on three codons (V200, Q705 and T954) of the NLRP3 gene in remainder of the 171 cases. NLRP3 heterozygous variants V200M and Q705K were observed in 2.3% (n=4/171) and 12% (n= 20/171), of the patients screened, respectively. T954M variant was not detected in the follow-up cohort. Sequencing of the paired constitutional DNA (CD3+ lymphocytes/ Skin biopsy), where available (n=10), confirmed that V200M and Q705K are inherited SNPs. The frequency of these NLRP3 variants observed among our study population is marginally higher than previously described in the general European population (Pearson chi-square test, P >0.05) In order to detect the effects of the NLRP3 variant on the inflammasome pathway, we quantified the serum cytokine (IL1β, IL18, TNFα and INFγ) levels using ProcartaPlex Human Cytokine & Chemokine Panel 1A (34 plex, eBioscience) kit in patients with NLRP3 variants (MDS/sweets syndrome, n =3) vs NLRP3 wildtype (MDS-Sweets syndrome, n=5; MDS/non-sweets syndrome, n=2). No difference was observed in the serum cytokine levels between two patient groups. Next, we investigated the mRNA levels of the NLRP3 and other genes that lie downstream of the NLRP3 inflammasome pathway (IL1β, IL18, TNFα, CASPASE-1 and INGγ). 34 patients (NLRP3 variant, n= 22 and NLRP3 Wildtype, n= 12) were included in this analysis. An increased trend in the IL18 mRNA levels was observed in patients with NLRP3 variant vs NLRP3 wildtype. In conclusion, our study shows that the NLRP3 variants are enriched in MDS patients with AIDs compared to historical data on the general population. Patients with NLRP3 variants have an increased levels of IL18 mRNA levels. A larger study to explore the inflammasome pathway is needed to define the molecular pathogenesis of the autoimmunity associated with MDS. Identifying the casual genomic variants of AIDs in MDS may provide relevance to development of novel therapeutic strategies. References 1. Verma D, Sarndahl E, Andersson H, et al. The Q705K polymorphism in NLRP3 is a gain-of-function alteration leading to excessive interleukin-1beta and IL-18 production. PLoS One. 2012;7(4):e34977. 2 . Hoffman HM, Mueller JL, Broide DH, Wanderer AA, Kolodner RD. Mutation of a new gene encoding a putative pyrin-like protein causes familial cold autoinflammatory syndrome and Muckle-Wells syndrome. Nat Genet. 2001;29(3):301-305. Disclosures Kulasekararaj: Alexion: Consultancy. List:Celgene Corporation: Honoraria, Research Funding.

Download Full-text

PinAPL-Py: A comprehensive web-application for the analysis of CRISPR/Cas9 screens

10.1101/147462 ◽

2017 ◽

Author(s):

Philipp N. Spahn ◽

Tyler Bath ◽

Ryan J. Weiss ◽

Jihoon Kim ◽

Jeffrey D. Esko ◽

...

Keyword(s):

Web Application ◽

Large Scale ◽

Sequencing Data ◽

Bioinformatic Tools ◽

Link Type ◽

Screening Experiments ◽

Independent Analysis ◽

Wide Range ◽

Set Up ◽

Sequence Quality

AbstractBackgroundLarge-scale genetic screens using CRISPR/Cas9 technology have emerged as a major tool for functional genomics. With its increased popularity, experimental biologists frequently acquire large sequencing datasets for which they often do not have an easy analysis option. While a few bioinformatic tools have been developed for this purpose, their utility is still hindered either due to limited functionality or the requirement of bioinformatic expertise.ResultsTo make sequencing data analysis of CRISPR/Cas9 screens more accessible to a wide range of scientists, we developed a Platform-independent Analysis of Pooled Screens using Python (PinAPL-Py), which is operated as an intuitive web-service. PinAPL-Py implements state-of-the-art tools and statistical models, assembled in a comprehensive workflow covering sequence quality control, automated sgRNA sequence extraction, alignment, sgRNA enrichment/depletion analysis and gene ranking. The workflow is set up to use a variety of popular sgRNA libraries as well as custom libraries that can be easily uploaded. Various analysis options are offered, suitable to analyze a large variety of CRISPR/Cas9 screening experiments. Analysis output includes ranked lists of sgRNAs and genes, and publication-ready plots.ConclusionsPinAPL-Py helps to advance genome-wide screening efforts by combining comprehensive functionality with user-friendly implementation. PinAPL-Py is freely accessible at http://pinapl-py.ucsd.edu with instructions, documentation and test datasets. The source code is available at https://github.com/LewisLabUCSD/PinAPL-Py

Download Full-text

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

10.7287/peerj.preprints.554v2 ◽

2015 ◽

Cited By ~ 5

Author(s):

Donovan H Parks ◽

Michael Imelfort ◽

Connor T Skennerton ◽

Philip Hugenholtz ◽

Gene W Tyson

Keyword(s):

Large Scale ◽

Single Cells ◽

Metagenomic Data ◽

Marker Genes ◽

Specific Gene ◽

Diverse Range ◽

Automated Method ◽

A Genome ◽

Wide Range

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of ‘marker’ genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.

Download Full-text

VIVID: a web application for variant interpretation and visualisation in multidimensional analyses

10.1101/2021.11.16.468904 ◽

2021 ◽

Author(s):

Swapnil Tichkule ◽

Yoochan Myung ◽

Myo T Naung ◽

Brendan RE Ansell ◽

Andrew Guy ◽

...

Keyword(s):

Web Application ◽

Large Scale ◽

Balancing Selection ◽

Rapid Assessment ◽

Three Dimensional ◽

Genetic Load ◽

Evolutionary Genetics ◽

Wide Range ◽

Parasitic Protist ◽

Functional Antibodies

Large-scale comparative genomics- and population genetic studies generate enormous amounts of polymorphism data in the form of DNA variants. Ultimately, the goal of many of these studies is to associate genetic variants to phenotypes or fitness. We introduce VIVID, an interactive, user-friendly web application that integrates a wide range of approaches for encoding genotypic to phenotypic information in any organism or disease, from an individual or population, in three-dimensional (3D) space. It allows mutation mapping and annotation, calculation of interactions and conservation scores, prediction of harmful effects, analysis of diversity and selection, and 3-dimensional (3D) visualisation of genotypic information encoded in Variant Call Format (VCF) on AlphaFold2 protein models. VIVID enables the rapid assessment of genes of interest in the study of adaptive evolution and the genetic load, and it helps prioritising targets for experimental validation. We demonstrate the utility of VIVID by exploring the evolutionary genetics of the parasitic protist Plasmodium falciparum, revealing geographic variation in the signature of balancing selection in potential targets of functional antibodies.

Download Full-text

Microbiome Research and Multi-Omics Integration for Personalized Medicine in Asthma

Journal of Personalized Medicine ◽

10.3390/jpm11121299 ◽

2021 ◽

Vol 11 (12) ◽

pp. 1299

Author(s):

Marianthi Logotheti ◽

Panagiotis Agioutantis ◽

Paraskevi Katsaounou ◽

Heleni Loutrari

Keyword(s):

Large Scale ◽

Clinical Manifestations ◽

Response To Treatment ◽

Inflammatory Disorder ◽

Host Interactions ◽

Status Monitoring ◽

Omics Integration ◽

Wide Range ◽

Microbiome Research ◽

Global Understanding

Asthma is a multifactorial inflammatory disorder of the respiratory system characterized by high diversity in clinical manifestations, underlying pathological mechanisms and response to treatment. It is generally established that human microbiota plays an essential role in shaping a healthy immune response, while its perturbation can cause chronic inflammation related to a wide range of diseases, including asthma. Systems biology approaches encompassing microbiome analysis can offer valuable platforms towards a global understanding of asthma complexity and improving patients’ classification, status monitoring and therapeutic choices. In the present review, we summarize recent studies exploring the contribution of microbiota dysbiosis to asthma pathogenesis and heterogeneity in the context of asthma phenotypes–endotypes and administered medication. We subsequently focus on emerging efforts to gain deeper insights into microbiota–host interactions driving asthma complexity by integrating microbiome and host multi-omics data. One of the most prominent achievements of these research efforts is the association of refractory neutrophilic asthma with certain microbial signatures, including predominant pathogenic bacterial taxa (such as Proteobacteria phyla, Gammaproteobacteria class, especially species from Haemophilus and Moraxella genera). Overall, despite existing challenges, large-scale multi-omics endeavors may provide promising biomarkers and therapeutic targets for future development of novel microbe-based personalized strategies for diagnosis, prevention and/or treatment of uncontrollable asthma.

Download Full-text

Massive and Selective Ex Vivo Generation of Matured and Functional Human Red Blood Cells (RBC) from Hematopoietic Stem Cells of Diverse Origins: Towards the New Concept of “Cultured RBC”.

Blood ◽

10.1182/blood.v104.11.379.379 ◽

2004 ◽

Vol 104 (11) ◽

pp. 379-379

Author(s):

Luc Douay ◽

Ladan Kobari ◽

Helene Lapillonne ◽

David Chalmers ◽

Laurent Kiger ◽

...

Keyword(s):

Large Scale ◽

Ex Vivo ◽

Erythroid Differentiation ◽

Culture Conditions ◽

Hematopoietic Stem ◽

Human Red Blood Cells ◽

Wide Range ◽

Stromal Cell Line ◽

A Cell ◽

Free Media

Abstract We report a technological approach permitting, for the first time, the massive (up to 2x106-fold cell expansion) and selective (100%) ex vivo production of mature RBCs (cRBCs) starting from CD 34+ cells from peripheral blood (PB), bone marrow (BM) or cord blood (CB) into mature red cells in three steps: firstly, cell proliferation and erythroid differentiation were induced in serum free media supplemented with SCF, IL-3 and Epo for 8 days. Secondly, cells were co-cultured with additional Epo alone on either the murine MS-5 stromal cell line or human mesenchymal cells for 3 days. In the third step, all exogenous factors were withdrawn and cells were incubated on a simple stroma for 4 to 10 days. These cultured erythroid cells (reticulocytes and mature RBCs) displayed characteristics identical to those of native cells, in terms of MCV, MCH, MCHC, enzyme content (G6PD and PK) and deformability. The nature of the Hb produced depended on both the origin of the CD34+ cells and the culture conditions. cRBCs derived from PB or adult BM contained adult Hb (95±1%) whereas cRBCs derived from CB contained essentially HbF (64±13%). As for native RBCs, Hb was able to fix and release oxygen. CFSE-labelled-reticulocytes ex vivo generated from leukapheresis were injected into NOD-SCID mice. The transfused reticulocytes were found in the circulation to the same extent as native RBCs and fully matured into RBCs. This methodology is applicable for fundamental analysis of the mechanisms of terminal erythropoiesis and hemoglobin synthesis. Moreover, large scale cRBCs production could be possible with such a protocol. It can therefore be extrapolated to a wide range of clinical applications in the field of gene therapy, infectious diseases and particularly transfusion medicine with a pointed interest for the generation of a cell population homogeneous in age, thus achieving the new concept of cultured RBCs transfusion.

Download Full-text

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

10.7287/peerj.preprints.554v1 ◽

2014 ◽

Cited By ~ 7

Author(s):

Donovan H Parks ◽

Michael Imelfort ◽

Connor T Skennerton ◽

Philip Hugenholtz ◽

Gene W Tyson

Keyword(s):

Large Scale ◽

Single Cells ◽

Metagenomic Data ◽

Marker Genes ◽

Specific Gene ◽

Diverse Range ◽

Automated Method ◽

A Genome ◽

Wide Range

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of ‘marker’ genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree along with information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities. CheckM is open source software available at http://ecogenomics.github.io/CheckM.

Download Full-text

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

10.7287/peerj.preprints.554 ◽

2015 ◽

Cited By ~ 5

Author(s):

Donovan H Parks ◽

Michael Imelfort ◽

Connor T Skennerton ◽

Philip Hugenholtz ◽

Gene W Tyson

Keyword(s):

Large Scale ◽

Single Cells ◽

Metagenomic Data ◽

Marker Genes ◽

Specific Gene ◽

Diverse Range ◽

Automated Method ◽

A Genome ◽

Wide Range

Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. While this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of ‘marker’ genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate, single cell and metagenome derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination, and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.

Download Full-text