Integrating Affymetrix microarray data sets using probe-level test statistic for predicting prostate cancer

Author(s):  
Pingzhao Hu ◽  
Celia MT Greenwood ◽  
Joseph Beyene
2007 ◽  
Vol 8 (1) ◽  
pp. 146 ◽  
Author(s):  
Alexander C Cambon ◽  
Abdelnaby Khalyfa ◽  
Nigel GF Cooper ◽  
Caryn M Thompson

Author(s):  
Sufeng Niu ◽  
Guangyu Yang ◽  
Nilim Sarma ◽  
Pengfei Xuan ◽  
Melissa C. Smith ◽  
...  

2014 ◽  
Vol 15 (1) ◽  
pp. 69 ◽  
Author(s):  
Zhuohui Gan ◽  
Jianwu Wang ◽  
Nathan Salomonis ◽  
Jennifer C Stowe ◽  
Gabriel G Haddad ◽  
...  

2009 ◽  
Vol 3 ◽  
pp. BBI.S3060 ◽  
Author(s):  
Markus Schmidberger ◽  
Esmeralda Vicedo ◽  
Ulrich Mansmann

Microarray data repositories as well as large clinical applications of gene expression allow to analyse several hundreds of microarrays at one time. The preprocessing of large amounts of microarrays is still a challenge. The algorithms are limited by the available computer hardware. For example, building classification or prognostic rules from large microarray sets will be very time consuming. Here, preprocessing has to be a part of the cross-validation and resampling strategy which is necessary to estimate the rule's prediction quality honestly. This paper proposes the new Bioconductor package affyPara for parallelized preprocessing of Affymetrix microarray data. Partition of data can be applied on arrays and parallelization of algorithms is a straightforward consequence. The partition of data and distribution to several nodes solves the main memory problems and accelerates preprocessing by up to the factor 20 for 200 or more arrays. affyPara is a free and open source package, under GPL license, available form the Bioconductor project at www.bioconductor.org . A user guide and examples are provided with the package.


2007 ◽  
Vol 8 (6) ◽  
pp. R112 ◽  
Author(s):  
Allen Day ◽  
Marc RJ Carlson ◽  
Jun Dong ◽  
Brian D O'Connor ◽  
Stanley F Nelson

PLoS ONE ◽  
2016 ◽  
Vol 11 (5) ◽  
pp. e0153784 ◽  
Author(s):  
Xi Chen ◽  
Natasha G. Deane ◽  
Keeli B. Lewis ◽  
Jiang Li ◽  
Jing Zhu ◽  
...  

2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Jimmy Vandel ◽  
Céline Gheeraert ◽  
Bart Staels ◽  
Jérôme Eeckhoute ◽  
Philippe Lefebvre ◽  
...  

AbstractTranscriptomic analyses are broadly used in biomedical research calling for tools allowing biologists to be directly involved in data mining and interpretation. We present here GIANT, a Galaxy-based tool for Interactive ANalysis of Transcriptomic data, which consists of biologist-friendly tools dedicated to analyses of transcriptomic data from microarray or RNA-seq analyses. GIANT is organized into modules allowing researchers to tailor their analyses by choosing the specific set of tool(s) to analyse any type of preprocessed transcriptomic data. It also includes a series of tools dedicated to the handling of raw Affymetrix microarray data. GIANT brings easy-to-use solutions to biologists for transcriptomic data mining and interpretation.


2006 ◽  
Vol 2 ◽  
pp. 117693510600200
Author(s):  
Pingzhao Hu ◽  
Celia M.T. Greenwood ◽  
Joseph Beyene

Background Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM) with those in the probeset-level model (PSLM). Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many biological pathways. In particular, we observed that by integrating information from the insulin signalling pathway into our prediction model, we achieved better prediction of prostate cancer. Conclusions Our data integration methodology provides an efficient way to identify biologically sound and statistically significant pathways from gene expression data. The significant gene expression phenotypes identified in our study have the potential to characterize complex genetic alterations in prostate cancer.


Sign in / Sign up

Export Citation Format

Share Document