Integrating Affymetrix microarray data sets using probe-level test statistic for predicting prostate cancer

Analysis of probe level patterns in Affymetrix microarray data

BMC Bioinformatics ◽

10.1186/1471-2105-8-146 ◽

2007 ◽

Vol 8 (1) ◽

pp. 146 ◽

Cited By ~ 10

Author(s):

Alexander C Cambon ◽

Abdelnaby Khalyfa ◽

Nigel GF Cooper ◽

Caryn M Thompson

Keyword(s):

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data

Download Full-text

Combining Hadoop and GPU to preprocess large Affymetrix microarray data

2014 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata.2014.7004293 ◽

2014 ◽

Cited By ~ 5

Author(s):

Sufeng Niu ◽

Guangyu Yang ◽

Nilim Sarma ◽

Pengfei Xuan ◽

Melissa C. Smith ◽

...

Keyword(s):

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data

Download Full-text

MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data

BMC Bioinformatics ◽

10.1186/1471-2105-15-69 ◽

2014 ◽

Vol 15 (1) ◽

pp. 69 ◽

Cited By ~ 13

Author(s):

Zhuohui Gan ◽

Jianwu Wang ◽

Nathan Salomonis ◽

Jennifer C Stowe ◽

Gabriel G Haddad ◽

...

Keyword(s):

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data ◽

Meta Analyses

Download Full-text

affyPara—a Bioconductor Package for Parallelized Preprocessing Algorithms of Affymetrix Microarray Data

Bioinformatics and Biology Insights ◽

10.4137/bbi.s3060 ◽

2009 ◽

Vol 3 ◽

pp. BBI.S3060 ◽

Cited By ~ 6

Author(s):

Markus Schmidberger ◽

Esmeralda Vicedo ◽

Ulrich Mansmann

Keyword(s):

Microarray Data ◽

Main Memory ◽

Computer Hardware ◽

Data Partition ◽

Bioconductor Package ◽

Data Repositories ◽

Affymetrix Microarray ◽

Memory Problems ◽

Affymetrix Microarray Data ◽

Bioconductor Project

Microarray data repositories as well as large clinical applications of gene expression allow to analyse several hundreds of microarrays at one time. The preprocessing of large amounts of microarrays is still a challenge. The algorithms are limited by the available computer hardware. For example, building classification or prognostic rules from large microarray sets will be very time consuming. Here, preprocessing has to be a part of the cross-validation and resampling strategy which is necessary to estimate the rule's prediction quality honestly. This paper proposes the new Bioconductor package affyPara for parallelized preprocessing of Affymetrix microarray data. Partition of data can be applied on arrays and parallelization of algorithms is a straightforward consequence. The partition of data and distribution to several nodes solves the main memory problems and accelerates preprocessing by up to the factor 20 for 200 or more arrays. affyPara is a free and open source package, under GPL license, available form the Bioconductor project at www.bioconductor.org . A user guide and examples are provided with the package.

Download Full-text

Celsius: a community resource for Affymetrix microarray data

Genome Biology ◽

10.1186/gb-2007-8-6-r112 ◽

2007 ◽

Vol 8 (6) ◽

pp. R112 ◽

Cited By ~ 43

Author(s):

Allen Day ◽

Marc RJ Carlson ◽

Jun Dong ◽

Brian D O'Connor ◽

Stanley F Nelson

Keyword(s):

Microarray Data ◽

Community Resource ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data

Download Full-text

Comparison of Nanostring nCounter® Data on FFPE Colon Cancer Samples and Affymetrix Microarray Data on Matched Frozen Tissues

PLoS ONE ◽

10.1371/journal.pone.0153784 ◽

2016 ◽

Vol 11 (5) ◽

pp. e0153784 ◽

Cited By ~ 18

Author(s):

Xi Chen ◽

Natasha G. Deane ◽

Keeli B. Lewis ◽

Jiang Li ◽

Jing Zhu ◽

...

Keyword(s):

Colon Cancer ◽

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data

Download Full-text

Unsupervised Thresholding of Affymetrix Microarray Data

2007 International Conference on Computing: Theory and Applications (ICCTA'07) ◽

10.1109/iccta.2007.129 ◽

2007 ◽

Cited By ~ 2

Author(s):

Matthew W.B. Trotter ◽

Bernard F. Buxton

Keyword(s):

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data

Download Full-text

GIANT: galaxy-based tool for interactive analysis of transcriptomic data

Scientific Reports ◽

10.1038/s41598-020-76769-w ◽

2020 ◽

Vol 10 (1) ◽

Author(s):

Jimmy Vandel ◽

Céline Gheeraert ◽

Bart Staels ◽

Jérôme Eeckhoute ◽

Philippe Lefebvre ◽

...

Keyword(s):

Data Mining ◽

Biomedical Research ◽

Microarray Data ◽

Rna Seq ◽

Affymetrix Microarray ◽

Transcriptomic Data ◽

Interactive Analysis ◽

Affymetrix Microarray Data

AbstractTranscriptomic analyses are broadly used in biomedical research calling for tools allowing biologists to be directly involved in data mining and interpretation. We present here GIANT, a Galaxy-based tool for Interactive ANalysis of Transcriptomic data, which consists of biologist-friendly tools dedicated to analyses of transcriptomic data from microarray or RNA-seq analyses. GIANT is organized into modules allowing researchers to tailor their analyses by choosing the specific set of tool(s) to analyse any type of preprocessed transcriptomic data. It also includes a series of tools dedicated to the handling of raw Affymetrix microarray data. GIANT brings easy-to-use solutions to biologists for transcriptomic data mining and interpretation.

Download Full-text

MAAMD: A Workflow to Standardize Meta-Analyses of Affymetrix Microarray Data

2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology ◽

10.1109/hisb.2012.45 ◽

2012 ◽

Author(s):

Zhuohui Gan ◽

Jianwu Wang ◽

Nathan Salomonis ◽

Ilkay Altintas ◽

Andrew D. McCulloch ◽

...

Keyword(s):

Microarray Data ◽

Affymetrix Microarray ◽

Affymetrix Microarray Data ◽

Meta Analyses

Download Full-text

Integrative Analysis of Gene Expression Data Including an Assessment of Pathway Enrichment for Predicting Prostate Cancer

Cancer Informatics ◽

10.1177/117693510600200018 ◽

2006 ◽

Vol 2 ◽

pp. 117693510600200

Author(s):

Pingzhao Hu ◽

Celia M.T. Greenwood ◽

Joseph Beyene

Keyword(s):

Gene Expression ◽

Prostate Cancer ◽

Biological Pathways ◽

Integrative Analysis ◽

Differentially Expressed ◽

Expression Data ◽

Test Statistic ◽

Affymetrix Microarray ◽

Level Data ◽

Level Model

Background Microarray technology has been previously used to identify genes that are differentially expressed between tumour and normal samples in a single study, as well as in syntheses involving multiple studies. When integrating results from several Affymetrix microarray datasets, previous studies summarized probeset-level data, which may potentially lead to a loss of information available at the probe-level. In this paper, we present an approach for integrating results across studies while taking probe-level data into account. Additionally, we follow a new direction in the analysis of microarray expression data, namely to focus on the variation of expression phenotypes in predefined gene sets, such as pathways. This targeted approach can be helpful for revealing information that is not easily visible from the changes in the individual genes. Results We used a recently developed method to integrate Affymetrix expression data across studies. The idea is based on a probe-level based test statistic developed for testing for differentially expressed genes in individual studies. We incorporated this test statistic into a classic random-effects model for integrating data across studies. Subsequently, we used a gene set enrichment test to evaluate the significance of enriched biological pathways in the differentially expressed genes identified from the integrative analysis. We compared statistical and biological significance of the prognostic gene expression signatures and pathways identified in the probe-level model (PLM) with those in the probeset-level model (PSLM). Our integrative analysis of Affymetrix microarray data from 110 prostate cancer samples obtained from three studies reveals thousands of genes significantly correlated with tumour cell differentiation. The bioinformatics analysis, mapping these genes to the publicly available KEGG database, reveals evidence that tumour cell differentiation is significantly associated with many biological pathways. In particular, we observed that by integrating information from the insulin signalling pathway into our prediction model, we achieved better prediction of prostate cancer. Conclusions Our data integration methodology provides an efficient way to identify biologically sound and statistically significant pathways from gene expression data. The significant gene expression phenotypes identified in our study have the potential to characterize complex genetic alterations in prostate cancer.

Download Full-text