Blind estimation and correction of microarray batch effect

Mapping Intimacies ◽

10.1101/202952 ◽

2017 ◽

Author(s):

Sudhir Varma

Keyword(s):

Large Scale ◽

Reference Sample ◽

R Package ◽

Batch Effect ◽

Data Repository ◽

Blind Estimation ◽

Reference Set ◽

Shared Information ◽

Scale Integration ◽

Biological Differences

AbstractMicroarray batch effect (BE) has been the primary bottleneck for large-scale integration of data from multiple experiments. Current BE correction methods either need known batch identities (ComBat)or have the potential to overcorrect, by removing true but unknown biological differences (SVA).Even though the effects of technical differences on measured expression have been published, there are no BE correction algorithms that take the approach of predicting technical effects from parameters computed from a fixed reference sample set. We show that a set of signatures, each of which is a vector the length of the number of probes, calculated on a Reference set of microarray samples can predict much of the batch effect in other Validation sets. We present a rationale of selecting a Reference set of samples designed to estimate technical differences without removing biological differences. Putting both together, we introduce the Batch Effect Signature Correction (BESC) algorithm that uses the BES calculated on the Reference set to efficiently predict and remove BE. Using two independent Validation sets, we show that BESC is capable of removing batch effect without removing unknown but true biological differences. Much of the variations due to batch effect is shared between different microarray datasets. That shared information can be used to predict signatures (i.e. directions of perturbation) due to batch effect in new datasets. The correction is blind (without needing to re-compute the parameters on new samples to be corrected), single sample, (each sample is corrected independently of each other) and conservative (only those perturbations known to be likely to be due to technical differences are removed ensuring that unknown but important biological differences are maintained). Those three characteristics make it ideal for high-throughput correction of samples for a microarray data repository. An R Package besc implementing the algorithm is available from http://explainbio.com.

Download Full-text

OneD: increasing reproducibility of Hi-C Samples with abnormal karyotypes

10.1101/148254 ◽

2017 ◽

Cited By ~ 2

Author(s):

Enrique Vidal ◽

François le Dily ◽

Javier Quilez ◽

Ralph Stadhouders ◽

Yasmina Cuartero ◽

...

Keyword(s):

Copy Number ◽

Large Scale ◽

Chromosomal Abnormalities ◽

Three Dimensional ◽

R Package ◽

Copy Number Variations ◽

Normalization Methods ◽

Alternative Approach ◽

Systematic Biases ◽

Biological Differences

AbstractThe three-dimensional conformation of genomes is an essential component of their biological activity. The advent of the Hi-C technology enabled an unprecedented progress in our understanding of genome structures. However, Hi-C is subject to systematic biases that can compromise downstream analyses. Several strategies have been proposed to remove those biases, but the issue of abnormal karyotypes received little attention. Many experiments are performed in cancer cell lines, which typically harbor large-scale copy number variations that create visible defects on the raw Hi-C maps. The consequences of these widespread artifacts on the normalized maps are mostly unexplored. We observed that current normalization methods are not robust to the presence of large-scale copy number variations, potentially obscuring biological differences and enhancing batch effects. To address this issue, we developed an alternative approach designed to take into account chromosomal abnormalities. The method, called OneD, increases reproducibility among replicates of Hi-C samples with abnormal karyotype, outperforming previous methods significantly. On normal karyotypes, OneD fared equally well as state-of-the-art methods, making it a safe choice for Hi-C normalization. OneD is fast and scales well in terms of computing resources for resolutions up to 1 kbp. OneD is implemented as an R package available at http://www.github.com/qenvio/dryhic.

Download Full-text

Optimal Power Generation Mix Model considering Nationwide High-voltage Power Grid in Japan for the Analysis of Large-scale Integration of PV and Wind Power Generation

IEEJ Transactions on Power and Energy ◽

10.1541/ieejpes.136.864 ◽

2016 ◽

Vol 136 (12) ◽

pp. 864-875 ◽

Cited By ~ 4

Author(s):

Tatsuhiko Sugiyama ◽

Ryoichi Komiyama ◽

Yasumasa Fujii

Keyword(s):

Power Generation ◽

Wind Power ◽

High Voltage ◽

Large Scale ◽

Power Grid ◽

Wind Power Generation ◽

Optimal Power ◽

Large Scale Integration ◽

Scale Integration

Download Full-text

Plant hormones, plant growth regulators

Orvosi Hetilap ◽

10.1556/oh.2014.29939 ◽

2014 ◽

Vol 155 (26) ◽

pp. 1011-1018 ◽

Cited By ~ 1

Author(s):

György Végvári ◽

Edina Vidéki

Keyword(s):

Nervous System ◽

Growth And Development ◽

Large Scale ◽

Essential Role ◽

Higher Plants ◽

Structure And Function ◽

Wide Sense ◽

Large Scale Integration ◽

Scale Integration ◽

And Function

Plants seem to be rather defenceless, they are unable to do motion, have no nervous system or immune system unlike animals. Besides this, plants do have hormones, though these substances are produced not in glands. In view of their complexity they lagged behind animals, however, plant organisms show large scale integration in their structure and function. In higher plants, such as in animals, the intercellular communication is fulfilled through chemical messengers. These specific compounds in plants are called phytohormones, or in a wide sense, bioregulators. Even a small quantity of these endogenous organic compounds are able to regulate the operation, growth and development of higher plants, and keep the connection between cells, tissues and synergy beween organs. Since they do not have nervous and immume systems, phytohormones play essential role in plants’ life. Orv. Hetil., 2014, 155(26), 1011–1018.

Download Full-text

EOS (Electrical Overstress) Protection for VLSI (Very Large Scale Integration) Devices.

10.21236/ada150333 ◽

1984 ◽

Author(s):

D. D. Wilson ◽

W. E. Echols ◽

M. G. Rossi

Keyword(s):

Large Scale ◽

Very Large Scale Integration ◽

Large Scale Integration ◽

Electrical Overstress ◽

Scale Integration

Download Full-text

Restructurable VLSI (Very Large Scale Integration) Program

10.21236/ada152340 ◽

1984 ◽

Author(s):

G. C. O'Leary

Keyword(s):

Large Scale ◽

Very Large Scale Integration ◽

Integration Program ◽

Large Scale Integration ◽

Scale Integration

Download Full-text

Exploration of Nanometer Cognitive Reasoning Very Large Scale Integration (VLSI) Computer Architecures

10.21236/ada560043 ◽

2012 ◽

Author(s):

Jr Stine ◽

James E.

Keyword(s):

Large Scale ◽

Very Large Scale Integration ◽

Cognitive Reasoning ◽

Large Scale Integration ◽

Scale Integration

Download Full-text

Behavior Models of Voltage Differencing Inverting Buffered Amplifier and Applications in Circuit Analysis

Recent Advances in Electrical & Electronic Engineering (Formerly Recent Patents on Electrical & Electronic Engineering) ◽

10.2174/2352096511666180629151111 ◽

2019 ◽

Vol 12 (4) ◽

pp. 298-303

Author(s):

YongAn LI

Keyword(s):

Large Scale ◽

Vlsi Design ◽

Circuit Analysis ◽

Second Order ◽

Very Large Scale Integration ◽

Behavioral Models ◽

Order Filter ◽

Large Scale Integration ◽

Scale Integration ◽

Behavior Models

Background: The symbolic nodal analysis acts as a pivotal part of the very large scale integration (VLSI) design. Methods: In this work, based on the terminal relations for the pathological elements and the voltage differencing inverting buffered amplifier (VDIBA), twelve alternative pathological models for the VDIBA are presented. Moreover, the proposed models are applied to the VDIBA-based second-order filter and oscillator so as to simplify the circuit analysis. Results: The result shows that the behavioral models for the VDIBA are systematic, effective and powerful in the symbolic nodal circuit analysis.</P>

Download Full-text

NoRCE: non-coding RNA sets cis enrichment tool

BMC Bioinformatics ◽

10.1186/s12859-021-04112-9 ◽

2021 ◽

Vol 22 (1) ◽

Author(s):

Gulden Olgun ◽

Afshan Nabi ◽

Oznur Tastan

Keyword(s):

Expression Patterns ◽

Target Prediction ◽

Enrichment Analysis ◽

Fruit Fly ◽

Relevant Information ◽

R Package ◽

Data Repository ◽

Biologically Relevant ◽

Gene Sets ◽

Data Files

Abstract Background While some non-coding RNAs (ncRNAs) are assigned critical regulatory roles, most remain functionally uncharacterized. This presents a challenge whenever an interesting set of ncRNAs needs to be analyzed in a functional context. Transcripts located close-by on the genome are often regulated together. This genomic proximity on the sequence can hint at a functional association. Results We present a tool, NoRCE, that performs cis enrichment analysis for a given set of ncRNAs. Enrichment is carried out using the functional annotations of the coding genes located proximal to the input ncRNAs. Other biologically relevant information such as topologically associating domain (TAD) boundaries, co-expression patterns, and miRNA target prediction information can be incorporated to conduct a richer enrichment analysis. To this end, NoRCE includes several relevant datasets as part of its data repository, including cell-line specific TAD boundaries, functional gene sets, and expression data for coding & ncRNAs specific to cancer. Additionally, the users can utilize custom data files in their investigation. Enrichment results can be retrieved in a tabular format or visualized in several different ways. NoRCE is currently available for the following species: human, mouse, rat, zebrafish, fruit fly, worm, and yeast. Conclusions NoRCE is a platform-independent, user-friendly, comprehensive R package that can be used to gain insight into the functional importance of a list of ncRNAs of any type. The tool offers flexibility to conduct the users’ preferred set of analyses by designing their own pipeline of analysis. NoRCE is available in Bioconductor and https://github.com/guldenolgun/NoRCE.

Download Full-text