Introduction to this special section: Cross-disciplinary applications of geophysics

2018 ◽  
Vol 37 (9) ◽  
pp. 654-654
Author(s):  
Kyle Spikes ◽  
Yongyi Li

As the volume of seismic and other data types continues to increase, the use of such data sets has extended to different approaches and techniques of data integration and interpretation. The intent of this special section on cross-disciplinary applications of geophysics is to highlight such uses of multiple data types. Although not limited to any type or location of a given reservoir, the two articles in this section primarily focus on onshore unconventional reservoirs. Nonetheless, the techniques and approaches will also be of interest to readers and practitioners who deal with conventional reservoirs, both onshore and offshore.

Author(s):  
Seema Ansari ◽  
Radha Mohanlal ◽  
Javier Poncela ◽  
Adeel Ansari ◽  
Komal Mohanlal

Combining vast amounts of heterogeneous data and increasing the processing power of existing database management tools is no doubt the emerging need of IT industry in coming years. The complexity and size of data sets that need to be acquired, analyzed, stored, sorted or transferred has spiked in the recent years. Due to the tremendously increasing volume of multiple data types, creating Big Data applications that can extract the valuable trends and relationships required for further processes or deriving useful results is quite challenging task. Companies, corporate organizations or be it government agencies, all need to analyze and execute Big Data implementation to pave new paths of productivity and innovation. This chapter discusses the emerging technology of modern era: Big Data with detailed description of the three V's (Variety, Velocity and Volume). Further chapters will enable to understand the concepts of data mining and big data analysis, Potentials of Big Data in five domains i.e. Healthcare, Public sector, Retail, Manufacturing and Personal location Data.


2020 ◽  
Vol 3 (1) ◽  
pp. 113-136
Author(s):  
Silvia Pineda ◽  
Daniel G. Bunis ◽  
Idit Kosti ◽  
Marina Sirota

Over the last several years, next-generation sequencing and its recent push toward single-cell resolution have transformed the landscape of immunology research by revealing novel complexities about all components of the immune system. With the vast amounts of diverse data currently being generated, and with the methods of analyzing and combining diverse data improving as well, integrative systems approaches are becoming more powerful. Previous integrative approaches have combined multiple data types and revealed ways that the immune system, both as a whole and as individual parts, is affected by genetics, the microbiome, and other factors. In this review, we explore the data types that are available for studying immunology with an integrative systems approach, as well as the current strategies and challenges for conducting such analyses.


2014 ◽  
Vol 20 ◽  
pp. 13-26 ◽  
Author(s):  
Ei Fujioka ◽  
Connie Y. Kot ◽  
Bryan P. Wallace ◽  
Benjamin D. Best ◽  
Jerry Moxley ◽  
...  

2020 ◽  
Author(s):  
Pengcheng Zeng ◽  
Jiaxuan WangWu ◽  
Zhixiang Lin

AbstractUnsupervised methods, such as clustering methods, are essential to the analysis of single-cell genomic data. Most current clustering methods are designed for one data type only, such as scRNA-seq, scATAC-seq or sc-methylation data alone, and a few are developed for the integrative analysis of multiple data types. Integrative analysis of multimodal single-cell genomic data sets leverages the power in multiple data sets and can deepen the biological insight. We propose a coupled co-clustering-based unsupervised transfer learning algorithm (coupleCoC) for the integrative analysis of multimodal single-cell data. Our proposed coupleCoC builds upon the information theoretic co-clustering framework. We applied coupleCoC for the integrative analysis of scATAC-seq and scRNA-seq data, sc-methylation and scRNA-seq data, and scRNA-seq data from mouse and human. We demonstrate that coupleCoC improves the overall clustering performance and matches the cell subpopulations across multimodal single-cell genomic data sets. The software and data sets are available at https://github.com/cuhklinlab/coupleCoC.


2019 ◽  
Author(s):  
Nikola Simidjievski ◽  
Cristian Bodnar ◽  
Ifrah Tariq ◽  
Paul Scherer ◽  
Helena Andres-Terre ◽  
...  

ABSTRACTInternational initiatives such as the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) are collecting multiple data sets at different genome-scales with the aim to identify novel cancer bio-markers and predict patient survival. To analyse such data, several machine learning, bioinformatics and statistical methods have been applied, among them neural networks such as autoencoders. Although these models provide a good statistical learning framework to analyse multi-omic and/or clinical data, there is a distinct lack of work on how to integrate diverse patient data and identify the optimal design best suited to the available data.In this paper, we investigate several autoencoder architectures that integrate a variety of cancer patient data types (e.g., multi-omics and clinical data). We perform extensive analyses of these approaches and provide a clear methodological and computational framework for designing systems that enable clinicians to investigate cancer traits and translate the results into clinical applications. We demonstrate how these networks can be designed, built and, in particular, applied to tasks of integrative analyses of heterogeneous breast cancer data. The results show that these approaches yield relevant data representations that, in turn, lead to accurate and stable diagnosis.


2017 ◽  
Vol 18 (1) ◽  
Author(s):  
Carles Hernandez-Ferrer ◽  
Carlos Ruiz-Arenas ◽  
Alba Beltran-Gomila ◽  
Juan R. González

2021 ◽  
Vol 26 (2) ◽  
pp. 40
Author(s):  
Michael W. Daniels ◽  
Daniel Dvorkin ◽  
Rani K. Powers ◽  
Katerina Kechris

Integrating gene-level data is useful for predicting the role of genes in biological processes. This problem has typically focused on supervised classification, which requires large training sets of positive and negative examples. However, training data sets that are too small for supervised approaches can still provide valuable information. We describe a hierarchical mixture model that uses limited positively labeled gene training data for semi-supervised learning. We focus on the problem of predicting essential genes, where a gene is required for the survival of an organism under particular conditions. We applied cross-validation and found that the inclusion of positively labeled samples in a semi-supervised learning framework with the hierarchical mixture model improves the detection of essential genes compared to unsupervised, supervised, and other semi-supervised approaches. There was also improved prediction performance when genes are incorrectly assumed to be non-essential. Our comparisons indicate that the incorporation of even small amounts of existing knowledge improves the accuracy of prediction and decreases variability in predictions. Although we focused on gene essentiality, the hierarchical mixture model and semi-supervised framework is standard for problems focused on prediction of genes or other features, with multiple data types characterizing the feature, and a small set of positive labels.


1997 ◽  
Vol 36 (5) ◽  
pp. 61-68 ◽  
Author(s):  
Hermann Eberl ◽  
Amar Khelil ◽  
Peter Wilderer

A numerical method for the identification of parameters of nonlinear higher order differential equations is presented, which is based on the Levenberg-Marquardt algorithm. The estimation of the parameters can be performed by using several reference data sets simultaneously. This leads to a multicriteria optimization problem, which will be treated by using the Pareto optimality concept. In this paper, the emphasis is put on the presentation of the calibration method. As an example identification of the parameters of a nonlinear hydrological transport model for urban runoff is included, but the method can be applied to other problems as well.


Sign in / Sign up

Export Citation Format

Share Document