Multiscale Data Integration Using Markov Random Fields

2002 ◽  
Vol 5 (01) ◽  
pp. 68-78 ◽  
Author(s):  
Sang Heon Lee ◽  
Adel Malallah ◽  
Akhil Datta-Gupta ◽  
David Higdon

Summary We propose a hierarchical approach to spatial modeling based on Markov Random Fields (MRF) and multiresolution algorithms in image analysis. Unlike their geostatistical counterparts, which simultaneously specify distributions across the entire field, MRFs are based on a collection of full conditional distributions that rely on the local neighborhoods of each element. This critical focus on local specification provides several advantages:MRFs are computationally tractable and are ideally suited to simulation based computation, such as Markov Chain Monte Carlo (MCMC) methods, andmodel extensions to account for nonstationarity, discontinuity, and varying spatial properties at various scales of resolution are easily accessible in the MRF framework. Our proposed method is computationally efficient and well suited to reconstruct fine-scale spatial fields from coarser, multiscale samples (based on seismic and production data) and sparse fine-scale conditioning data (e.g., well data). It is easy to implement, and it can account for the complex, nonlinear interactions between different scales, as well as the precision of the data at various scales, in a consistent fashion. We illustrate our method with a variety of examples that demonstrate the power and versatility of the proposed approach. Finally, a comparison with Sequential Gaussian Simulation with Block Kriging (SGSBK) indicates similar performance with less restrictive assumptions. Introduction A persistent problem in petroleum reservoir characterization is to build a model for flow simulations based on incomplete information. Because of the limited spatial information, any conceptual reservoir model used to describe heterogeneities will, necessarily, have large uncertainty. Such uncertainties can be significantly reduced by integrating multiple data sources into the reservoir model.1 In general, we have hard data, such as well logs and cores, and soft data, such as seismic traces, production history, conceptual depositional models, and regional geological analyses. Integrating information from this wide variety of sources into the reservoir model is not a trivial task. This is because different data sources scan different length scales of heterogeneity and can have different degrees of precision.2 Reconciling multiscale data for spatial modeling of reservoir properties is important because different data types provide different information about the reservoir architecture and heterogeneity. It is essential that reservoir models preserve small-scale property variations observed in well logs and core measurements and capture the large-scale structure and continuity observed in global measures such as seismic and production data. A hierarchical model is particularly well suited to address the multiscaled nature of spatial fields, match available data at various levels of resolution, and account for uncertainties inherent in the information.1–3 Several methods to combine multiscale data have been introduced in the literature, with a primary focus on integrating seismic and well data.3–9 These include conventional techniques such as cokriging and its variations,3–6 SGSBK,7 and Bayesian updating of point kriging.8,9 Most kriging-based methods are restricted to multi-Gaussian and stationary random fields.3–9 Therefore, they require data transformation and variogram construction. In practice, variogram modeling with a limited data set can be difficult and strongly user-dependent. Improper variograms can lead to errors and inaccuracies in the estimation. Thus, one might also need to consider the uncertainty in variogram models during estimation. 10 However, conventional geostatistical methods do not provide an effective framework to account for the uncertainty of the variogram. Furthermore, most of the multiscale integration algorithms assume a linear relationship between the scales. The objective of this paper is to introduce a novel multiscale data-integration technique that provides a flexible and sound mathematical framework to overcome some of the limitations of conventional geostatistical techniques. Our approach is based on multiscale MRFs11–14 that can effectively integrate multiple data sources into high-resolution reservoir models for reliable reservoir forecasting. This proposed approach is also ideally suited to simulation- based computations, such as MCMC.15,16 Methodology Our problem of interest is to generate fine-scale random fields based on sparse fine-scale samples and coarse-scale data. Such situations arise when we have limited point measurements, such as well data, and coarse-scale information based on seismic and/or production data. Our proposed method is a Bayesian approach to spatial modeling based on MRF and multiresolution algorithms in image analysis. Broadly, the method consists of two major parts:construction of a posterior distribution for multiscale data integration using a hierarchical model andimplementing MCMC to explore the posterior distribution. Construction of a Posterior Distribution for Multiscale Data Integration. A multiresolution MRF provides an efficient framework to integrate different scales of data hierarchically, provided that the coarse-scale resolution is dependent on the next finescale resolution.11 In general, a hierarchical conditional model over scales 1,. . ., N (from fine to coarse) can be expressed in terms of the product of conditional distributions,Equation 1 where p(xn), n=1, . . ., N, are MRF models at each scale, and the terms p(xn|xn-1) express the statistical interactions between different scales. This approach links the various scales stochastically in a direct Bayesian hierarchical modeling framework (Fig. 1). Knowing the fine-scale field xn does not completely determine the field at a coarser scale xn+1, but depending on the extent of the dependence structure modeled and estimated, it influences the distribution at the coarser scales to a greater or lesser extent. This enables us to address multiscale problems accounting for the scale and precision of the data at various levels. For clarity of exposition, a hierarchical model for reconciling two different scales of data will be considered below.Equation 2 From this equation, the posterior distribution of the fine-scale random field indexed by 1 given a coarse-scale random field indexed by 2 can be derived as follows.

2013 ◽  
Vol 2013 ◽  
pp. 1-9 ◽  
Author(s):  
No-Wook Park

A geostatistical downscaling scheme is presented and can generate fine scale precipitation information from coarse scale Tropical Rainfall Measuring Mission (TRMM) data by incorporating auxiliary fine scale environmental variables. Within the geostatistical framework, the TRMM precipitation data are first decomposed into trend and residual components. Quantitative relationships between coarse scale TRMM data and environmental variables are then estimated via regression analysis and used to derive trend components at a fine scale. Next, the residual components, which are the differences between the trend components and the original TRMM data, are then downscaled at a target fine scale via area-to-point kriging. The trend and residual components are finally added to generate fine scale precipitation estimates. Stochastic simulation is also applied to the residual components in order to generate multiple alternative realizations and to compute uncertainty measures. From an experiment using a digital elevation model (DEM) and normalized difference vegetation index (NDVI), the geostatistical downscaling scheme generated the downscaling results that reflected detailed characteristics with better predictive performance, when compared with downscaling without the environmental variables. Multiple realizations and uncertainty measures from simulation also provided useful information for interpretations and further environmental modeling.


2011 ◽  
Vol 9 (1) ◽  
pp. 180-204 ◽  
Author(s):  
Zhaoqin Huang ◽  
Jun Yao ◽  
Yajun Li ◽  
Chenchen Wang ◽  
Xinrui Lv

AbstractA numerical procedure for the evaluation of equivalent permeability tensor for fractured vuggy porous media is presented. At first we proposed a new conceptual model, i.e., discrete fracture-vug network model, to model the realistic fluid flow in fractured vuggy porous medium on fine scale. This new model consists of three systems: rock matrix system, fractures system, and vugs system. The fractures and vugs are embedded in porous rock, and the isolated vugs could be connected via discrete fracture network. The flow in porous rock and fractures follows Darcy’s law, and the vugs system is free fluid region. Based on two-scale homogenization theory, we obtained an equivalent macroscopic Darcy’s law on coarse scale from fine-scale discrete fracture-vug network model. A finite element numerical formulation for homogenization equations is developed. The method is verified through application to a periodic model problem and then is applied to the calculation of equivalent permeability tensor of porous media with complex fracture-vug networks. The applicability and validity of the method for these more general fractured vuggy systems are assessed through a simple test of the coarse-scale model.


2021 ◽  
Author(s):  
Ekaterina Chuprikova ◽  
Abraham Mejia Aguilar ◽  
Roberto Monsorno

<p>Increasing agricultural production challenges, such as climate change, environmental concerns, energy demands, and growing expectations from consumers triggered the necessity for innovation using data-driven approaches such as visual analytics. Although the visual analytics concept was introduced more than a decade ago, the latest developments in the data mining capacities made it possible to fully exploit the potential of this approach and gain insights into high complexity datasets (multi-source, multi-scale, and different stages). The current study focuses on developing prototypical visual analytics for an apple variety testing program in South Tyrol, Italy. Thus, the work aims (1) to establish a visual analytics interface enabled to integrate and harmonize information about apple variety testing and its interaction with climate by designing a semantic model; and (2) to create a single visual analytics user interface that can turn the data into knowledge for domain experts. </p><p>This study extends the visual analytics approach with a structural way of data organization (ontologies), data mining, and visualization techniques to retrieve knowledge from an extensive collection of apple variety testing program and environmental data. The prototype stands on three main components: ontology, data analysis, and data visualization. Ontologies provide a representation of expert knowledge and create standard concepts for data integration, opening the possibility to share the knowledge using a unified terminology and allowing for inference. Building upon relevant semantic models (e.g., agri-food experiment ontology, plant trait ontology, GeoSPARQL), we propose to extend them based on the apple variety testing and climate data. Data integration and harmonization through developing an ontology-based model provides a framework for integrating relevant concepts and relationships between them, data sources from different repositories, and defining a precise specification for the knowledge retrieval. Besides, as the variety testing is performed on different locations, the geospatial component can enrich the analysis with spatial properties. Furthermore, the visual narratives designed within this study will give a better-integrated view of data entities' relations and the meaningful patterns and clustering based on semantic concepts.</p><p>Therefore, the proposed approach is designed to improve decision-making about variety management through an interactive visual analytics system that can answer "what" and "why" about fruit-growing activities. Thus, the prototype has the potential to go beyond the traditional ways of organizing data by creating an advanced information system enabled to manage heterogeneous data sources and to provide a framework for more collaborative scientific data analysis. This study unites various interdisciplinary aspects and, in particular: Big Data analytics in the agricultural sector and visual methods; thus, the findings will contribute to the EU priority program in digital transformation in the European agricultural sector.</p><p>This project has received funding from the European Union's Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 894215.</p>


2018 ◽  
Author(s):  
Larysse Silva ◽  
José Alex Lima ◽  
Nélio Cacho ◽  
Eiji Adachi ◽  
Frederico Lopes ◽  
...  

A notable characteristic of smart cities is the increase in the amount of available data generated by several devices and computational systems, thus augmenting the challenges related to the development of software that involves the integration of larges volumes of data. In this context, this paper presents a literature review aimed to identify the main strategies used in the development of solutions for data integration, relationship, and representation in smart cities. This study systematically selected and analyzed eleven studies published from 2015 to 2017. The achieved results reveal gaps regarding solutions for the continuous integration of heterogeneous data sources towards supporting application development and decision-making.


Sign in / Sign up

Export Citation Format

Share Document