A Refined Canonical Data Model for Multi-schema Integration and Mapping

Author(s):  
Michael Dietrich ◽  
Jens Lemcke
Author(s):  
Z. M. Ma ◽  
W. J. Zhang ◽  
Q. Li

Abstract Virtual enterprise is typically one kind of information-based enterprise. The features of information system in virtual enterprise can be generalized as heterogeneous and distributed. Its organization and production management put an essential requirement on information integration. In this paper, we shall discuss the forms of conflict in schema integration of multiple databases in virtual enterprise, and give an approach to resolve the conflicts. In order to implement this approach in relational database, an extended relational data model is proposed and the notion of marked partial value is introduced. Based on this extended data model, some relational operations for data management are defined. A case study is discussed about query processing on relational database with partial values.


Author(s):  
Louise Lane ◽  
Kalpdrum Passi ◽  
Sanjay Madria ◽  
Mukesh Mohania

XML has become the de facto standard for Information Exchange protocol for e-commerce and many workgroup applications such as Enterprise Resource Planning (ERP). The availability of large amounts of heterogeneous distributed web data necessitates the integration of XML data from multiple XML sources for many reasons. Currently, there are many e-commerce companies, which sell similar products but represent them using different XML schemas with possibly different ontologies. When any two such companies merge, there is a need for a uniform schema integration methodology. In some applications like comparison-shopping, there is a need for an illusionary centralized homogeneous information system. In this chapter, we propose an XML Schema integration methodology. We define an object-oriented data model called XSDM (XML Schema Data Model) and present a graphical representation of XML Schema for the purpose of schema integration. We use a three-layered architecture for XML Schema integration, with each layer presenting an integrated view of the concepts that characterize the layer below. The three layers included are namely pre-integration, comparison and integration. During pre-integration, an analysis of the schemas to be integrated occurs. During the comparison phase of integration, correspondences as well as conflicts between elements are identified. During the integration phase, restructuring and merging of the initial schemas takes place to obtain the global schema. We define integration policies for integrating element definitions as well as their data types and attributes. The policies are also applicable in integrating DTD schemas with other DTD/XML Schemas.


Author(s):  
Michael Dietrich ◽  
Jens Lemcke

Current approaches in schema mapping and matching focus on pair-wise comparison of schemas. This chapter gives an overview of how n-way comparison of schemas via a unified data model for large-scale multi-schema integration (ULMI) can benefit to schema matching and mapping processes. The approach integrates a set of input schemas into one comprehensive representation. Thus, a unified data model is constructed. The unified data model represents the closure of all integrated schemas. However, as the unified data model is too complex and too large, it is never revealed to the user. Therefore, the authors derive a canonical data model which represents the most common structure of all schemas. In a use case, the advantages of the canonical data model are demonstrated. Finally, challenges for further research are derived. This work is based on excerpts from realistic input schemas, and it provides a concrete, ideal canonical data model as a reference for further research.


Author(s):  
Michael Dietrich ◽  
Jens Lemcke ◽  
Gunther Stuhec

Nowadays, B2B integration still remains a big cost driver for companies. On the one hand, standardization efforts were able to reduce the mapping effort between e-Business schemas. However, the effort for creating customized messages from the huge and underspecified standard templates increased. Due to the myriad of different requirements by different companies, a great variety of standards coexist. Instead of forcing companies to adopt huge standards, this article propagates an iteratively improving schema and mapping derivation system in the cloud. Thus, we provide flexibility, but streamline companies' integration efforts based on an evolving canonical data model. This approach reduces the need for explicit standardization to a minimum. Our simulation based on real schemas shows a potential to reduce guide creation effort by 50% and mapping effort from 6% to almost 100%.


2008 ◽  
Author(s):  
Pedro J. M. Passos ◽  
Duarte Araujo ◽  
Keith Davids ◽  
Ana Diniz ◽  
Luis Gouveia ◽  
...  

2019 ◽  
Vol 13 (1-2) ◽  
pp. 95-115
Author(s):  
Brandon Plewe

Historical place databases can be an invaluable tool for capturing the rich meaning of past places. However, this richness presents obstacles to success: the daunting need to simultaneously represent complex information such as temporal change, uncertainty, relationships, and thorough sourcing has been an obstacle to historical GIS in the past. The Qualified Assertion Model developed in this paper can represent a variety of historical complexities using a single, simple, flexible data model based on a) documenting assertions of the past world rather than claiming to know the exact truth, and b) qualifying the scope, provenance, quality, and syntactics of those assertions. This model was successfully implemented in a production-strength historical gazetteer of religious congregations, demonstrating its effectiveness and some challenges.


MIS Quarterly ◽  
2013 ◽  
Vol 37 (1) ◽  
pp. 125-147 ◽  
Author(s):  
Rui Chen ◽  
◽  
Raj Sharman ◽  
H. Raghav Rao ◽  
Shambhu J. Upadhyaya ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document