Database Conceptual Schema Matching

Schema merging is a process of integrating multiple data sources into a GCS (Global Conceptual Schema). It is pivotal to various application domains, like data ware housing and multi-databases. Schema merging requires the identification of corresponding elements, which is done through schema matching process. In this process, corresponding elements across multiple data sources are identified after the comparison of these data sources with each other. In this way, for a given set of data sources and the correspondence between them, different possibilities for creating GCS can be achieved. In applications like multi-databases and data warehousing, new data sources keep joining in and GCS relations are usually expanded horizontally or vertically. Schema merging approaches usually expand GCS relations horizontally or vertically as new data sources join in. As a result of such expansions, an unbalanced GCS is created which either produces too much NULL values in response to global queries or a result of too many Joins causes poor query processing. In this paper, a novel approach, TuSMe (Tuned Schema Merging) technique is introduced to overcome the above mentioned issue via developing a balanced GCS, which will be able to control both vertical and horizontal expansion of GCS relations. The approach employs a weighting mechanism in which the weights are assigned to individual attributes of GCS. These weights reflect the connectedness of GCS attributes in accordance with the attributes of the principle data sources. Moreover, the overall strength of the GCS could be scrutinized by combining these weights. A prototype implementation of TuSMe shows significant improvement against other contemporary state-of-the-art approaches.

Download Full-text

A schema matching method for the semantic integration of spatial data

Management Innovation and Information Technology ◽

10.2495/miit130701 ◽

2014 ◽

Author(s):

Qiang Wang ◽

Huarui Wu ◽

Huaji Zhu

Keyword(s):

Spatial Data ◽

Semantic Integration ◽

Schema Matching ◽

Matching Method

Download Full-text

Ontology-Aided Automatic Schema Matching

Journal of Software ◽

10.3724/sp.j.1001.2009.00234 ◽

2009 ◽

Vol 20 (2) ◽

pp. 234-245 ◽

Cited By ~ 1

Author(s):

Qiang LIU ◽

Di ZHAO ◽

Hua ZHONG ◽

Tao HUANG

Keyword(s):

Schema Matching

Download Full-text

Deep Web query interface schema matching based on matching degree and semantic similarity

Journal of Computer Applications ◽

10.3724/sp.j.1087.2012.01688 ◽

2013 ◽

Vol 32 (6) ◽

pp. 1688-1691

Author(s):

Yong FENG ◽

Yang ZHANG

Keyword(s):

Semantic Similarity ◽

Deep Web ◽

Schema Matching ◽

Query Interface ◽

Matching Degree

Download Full-text

Multivariate consumption profiling (MCP) for intelligent meter systems: a methodology to define categories and levels

Water Science & Technology Water Supply ◽

10.2166/ws.2010.374 ◽

2010 ◽

Vol 10 (5) ◽

pp. 710-720 ◽

Cited By ~ 2

Author(s):

J. L. Solanas ◽

M. R. Cussó

Keyword(s):

Cluster Analysis ◽

Factor Analysis ◽

Multivariate Analysis ◽

Discriminant Analysis ◽

Reference Population ◽

Statistical Analyses ◽

Conceptual Schema ◽

Data Variability ◽

Water Companies ◽

Analyse Data

Multivariate Consumption Profiling (MCP) is a methodology to analyse the readings made by Intelligent Meter (IM) systems. Even in advanced water companies with well supported IM, full statistical analyses are not performed, since no efficient methods are available to deal with all the data items. Multivariate Analysis has been proposed as a convenient way to synthesise all IM information. MCP uses Factor Analysis, Cluster Analysis and Discriminant Analysis to analyse data variability by categories and levels, in a cyclical improvement process. MCP obtains a conceptual schema of a reference population on a set of classifying tables, one for each category. These tables are quantitative concepts to evaluate consumption, meter sizing, leakage and undermetering for populations and groupings and individual cases. They give structuring items to enhance “traditional” statistics. All the relevant data from each new meter reading can be matched to the classifying tables. A set of indexes is computed and thresholds are used to select those cases with the desired profiles. The paper gives an example of a MCP conceptual schema for five categories, three variables, and five levels, and obtains its classifying tables. It shows the use of case profiles to implement actions in accordance with the operative objectives.

Download Full-text