GSMA: A Structural Matching Algorithm for Schema Matching in Data Warehousing

XML SCHEMA MATCHING

International Journal of Software Engineering and Knowledge Engineering ◽

10.1142/s0218194007003446 ◽

2007 ◽

Vol 17 (05) ◽

pp. 575-597 ◽

Cited By ~ 2

Author(s):

JIANGUO LU ◽

JU WANG ◽

SHENGRUI WANG

Keyword(s):

Data Integration ◽

Software Reuse ◽

Edit Distance ◽

Xml Schema ◽

Schema Matching ◽

Schema Evolution ◽

Tree Edit Distance ◽

Matching Problem ◽

Matching Algorithm ◽

Unordered Trees

XML Schema matching problem can be formulated as follows: given two XML Schemas, find the best mapping between the elements and attributes of the schemas, and the overall similarity between them. XML Schema matching is an important problem in data integration, schema evolution, and software reuse. This paper describes a matching system that can find accurate matches and scales to large XML Schemas with hundreds of nodes. In our system, XML Schemas are modeled as labeled and unordered trees, and the schema matching problem is turned into a tree matching problem. We proposed Approximate Common Structures in trees, and developed a tree matching algorithm based on this concept. Compared with the traditional tree edit-distance algorithm and other schema matching systems, our algorithm is faster and more suitable for large XML Schema matching.

Download Full-text

An improved schema matching algorithm of opaque database schemas

2011 International Conference on Multimedia Technology ◽

10.1109/icmt.2011.6002333 ◽

2011 ◽

Cited By ~ 1

Author(s):

Wei Chen ◽

Fang Zhang ◽

Maosen Wen

Keyword(s):

Schema Matching ◽

Matching Algorithm

Download Full-text

An Effective Content-Based Schema Matching Algorithm

2008 International Seminar on Future Information Technology and Management Engineering ◽

10.1109/fitme.2008.38 ◽

2008 ◽

Cited By ~ 3

Author(s):

Yuan Yang ◽

Mengdong Chen ◽

Bin Gao

Keyword(s):

Schema Matching ◽

Matching Algorithm

Download Full-text

Automatic schema matching for data warehousing

Fifth World Congress on Intelligent Control and Automation (IEEE Cat. No.04EX788) ◽

10.1109/wcica.2004.1342235 ◽

2004 ◽

Author(s):

Lingmei Li ◽

Lan Yang

Keyword(s):

Data Warehousing ◽

Schema Matching

Download Full-text

Schema Matching Quality: Thesaurus as the Matcher

Jurnal Teknologi ◽

10.11113/jt.v70.3514 ◽

2014 ◽

Vol 70 (5) ◽

Author(s):

Thabit Sabbah ◽

Ali Selamat

Keyword(s):

Information Retrieval ◽

Data Integration ◽

Query Processing ◽

Data Warehousing ◽

Schema Matching ◽

Semantic Query ◽

Integration Data ◽

F Measure

Thesaurus is used in many Information Retrieval (IR) applications such as data integration, data warehousing, semantic query processing and classifiers. It was also utilized to solve the problem of schema matching. Considering the fact of existence of many thesauri for a certain area of knowledge, the quality of schema matching results when using different thesauri in the same field is not predictable. In this paper, we propose a methodology to study the performance of the thesaurus in solving schema matching. The paper also presents results of experiments using different thesauri. Precision, recall, F-measure, and similarity average were calculated to show that the quality of matching changed according to the used thesaurus.

Download Full-text

A Flexible and Composite Schema Matching Algorithm

On the Move to Meaningful Internet Systems 2004: CoopIS, DOA, and ODBASE - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30468-5_6 ◽

2004 ◽

pp. 55-65

Author(s):

Shoujian Yu ◽

Zhongming Han ◽

Jiajin Le

Keyword(s):

Schema Matching ◽

Matching Algorithm

Download Full-text

A LEXICAL DECISION TREE SCHEME FOR SUPPORTING SCHEMA MATCHING

International Journal of Information Technology & Decision Making ◽

10.1142/s0219622011004439 ◽

2011 ◽

Vol 10 (03) ◽

pp. 519-537 ◽

Cited By ~ 7

Author(s):

BEEN-CHIAN CHIEN ◽

SHIANG-YI HE

Keyword(s):

Decision Tree ◽

Lexical Decision ◽

Structural Similarity ◽

Schema Matching ◽

Similarity Matching ◽

Matching Method ◽

Matching Algorithm ◽

Essential Step ◽

Structure Similarity ◽

Two Phases

To manipulate semantic web and integrate different data sources efficiently, automatic schema matching plays a key role. A generic schema matching method generally includes two phases: the linguistic similarity matching phase and the structural similarity matching phase. Since linguistic matching is an essential step for effective schema matching, developing a high accurate linguistic similarity matching scheme is required. In this paper, a schema matching approach called Similarity Yield Matcher (SYM) is proposed. In SYM, a lexical decision tree is presented to determine the linguistic similarity matching of the first phase. A structural matching algorithm is then proposed to find the structure similarity between two tree schemas. The proposed schema matching approach was evaluated by testing on several benchmarks of real schemas and comparing with other methods. The experimental results show that the proposed lexical decision tree substantially improves the linguistic similarity matching effectively and efficiently. The proposed SYM algorithm also performs high effectiveness on 1–1 schema matching.

Download Full-text