Defining and Evaluating Patient-Empowered Approaches to Improving Record Matching

2018 ◽  
Author(s):  
Robert Rudin ◽  
Richard Hillestad ◽  
M. Ridgely ◽  
Nabeel Qureshi ◽  
John Davis ◽  
...  
Keyword(s):  
Author(s):  
Arvind Arasu ◽  
Josep Domingo-Ferrer
Keyword(s):  

2018 ◽  
pp. 3129-3135
Author(s):  
Arvind Arasu ◽  
Josep Domingo-Ferrer
Keyword(s):  

Author(s):  
V. ALEKHYA ◽  
DS.BHUPAL NAIK

Record matching refers to the task of finding entries that refer to the same entity in two or more files, is a vital process in data integration. Most of the record matching methods are supervised, which requires the user to provide training data. These methods are not applicable for web database scenario, where query results dynamically generated on-the- fly. To address the problem of record matching in the Web database scenario, we present an unsupervised, online record matching method, UDD, which effectively identifies the duplicates from query result records of multiple web databases. First, same source duplicates are eliminated by using exact matching method the ―presumed‖ non duplicate records from the same source can be used as training examples . Starting from the non duplicate set, we use two cooperating classifiers a weight component similarity summing classifier and an SVM classifier, to iteratively identify duplicates in the query results from multiple Web databases.


Sign in / Sign up

Export Citation Format

Share Document