Clustering Tree based Implementation of Record Linkage on Many-to-Many Relation

Journal Title: UNKNOWN - Year 2015, Vol 4, Issue 3

Abstract

Record linkage or entity resolution are emerging strategy to avoid duplication and other purposes. Recommender domain uses the linkage method to provide efficient results in terms of accuracy. This paper introduces a new Many-to-Many Record Linkage (MMRL) algorithm which links records from one table with a set of records from another table. MMRL algorithm is based on clustering tree which forms the group on each table separately that to be linked. Hierarchical structure such as tree is suitable to understand and execute the linkage process. Intermediate nodes are having less similarity value than end nodes. Each node of the clustering tree contains a cluster instead of a single classification. Prediction accuracy depends on the end node. Jaccard similarity and metaphone similarity are used as distance measures. Prediction result shows whether the records are matched or not. This result proves the efficiency of MMRL algorithm. A data set from movie recommender domain was evaluated for this paper. This MMRL algorithm gives better performance and results.

Authors and Affiliations

Keywords

Related Articles

An Efficient Network Intrusion Based on Decision Tree Classifier & K-Mean Clustering using Dimensionality Reduction

Abstract: As the internet size grows rapidly so that the attacks on network. There is a need of intrusion detection system (IDS) but large and increasing size of network creates huge computational values which can be a p...

Mitigation of Global Warming Through Biological Carbon Sequestration Using Micro Algae

The world has been threatened due to the ongoing global warming and climate change due to the bulk discharge of anthropogenic green house gases into the atmosphere through the combustion of fossil fuels, automobile exhau...

Developing Firefox add-on for DOM vulnerability Assessment

Cross-site scripting (XSS) is a type of security vulnerability typically found in web applications which allows the attackers to inject malicious script into web pages/servers. XSS is the main cause of DOM attack. This a...

Development and Supplementation of Fibre Enriched Formulated Supplementary Mixture on Type 2 Diabetes Mellitus

"Abstract Diabetes mellitus (DM) is a metabolic disorder resulting from a defect in insulin secretion, insulin action, or both. Insulin deficiency in turn leads to chronic hyperglycaemia with disturbances of carbohydrate...

Factors Influencing Adoption of Woodfuel Energy Saving Technologies in Nakuru County, Kenya

There have been efforts to promote use of woodfuel conservation technologies. These technologies include the improved charcoal stoves, the improved fuelwood stoves and the fireless cookers that can save woodfuel of upto...

Download PDF file
  • EP ID EP357676
  • DOI -
  • Views 126
  • Downloads 0

How To Cite

(2015). Clustering Tree based Implementation of Record Linkage on Many-to-Many Relation. UNKNOWN, 4(3), -. https://europub.co.uk/articles/-A-357676