Detection of Similar Identities in XML Documents

Abstract

Duplicate detection is an important part of data cleaning: it is the process of detecting multiple representations of the same real-world object in data sources. A number of solutions are available for detecting duplicates in XML data. One novel method for XML duplicate detection is XMLDup, which uses a Bayesian network to evaluate the probability that two XML elements are duplicates. In addition, a network pruning strategy is used to improve the efficiency of evaluating the Bayesian network. This paper proposes a DOM tree construction algorithm for building the tree of the input XML data. Using the DOM tree construction algorithm yields higher efficiency in detecting similar identities in XML documents.
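
As a rough illustration of the idea only, the following Python sketch parses two XML fragments into DOM trees and combines per-field string similarities into a single score, stopping early once the score can no longer reach a threshold. It is not the XMLDup algorithm or the paper's DOM tree construction algorithm: the function names `leaf_values` and `duplicate_score`, the use of `difflib` string similarity, the product combination rule, and the `threshold` parameter are all assumptions made for this example.

```python
from difflib import SequenceMatcher
from xml.dom import minidom


def leaf_values(xml_text):
    """Parse XML into a DOM tree and map each leaf tag name to its text values."""
    root = minidom.parseString(xml_text).documentElement
    values = {}

    def walk(node):
        children = [c for c in node.childNodes if c.nodeType == c.ELEMENT_NODE]
        if not children:
            # Leaf element: collect its text, normalised for comparison.
            text = "".join(
                c.data for c in node.childNodes if c.nodeType == c.TEXT_NODE
            )
            values.setdefault(node.tagName, []).append(text.strip().lower())
        for child in children:
            walk(child)

    walk(root)
    return values


def duplicate_score(xml_a, xml_b, threshold=0.5):
    """Hypothetical score: product of the best string similarity per shared tag.

    Every factor is at most 1, so the running product can only shrink.
    Once it drops below the threshold the remaining tags are skipped,
    which loosely mirrors the role of pruning (an assumption, not XMLDup).
    """
    a, b = leaf_values(xml_a), leaf_values(xml_b)
    shared = sorted(set(a) & set(b))
    if not shared:
        return 0.0
    score = 1.0
    for tag in shared:
        best = max(
            SequenceMatcher(None, x, y).ratio() for x in a[tag] for y in b[tag]
        )
        score *= best
        if score < threshold:
            return score  # cannot recover, stop comparing further tags
    return score
```

For example, `duplicate_score(doc, doc)` returns 1.0 for any well-formed `doc`, since every shared field matches itself exactly. A real Bayesian network would instead attach conditional probabilities to each node of the element hierarchy; the product rule here is only a stand-in.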

Authors and Affiliations

Miss Amita Fulsundar, Dr. K. V. Metre

How To Cite

Miss Amita Fulsundar, Dr. K. V. Metre (2015). Detection of Similar Identities in XML Documents. International Journal of Innovative Research in Computer Science and Technology, 3(3), -. https://europub.co.uk/articles/-A-748811