Detection of Similar Identities in XML Documents

Abstract

Duplicate detection is an important part of data cleaning; it is the process of detecting multiple representations of a same real-world object in the data sources. Numbers of solutions are available for detecting duplicates in XML data. One of the novel methods for XML duplicate detection is XMLDup. XMLDup makes use of a Bayesian network to evaluate the probability of two XML elements are duplicates. In addition a network pruning strategy is also used for improving the evaluation of the Bayesian network. A DOM tree construction algorithm for constructing the tree of the input XML data is proposed. It is seen that by using DOM tree construction algorithm higher efficiency is achieved for detection of similar identities in XML Documents.

Authors and Affiliations

Miss Amita Fulsundar, Dr. K. V. Metre

Keywords

Related Articles

A Comparative Analysis of CNN, RCNN & Faster RCNN Object Detection Algorithm for CAPTCHA Breaking

CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) systems serve as a crucial defense mechanism against automated attacks by distinguishing between human users and bots. However, advance...

Identification of the Barriers of Lean Construction Implementation in Construction Projects- A Review

In the current era of industrial and enterprise evolution, time and resource are the fundamental need of any industry. Utilizing the time and resource has always been the primary goal of especially a construction project...

A Study of Several Water Purification Techniques

Current article provides an overview of current water purification, filtration methods, & technologies. water purification is primarily addressed for a sensitive reason: it is one of most important sources of survival f...

Security In Inter Cloud Data Transfer

Cloud computing has quickly become one of the most Networking software in the IT world due to its revolutionary model of computing as a utility. It promises increased flexibility, scalability, and reliability, while prom...

Smart Grid Application Useing Iot

Smart cities are a natural extension of the sensible grid concept, and their implementation is inextricably linked to legacy power system transformation. Clients can utilize brilliant matrix innovations to plan loads at...

Download PDF file
  • EP ID EP748811
  • DOI -
  • Views 30
  • Downloads 0

How To Cite

Miss Amita Fulsundar, Dr. K. V. Metre (2015). Detection of Similar Identities in XML Documents. International Journal of Innovative Research in Computer Science and Technology, 3(3), -. https://europub.co.uk/articles/-A-748811