Detection of Similar Identities in XML Documents

Abstract

Duplicate detection is an important part of data cleaning; it is the process of detecting multiple representations of a same real-world object in the data sources. Numbers of solutions are available for detecting duplicates in XML data. One of the novel methods for XML duplicate detection is XMLDup. XMLDup makes use of a Bayesian network to evaluate the probability of two XML elements are duplicates. In addition a network pruning strategy is also used for improving the evaluation of the Bayesian network. A DOM tree construction algorithm for constructing the tree of the input XML data is proposed. It is seen that by using DOM tree construction algorithm higher efficiency is achieved for detection of similar identities in XML Documents.

Authors and Affiliations

Miss Amita Fulsundar, Dr. K. V. Metre

Keywords

Related Articles

Unlocking Athletic Potential The Athle-E-Team Software Solution

The "Athle-E-Team" project is an innovative collaborative sports platform designed to revolutionize the way athletes connect and engage in sports within their local communities. With a focus on enhancing the overall spor...

A Comparative Analysis of Computer Literacy in Rural And Urban Schools of Pune Region

Information and communication technology (ICT) has become modernized mediator in all aspects of life. Now a day’s Information and Technology (ICT) plays an important role to improve the quality of education. In This Rese...

A Design of Hybrid Energy Storage System for Electric Vehicles

Recently, Electronic Vehicles (EVs) have been attracted substantial responsiveness and so did the advance in battery equipment. Although the battery technology has been significantly advanced, the available batteries do...

A Study of Goodness –of- Fit Tests for Some Discrete Probability Distribution

This paper presents the goodness of fit (GOF) tests for several discrete distributions viz., Poisson, Generalized Poisson and Negative binomial distribution. Parameter estimation is performed and goodness of fit test for...

Detection of Human Behaviour by Object Recognition using Deep Learning: A Review

The major drawback of the society is the falsehood of human nature; so prediction of nature of the individual by analysing the video or image of that person is highly necessary. From the post World War II period due to t...

Download PDF file
  • EP ID EP748811
  • DOI -
  • Views 25
  • Downloads 0

How To Cite

Miss Amita Fulsundar, Dr. K. V. Metre (2015). Detection of Similar Identities in XML Documents. International Journal of Innovative Research in Computer Science and Technology, 3(3), -. https://europub.co.uk/articles/-A-748811