A Scalable Approach To Detect A Duplicate Data Using PSNM And PB Algorithm

Abstract

Now a day if we consider a data set we can find more duplicate data. Determining the redundant data in the data server is an open research in the data intensive application. The traditional method detects the duplicate for large dataset takes a large amount of time to produce the result. I proposed an algorithm (PSNM and PB) such that they maximize the gain of the overall process within the time available by reporting most results much earlier than traditional approaches. The contribution of the work gets improved by implementing both the algorithms in parallel process to effectively compute the duplication record in efficient time. The algorithm dynamically adjusts their behavior by automatically choosing optimal parameters, e.g., window sizes, block sizes, and sorting keys. The Experimental results prove that proposed system outperforms the state of arts approaches accuracy and efficiency.

Authors and Affiliations

P. Padmavathi, Mr. S. Dhanasekaran, Mr. A. Arockia Selvaraj

Keywords

Related Articles

Nonlinear Pushover Analysis for Performance Based Engineering Design – A Review

Engineering Structures are designed to withstand all the anticipated loads without failure. Even after utmost care in assessing the expected loads as per the Codal provisions and field experience, modeling random loads...

Performance of Fiber Reinforced Concrete from Recycled Pet Plastic Waste- A Study Review

Concrete is a composite material consisting of various ingredients such as cement, coarse aggregate, fine aggregate and has done wonders in the construction industry. The recent use of the concrete has constrained many...

RSVP Protocol Used in Real Time Application Networks

RSVP is a receiver oriented reservation protocol being an Internet standard approved by Internet Engineering Task Force [IETF].The goal of the Resource Reservation Protocol (RSVP) is to establish Quality of Service info...

This Novel Realize New Electronic Capsule

This work will speak to the confront to smooth the progress of the development of a high capacity radio system for a small, miniaturized electronic pill device that can be saleable or implantable in human body in order...

slugLiterature survey of network reconstruction, reconfiguration & QOS optimization approach in case of link failure in existing SSA protocol in Mobile Ad-Hoc Network

In this research paper the work done by the earlier researchers is discussed, literature related to the work is collected to find the direction of research work. To start the work author has studied more than 22 researc...

Download PDF file
  • EP ID EP22138
  • DOI -
  • Views 214
  • Downloads 3

How To Cite

P. Padmavathi, Mr. S. Dhanasekaran, Mr. A. Arockia Selvaraj (2016). A Scalable Approach To Detect A Duplicate Data Using PSNM And PB Algorithm. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 4(5), -. https://europub.co.uk/articles/-A-22138