An Efficient Approach towards Duplicate Detection System

Abstract

Information on the web is very huge in size and the tasks of search engines have become more and more complex as a single entity on the web have two or more representations in databases. The duplicate detection is the process of identifying the entities who has multiple representation of the same real world entity, as the duplicate detection methods has to process large datasets, the identification of duplicate document in a large database is a issue significantly with wide-spread applications. In this paper a review on various approaches of duplicate detection will be presented. Proposed system will compare two Duplication detection methods, the first is based on two novel progressive duplicate detection algorithms that significantly increase the efficiency of finding duplicates if the execution time is limited. The second is based on Secure Hashing Algorithm which will detect and delete duplicate data, the secure hash algorithm will perform data de-duplication task in order to overcome the issues of time and to reduce hash collision.

Authors and Affiliations

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe

Keywords

Related Articles

Bidirectional DC-DC Converter with MPPT Controller Using PV Array

This paper proposes simulation of Bidirectional DC - DC converter with Maximum Power Point Tracking using Photo Voltaic cell with motor load in boost and buck mode respectively. The proposed two level converters have th...

Review on Experimental Analysis of Heat Transfer in Shell and Twisted Tube Heat Exchanger

In recent years, the high cost of energy and material has resulted in an increased effort aimed at producing more efficient heat exchange equipment. The heat transfer rate can be improved by introducing a disturbance in...

Carbon Nanotubes and Its Applications: A Review

Carbon nanotubes also known as CNTs are allotropes of carbon with a tubular nanostructure, having diameters ranging from less than 1 nanometre (nm) up to 50 nm. CNTs have extraordinary electrical, mechanical, optical, t...

Dynamic Analysis of Multistorey Building using Response Spectrum Method and Seismic Coefficient Method – A Comparison

Earthquakes are very disastrous and cause a great harm to living life, material life and buildings. Hence proper dynamic analysis for building having earthquake threat is needed. This will ensure proper designs resultin...

Design and Fabrication of Attachments for Square Hole Drill

the mechanical design and of a square hole producing tool based on reuleaux triangle. The main aim of our paper is to investigate how the circular motion can be converted into a square motion by purely a mechanical link...

Download PDF file
  • EP ID EP23038
  • DOI -
  • Views 272
  • Downloads 4

How To Cite

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe (2017). An Efficient Approach towards Duplicate Detection System. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(1), -. https://europub.co.uk/articles/-A-23038