An Efficient Approach towards Duplicate Detection System

Abstract

Information on the web is very huge in size and the tasks of search engines have become more and more complex as a single entity on the web have two or more representations in databases. The duplicate detection is the process of identifying the entities who has multiple representation of the same real world entity, as the duplicate detection methods has to process large datasets, the identification of duplicate document in a large database is a issue significantly with wide-spread applications. In this paper a review on various approaches of duplicate detection will be presented. Proposed system will compare two Duplication detection methods, the first is based on two novel progressive duplicate detection algorithms that significantly increase the efficiency of finding duplicates if the execution time is limited. The second is based on Secure Hashing Algorithm which will detect and delete duplicate data, the secure hash algorithm will perform data de-duplication task in order to overcome the issues of time and to reduce hash collision.

Authors and Affiliations

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe

Keywords

Related Articles

Pitch Controlled PMSG Based Wind Energy Conversion System with Control of DC Link Voltage and Load Voltage Variations

In this paper, a novel algorithm, based on dc link voltage, is proposed for a standalone permanent magnet synchronous generator (PMSG)-based variable speed wind energy conversion system. Moreover, by maintaining the dc...

Captcha as Graphical Password for E-Commerce

In E-commerce based web portals main issue is security. To rectify security issues we propose a new technique called captcha as a graphical password (CaRP). Graphical password address the security issues like online gue...

Parametric Study on Tubed Steel Reinforced Concrete Columns under Axial Loading

The decades have seen outstanding advances in the use of composite steel-concrete structural systems in the construction of buildings. Concrete-steel composite structure is defined as construction in which both steel an...

Automated Load Shedding and Notification to the Consumer Using GSM (Smart Power Grid)

Our aim of this project is to automate the load shedding and notifying the customer about the load shedding using GSM, according to the timings which is set in the PC. And also for reading electrical energy consumed by...

Application of Gravitational Search Algorithm for Planning Distribution Networks with Multiple Dg Units Based On Their Real and Reactive Power Delivering Capability

One of the major developments in the Current Distribution Network is the integration of Distributed Generation (DG). Planning the distribution systems without considering the site and size of multiple DG units could res...

Download PDF file
  • EP ID EP23038
  • DOI -
  • Views 263
  • Downloads 4

How To Cite

Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe (2017). An Efficient Approach towards Duplicate Detection System. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 5(1), -. https://europub.co.uk/articles/-A-23038