An Efficient Approach towards Duplicate Detection System
Journal Title: International Journal for Research in Applied Science and Engineering Technology (IJRASET) - Year 2017, Vol 5, Issue 1
Abstract
The volume of information on the web is very large, and the task of search engines has become increasingly complex because a single entity on the web may have two or more representations in databases. Duplicate detection is the process of identifying multiple representations of the same real-world entity. Because duplicate detection methods must process large datasets, identifying duplicate documents in a large database is a significant problem with widespread applications. This paper presents a review of various approaches to duplicate detection. The proposed system will compare two duplicate detection methods. The first is based on two novel progressive duplicate detection algorithms that significantly increase the efficiency of finding duplicates when execution time is limited. The second is based on the Secure Hash Algorithm, which is used to detect and delete duplicate data; it performs the data de-duplication task with the aim of reducing processing time and hash collisions.
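As a rough illustration of the second, hash-based method, the sketch below removes exact duplicates by comparing digests. It is not the authors' implementation: the abstract names only the Secure Hash Algorithm, so the choice of SHA-256, the whitespace and case normalization, and the function names `sha256_digest` and `deduplicate` are all assumptions made for this example.

```python
import hashlib


def sha256_digest(record: str) -> str:
    """Return the SHA-256 hex digest of a lightly normalized record.

    Normalization (lowercasing, collapsing whitespace) is an assumption
    of this sketch, not something specified in the abstract.
    """
    normalized = " ".join(record.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def deduplicate(records):
    """Split records into unique entries and detected duplicates.

    Returns (unique, duplicates), where duplicates is a list of
    (duplicate_index, first_seen_index) pairs.
    """
    seen = {}  # digest -> index of the first record with that digest
    unique, duplicates = [], []
    for index, record in enumerate(records):
        digest = sha256_digest(record)
        if digest in seen:
            duplicates.append((index, seen[digest]))
        else:
            seen[digest] = index
            unique.append(record)
    return unique, duplicates


records = ["John Smith, Pune", "john  smith, pune", "Jane Doe, Nagpur"]
unique, duplicates = deduplicate(records)
```

A digest comparison of this kind only catches records that are identical after normalization. Near-duplicates, such as records with typos or reordered fields, would need a similarity-based method like the progressive algorithms mentioned above.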
Authors and Affiliations
Miss. Ruchira Dhananjay Deshpande, Sonali Bodkhe