Semantic Similarity Search Model for Obfuscated Plagiarism Detection in Marathi Language using Fuzzy and Naïve Bayes Approaches
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 3
Abstract
Abstract: Plagiarism detection (PD) in natural language texts is an example of NLP applications that is linked with information retrieval (IR) and soft computing (SC) approaches. Obfuscated plagiarism cases contain invisible texts, which is difficult to find in existing plagiarism detection methods. In this paper fuzzy semanticbased similarity search model and Naïve Bayes model for uncovering obfuscated plagiarism for English and Marathi language are presented and compared with different state-of-the-art baselines (B1-W1G, B2- W3G, B3- W5G, B4-S2S). The fuzzy model identification is based on ‘If-then’ fuzzy rules. Semantic relatedness between words is studied based on the part-of-speech (POS) tags and WordNet-based similarity measures. Naïve Bayes classifier is used to achieve better detection performance. Results are assessed using precision, recall, Fmeasure and granularity for Fuzzy and Naïve approaches and it is observed that Naïve Bayes model gives more appropriate result than fuzzy semantic based model.
Authors and Affiliations
Ms. Nilam Shenoy , Mrs. M. A. Potey
The Application of Model Predictive Control (MPC) to Fast Systems such as Autonomous Ground Vehicles (AGV)
Abstract: This paper investigates the application of Model Predictive Control (MPC) to fast systems such as Autonomous Ground Vehicles (AGV) or mobile robots. The control of Autonomous ground vehicles (AGV) is chal...
Twin Key Implementation in Aes
Abstract: In February 2001, NIST announced that a draft of the Federal Information Processing Standard (FIPS) was available for public review and comment. Finally, AES was published as FIPS 197 in the Federal Regis...
A Survey Report on: Become Prudent with Big Data - Technological sophistication in India
In the world of globalization at 360 degree, a heavy digitalized rainy season has raised & the rain drops of digital data is falling from the digitalized sky through lots of clouds of E-Commerce, Mobile-Commerce , So...
The Design and Implementation of On-Line Examination UsingFirewall security
Abstract: Online Examination System is a software solution, which allows a company or institute to arrange,conduct and manage examinations via an online environment. This can be done through the Internet, Intraneta...
Gender classification using face image and voice
Abstract: This paper is about gender classification using face image and voice of a speaker. The basic aim of the paper is to predict the gender of speaker through voice sample using Auto-correlation method and predict t...