Smart Cloud Document Clustering and plagiarism checker using TF-IDF Based on Cosine Similarity

Journal Title: GRD Journal for Engineering - Year 2017, Vol 2, Issue 5

Abstract

This research paper describes the results oriented from experimental study of conventional document clustering techniques implemented in the commercial spaces so far. Particularly, we compared main approaches related to document clustering, agglomerative hierarchical document clustering and K-means. Though this paper, we generates and implement checker’s algorithms which deals with the duplicacy of the document content with the rest of the documents in the cloud. We also generate algorithm required to deals with the classification of the cloud data. The classification in this algorithm is done on the basis of the date of data uploaded and. We will take the ratio of both vectors and generate a score which rates the document in the classification.

Authors and Affiliations

Sudhir Sahani, Rajat Goyal, Saurabh Sharma, Shaili Gupta

Keywords

Related Articles

Crowd Funding using Blockchain

Crowd funding is an online money-raising strategy that began as a way for the public to donate small amounts of money to help creative people finance their projects. Through crowdfunding, individuals are able to invest i...

Blade Design, Analysis and Utilization of Vertical Axis Windmill using ANSYS Software for Streetlights

The objective of this project is to generate electric power through the fabrication of savonius wind mill and to regulate the power generated in order to use that power for automatic street light using LDR (Light Depende...

Comparative Study on IS 456:2000 and Eurocode 2: EN 1992-1-1 for Analysis and Design of R.C.C. Beam

The reinforced concrete structures must be analyzed and designed according to the provisions of relative design standards. Design codes are the documents which are established for the design of a respective structure. Mo...

An Advanced Two Level Double Dual Boost Converter

This proposed work has two converters connected in cascade to have output voltage 4 times of input voltage. This converter is a non-isolated boost converter, which can level up Dc voltage from 24 Vdc input voltage to 120...

Implementation of a General Purpose Sorter on FPGA

The objective of the paper is to implement a general purpose sorting algorithm. The paper should offer a sorting network that can be deployed in various applications in impulsive noise reduction filters for image process...

Download PDF file
  • EP ID EP224420
  • DOI -
  • Views 86
  • Downloads 0

How To Cite

Sudhir Sahani, Rajat Goyal, Saurabh Sharma, Shaili Gupta (2017). Smart Cloud Document Clustering and plagiarism checker using TF-IDF Based on Cosine Similarity. GRD Journal for Engineering, 2(5), 331-333. https://europub.co.uk/articles/-A-224420