Decentralized Probabilistic Text Clustering by using Distributed Hierarchical peer to peer Clustering

Abstract

Document clustering (or text clustering) is the application of cluster analysis to textual documents. It has applications in automatic document organization, topic extraction, and fast information retrieval or filtering. For text clustering, we use a decentralized probabilistic text clustering algorithm to mine the data. With a traditional centralized approach to analyzing massive distributed data, it is extremely difficult to draw conclusions based on the collective characteristics of disparate data. The goal is to achieve modularity and scalability. The decentralized probabilistic text clustering algorithm is less scalable in distributed clustering, whereas the Distributed Hierarchical Peer-to-Peer Clustering (DHP2PC) algorithm is scalable and efficient. In this approach, a subset of the document collection is centrally partitioned into clusters, for which cluster signatures are created. The DHP2PC algorithm finds its roots in a parallel implementation. Using the cluster signatures, we can mine massive distributed data. The algorithm offers probabilistic guarantees for the correctness of each document assignment to a cluster.
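The signature-based assignment described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define how signatures are built or compared, so the sketch assumes a signature is the set of most frequent terms in a cluster and that documents are matched by cosine similarity. The function names, the `top_k` parameter, and the sample data are all hypothetical, and the probabilistic correctness guarantees mentioned in the abstract are not modeled here.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep purely alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha()]


def build_signature(docs, top_k=5):
    """Assumed signature: the top_k most frequent terms of a cluster,
    weighted by their relative frequency among those terms."""
    counts = Counter()
    for doc in docs:
        counts.update(tokenize(doc))
    top = counts.most_common(top_k)
    total = sum(c for _, c in top)
    if total == 0:
        return {}
    return {term: c / total for term, c in top}


def cosine(signature, doc_counts):
    """Cosine similarity between a signature and a document's term counts."""
    dot = sum(w * doc_counts.get(t, 0) for t, w in signature.items())
    norm_sig = math.sqrt(sum(w * w for w in signature.values()))
    norm_doc = math.sqrt(sum(c * c for c in doc_counts.values()))
    if norm_sig == 0 or norm_doc == 0:
        return 0.0
    return dot / (norm_sig * norm_doc)


def assign(doc, signatures):
    """Assign a document to the cluster whose signature matches it best."""
    doc_counts = Counter(tokenize(doc))
    return max(signatures, key=lambda name: cosine(signatures[name], doc_counts))


# Hypothetical sample: a small subset that was clustered centrally.
sample_clusters = {
    "sports": ["football match goal", "team wins football league"],
    "finance": ["stock market prices fall", "bank raises interest rates market"],
}
signatures = {name: build_signature(docs) for name, docs in sample_clusters.items()}

# Remaining documents (for example, those held by other peers) are
# assigned using only the compact signatures.
print(assign("the football team scored a goal", signatures))
print(assign("interest rates and the stock market", signatures))
```

Only the signatures need to be shared between nodes in this sketch, which is one plausible reason a signature-based scheme suits distributed data: each node can assign its own documents locally without shipping them to a central site.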

Authors and Affiliations

Attuluri Uday Kiran, Rakesh Nayak

How To Cite

Attuluri Uday Kiran, Rakesh Nayak (2014). Decentralized Probabilistic Text Clustering by using Distributed Hierarchical peer to peer Clustering. International Journal of Research in Computer and Communication Technology, 3(11), -. https://europub.co.uk/articles/-A-28112