Decentralized Probabilistic Text Clustering by using Distributed Hierarchical peer to peer Clustering

Abstract

Document clustering (or text clustering) is the application of cluster analysis to textual documents. It has applications in automatic document organization, topic extraction, and fast information retrieval or filtering. For text clustering, we use a decentralized probabilistic text clustering algorithm to mine the data. With a traditional centralized approach to analyzing massive distributed data, it is extremely difficult to draw conclusions based on the collective characteristics of disparate data. The goal is to achieve modularity and scalability. The decentralized probabilistic text clustering algorithm is less scalable in distributed clustering, whereas the Distributed Hierarchical Peer-to-Peer Clustering (DHP2PC) algorithm is scalable and efficient. In this approach, a subset of the document collection is centrally partitioned into clusters, for which cluster signatures are created. The DHP2PC algorithm finds its roots in a parallel implementation. Using cluster signatures, we can mine massive distributed data. The algorithm offers probabilistic guarantees for the correctness of each document assignment to a cluster.
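The signature step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It assumes that a sample of documents has already been partitioned centrally, that a cluster signature is simply the set of the most frequent terms in a cluster, and that a new document is assigned to the cluster whose signature it overlaps most (Jaccard similarity). The names `tokenize`, `build_signature`, and `assign`, the `top_n` parameter, and the sample data are all hypothetical, and the sketch does not reproduce the probabilistic guarantees the abstract refers to.

```python
from collections import Counter

PUNCT = ".,;:!?()\"'"

def tokenize(text):
    """Lowercase, split on whitespace, and strip basic punctuation."""
    words = (w.strip(PUNCT).lower() for w in text.split())
    return [w for w in words if w]

def build_signature(docs, top_n=5):
    """Assumed signature: the top_n terms by document frequency in a cluster."""
    counts = Counter()
    for doc in docs:
        counts.update(set(tokenize(doc)))  # count each term once per document
    # Sort by descending frequency, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {term for term, _ in ranked[:top_n]}

def assign(doc, signatures):
    """Assign a document to the cluster whose signature has the highest Jaccard overlap."""
    terms = set(tokenize(doc))

    def overlap(sig):
        union = terms | sig
        return len(terms & sig) / len(union) if union else 0.0

    # Iterate over sorted names so ties resolve to the alphabetically first cluster.
    return max(sorted(signatures), key=lambda name: overlap(signatures[name]))

# Hypothetical, already-partitioned sample of the collection.
sample_clusters = {
    "sports": ["the team won the football match",
               "football players train for the match"],
    "finance": ["the bank raised interest rates",
                "interest rates affect bank loans"],
}
signatures = {name: build_signature(docs) for name, docs in sample_clusters.items()}

# Remaining documents are matched against the signatures only,
# without access to the sample documents themselves.
print(assign("the football match was exciting", signatures))
print(assign("bank loans and interest", signatures))
```

In this sketch only the compact signatures need to be shared with other nodes, which is one plausible reading of why signatures help with distributed data; the paper's actual signature construction may differ.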

Authors and Affiliations

Attuluri Uday Kiran, Rakesh Nayak

How To Cite

Attuluri Uday Kiran, Rakesh Nayak (2014). Decentralized Probabilistic Text Clustering by using Distributed Hierarchical peer to peer Clustering. International Journal of Research in Computer and Communication Technology, 3(11), -. https://europub.co.uk/articles/-A-28112