Computational Intelligence Methods for Clustering of SenseTagged Nepali Documents

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 1

Abstract

This paper presents a method for document clustering that hybridizes the self-organizing map (SOM), particle swarm optimization (PSO) and the K-means clustering algorithm. Document representation is an important step in clustering. The common way of representing a text is the bag-of-words approach. This approach is simple but has two drawbacks, synonymy and polysemy, which arise from the ambiguity of words and the lack of information about the relations between them. To avoid these drawbacks, words are tagged with their WordNet senses in this paper. Sense tagging provides the exact sense of each word. Feature vectors are generated from the sense-tagged documents, and clustering is carried out using the proposed hybrid SOM+PSO+K-means algorithm. In the proposed algorithm, SOM is first applied to the feature vectors to produce prototypes, and the K-means clustering algorithm is then applied to cluster the prototypes. The particle swarm optimization algorithm is used to find the initial centroids for K-means. Text documents in the Nepali language are used to test the hybrid SOM+PSO+K-means clustering algorithm.
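The three stages named in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the grid size, learning-rate schedule, PSO coefficients, the sum-of-squared-errors fitness, and all function names are assumptions chosen for the example, and the input `X` stands in for feature vectors that would be built from sense-tagged documents.

```python
import numpy as np

def sq_dists(A, B):
    """Pairwise squared Euclidean distances between rows of A and rows of B."""
    return ((A[:, None, :] - B[None, :, :]) ** 2).sum(axis=2)

def train_som(X, grid=(3, 3), epochs=20, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small rectangular SOM and return its prototype vectors."""
    rng = np.random.default_rng(seed)
    n_units = grid[0] * grid[1]
    coords = np.array([(i, j) for i in range(grid[0]) for j in range(grid[1])], dtype=float)
    # Initialise prototypes from randomly chosen input vectors.
    W = X[rng.choice(len(X), n_units, replace=len(X) < n_units)].astype(float)
    total, t = epochs * len(X), 0
    for _ in range(epochs):
        for x in X[rng.permutation(len(X))]:
            frac = t / total
            lr = lr0 * (1 - frac)                    # decaying learning rate
            sigma = sigma0 * (1 - frac) + 1e-3       # shrinking neighbourhood
            bmu = np.argmin(((W - x) ** 2).sum(axis=1))
            h = np.exp(-((coords - coords[bmu]) ** 2).sum(axis=1) / (2 * sigma ** 2))
            W += lr * h[:, None] * (x - W)
            t += 1
    return W

def sse(centroids, P):
    """Assumed fitness: sum of squared distances to the nearest centroid."""
    return sq_dists(P, centroids).min(axis=1).sum()

def pso_centroids(P, k, n_particles=10, iters=30, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Search for k initial centroids; each particle encodes one centroid set."""
    rng = np.random.default_rng(seed)
    pos = np.stack([P[rng.choice(len(P), k, replace=False)] for _ in range(n_particles)])
    vel = np.zeros_like(pos)
    pbest, pbest_fit = pos.copy(), np.array([sse(p, P) for p in pos])
    gbest = pbest[pbest_fit.argmin()].copy()
    for _ in range(iters):
        r1, r2 = rng.random(pos.shape), rng.random(pos.shape)
        vel = w * vel + c1 * r1 * (pbest - pos) + c2 * r2 * (gbest - pos)
        pos = pos + vel
        fit = np.array([sse(p, P) for p in pos])
        better = fit < pbest_fit
        pbest[better], pbest_fit[better] = pos[better], fit[better]
        gbest = pbest[pbest_fit.argmin()].copy()
    return gbest

def kmeans(P, centroids, iters=50):
    """Standard Lloyd iterations starting from the given centroids."""
    C = centroids.copy()
    for _ in range(iters):
        labels = sq_dists(P, C).argmin(axis=1)
        new_C = np.array([P[labels == j].mean(axis=0) if np.any(labels == j) else C[j]
                          for j in range(len(C))])
        if np.allclose(new_C, C):
            break
        C = new_C
    return C, sq_dists(P, C).argmin(axis=1)

def cluster_documents(X, k, grid=(3, 3), seed=0):
    """SOM prototypes -> PSO initial centroids -> K-means on the prototypes."""
    prototypes = train_som(X, grid=grid, seed=seed)
    init = pso_centroids(prototypes, k, seed=seed)
    centroids, proto_labels = kmeans(prototypes, init)
    # Each document inherits the cluster of its best-matching prototype.
    doc_labels = proto_labels[sq_dists(X, prototypes).argmin(axis=1)]
    return doc_labels, centroids

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Hypothetical stand-in for sense-based document feature vectors.
    X = np.vstack([rng.normal(0, 0.3, (20, 4)), rng.normal(5, 0.3, (20, 4))])
    labels, centroids = cluster_documents(X, k=2)
    print(labels)
```

Clustering the SOM prototypes rather than the raw vectors keeps the K-means and PSO steps cheap, since the number of prototypes is fixed by the grid and does not grow with the number of documents.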

Authors and Affiliations

Sunita Sarkar, Arindam Roy, Bipul Syam Purkayastha

How To Cite

Sunita Sarkar, Arindam Roy, Bipul Syam Purkayastha (2015). Computational Intelligence Methods for Clustering of SenseTagged Nepali Documents. IOSR Journals (IOSR Journal of Computer Engineering), 17(1), 83-89. https://europub.co.uk/articles/-A-127150