ENHANCED NEIGHBORHOOD NORMALIZED POINTWISE MUTUAL INFORMATION ALGORITHM FOR CONSTRAINT AWARE DATA CLUSTERING

Journal Title: ICTACT Journal on Soft Computing - Year 2016, Vol 6, Issue 4

Abstract

Clustering of similar data items is an important technique in mining useful patterns. To enhance the performance of Clustering, training or learning is an important task. A constraint learning semi-supervised methodology is proposed which incorporates SVM and Normalized Pointwise Mutual Information Computation Strategy to increase the relevance as well as the performance efficiency of clustering. The SVM Classifier is of Hard Margin Type to roughly classify the initial set. A recursive re-clustering approach is proposed for achieving higher degree of relevance in the final clustered set by incorporating ENNPI algorithm. An overall enriched F-Measure value of 94.09% is achieved as compared to existing algorithms.

Authors and Affiliations

Pushpa C N, Gerard Deepak, Mohammed Zakir, Thriveni J, Venugopal K R

Keywords

Related Articles

ENHANCED HYBRID PSO – ACO ALGORITHM FOR GRID SCHEDULING

Grid computing is a high performance computing environment to solve larger scale computational demands. Grid computing contains resource management, task scheduling, security problems, information management and so on. T...

ONTOLOGY EXTRACTION FOR AGRICULTURE DOMAIN IN MARATHI LANGUAGE USING NLP TECHNIQUES

Ontology is defined as shared specification of conceptual vocabulary used for formulating knowledge-level theories about a domain of discourse. Dataset is created by manually collecting information about different diseas...

A PARTIAL RATIO AND RATIO BASED FUZZY-WUZZY PROCEDURE FOR CHARACTERISTIC MINING OF MATHEMATICAL FORMULAS FROM DOCUMENTS

Retrieval of mathematical text from data is a key predicament in present circumstances. To achieve this, we have considered three different algorithms viz., Sequence matcher, Levenshtein Distance and Fuzzy-Wuzzy. Two dif...

A STATE OF THE ART SURVEY ON POLYMORPHIC MALWARE ANALYSIS AND DETECTION TECHNIQUES

Nowadays, systems are under serious security threats caused by malicious software, commonly known as malware. Such malwares are sophisticatedly created with advanced techniques that make them hard to analyse and detect,...

SARCASM DETECTION IN ONLINE REVIEW TEXT

Sarcasm is a type of sentiment where people express negative sentiment using positive connotation words in text and vice-versa. In this work, we propose a cross-domain sarcasm detection framework that allows acquisition,...

Download PDF file
  • EP ID EP199134
  • DOI 10.21917/ijsc.2016.0176
  • Views 85
  • Downloads 0

How To Cite

Pushpa C N, Gerard Deepak, Mohammed Zakir, Thriveni J, Venugopal K R (2016). ENHANCED NEIGHBORHOOD NORMALIZED POINTWISE MUTUAL INFORMATION ALGORITHM FOR CONSTRAINT AWARE DATA CLUSTERING. ICTACT Journal on Soft Computing, 6(4), 1287-1292. https://europub.co.uk/articles/-A-199134