Accuracy Based Feature Ranking Metric for Multi-Label Text Classification
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2017, Vol 8, Issue 10
Abstract
In many application domains, such as machine learning, scene and video classification, data mining, medical diagnosis and machine vision, instances belong to more than one categories. Feature selection in single label text classification is used to reduce the dimensionality of datasets by filtering out irrelevant and redundant features. The process of dimensionality reduction in multi-label classification is a different scenario because here features may belong to more then one classes. Label and instance space is rapidly increasing by the grandiose of Internet, which is challenging for Multi-Label Classification (MLC). Feature selection is crucial for reduction of data in MLC. Method adaptation and data set transformation are two techniques used to select features in multi label text classification. In this paper, we present dataset transformation technique to reduce the dimensionality of multi-label text data. We used two model transformation approaches: Binary Relevance, and Label Power set for transformation of data from multi-label to single label. The Process of feature selection is done using filter approach which utilizes the data to decide the importance of features without applying learning algorithm. In this paper we used a simple measure (ACC2) for feature selection in multi-label text data. We used problem transformation approach to apply single label feature selection measures on multi-label text data; did the comparison of ACC2 with two other feature selection methods, information gain (IG) and Relief measure. Experimentation is done on three bench mark datasets and their empirical evaluation results are shown. ACC2 is found to perform better than IG and Relief in 80% cases of our experiments.
Authors and Affiliations
Muhammad Nabeel Asim, Abdur Rehman, Umar Shoaib
Scheduling on Heterogeneous Multi-core Processors Using Stable Matching Algorithm
Heterogeneous Multi-core Processors (HMP) are better to schedule jobs as compare to homogenous multi-core processors. There are two main factors associated while analyzing both architectures i.e. performance and power co...
Cross Site Scripting: Detection Approaches in Web Application
Web applications have become one of the standard platforms for service releases and representing information and data over the World Wide Web. Thus, security vulnerabilities headed to various type of attacks in web appli...
MapReduce Performance in MongoDB Sharded Collections
In the modern era of computing and countless of online services that gather and serve huge data around the world, processing and analyzing Big Data has rapidly developed into an area of its own. In this paper, we focus o...
Formal Specification and Analysis of Termination Detection by Weight-throwing Protocol
Termination detection is a critical problem in distributed systems. A distributed computation is called terminated if all of its processes become idle and there are no in-transit messages in communication channels. A dis...
Energy Efficient Algorithm for Wireless Sensor Network using Fuzzy C-Means Clustering
Energy efficiency is a vital issue in wireless sensor networks. In this paper, an energy efficient routing algorithm has been proposed with an aim to enhance lifetime of network. In this paper, Fuzzy C-Means clustering h...