Accuracy Based Feature Ranking Metric for Multi-Label Text Classification

Abstract

In many application domains, such as machine learning, scene and video classification, data mining, medical diagnosis and machine vision, instances belong to more than one categories. Feature selection in single label text classification is used to reduce the dimensionality of datasets by filtering out irrelevant and redundant features. The process of dimensionality reduction in multi-label classification is a different scenario because here features may belong to more then one classes. Label and instance space is rapidly increasing by the grandiose of Internet, which is challenging for Multi-Label Classification (MLC). Feature selection is crucial for reduction of data in MLC. Method adaptation and data set transformation are two techniques used to select features in multi label text classification. In this paper, we present dataset transformation technique to reduce the dimensionality of multi-label text data. We used two model transformation approaches: Binary Relevance, and Label Power set for transformation of data from multi-label to single label. The Process of feature selection is done using filter approach which utilizes the data to decide the importance of features without applying learning algorithm. In this paper we used a simple measure (ACC2) for feature selection in multi-label text data. We used problem transformation approach to apply single label feature selection measures on multi-label text data; did the comparison of ACC2 with two other feature selection methods, information gain (IG) and Relief measure. Experimentation is done on three bench mark datasets and their empirical evaluation results are shown. ACC2 is found to perform better than IG and Relief in 80% cases of our experiments.

Authors and Affiliations

Muhammad Nabeel Asim, Abdur Rehman, Umar Shoaib

Keywords

Related Articles

Comparative Performance Analysis of Efficient MIMO Detection Approaches

The promising massive level MIMO (multiple-input-multiple-output) systems based on extremely huge antenna collections have turned into a sizzling theme of wireless com-munication systems. This paper assesses the performa...

ReCSDN: Resilient Controller for Software Defined Networks

Software Defined Networking (SDN) is an emerging network paradigm that provides central control over the network. Although, this simplifies the network management and makes efficient use of network resources, it introduc...

Development of Talent Model based on Publication Performance using Apriori Technique

The main problem or challenge faced by Human Resource Management (HRM) is to recognize, develop and manage talent efficiently and effectively. This is because HRM is responsible for selecting the correct talent for suita...

Investigating Students’ Acceptance of Online Courses at Al-Ahliyya Amman University

Online courses allow students to access the course materials anytime and anywhere. Those courses are meant to enhance and improve the learning processes. Unfortunately, by analyzing data of an online course in Al-Ahliyya...

13: 32 x 10 and 64 × 10 Gb/s transmission using hybrid Raman-Erbium doped optical amplifiers

We have successfully demonstrated a long-haul transmission of 32 × 10 Gbit/s and 64 × 10 Gbit/s over single-mode fiber of 650 km and 530 km respectively by using RAMAN-EDFA hybrid optical amplifier as inline and preampli...

Download PDF file
  • EP ID EP262292
  • DOI 10.14569/IJACSA.2017.081048
  • Views 67
  • Downloads 0

How To Cite

Muhammad Nabeel Asim, Abdur Rehman, Umar Shoaib (2017). Accuracy Based Feature Ranking Metric for Multi-Label Text Classification. International Journal of Advanced Computer Science & Applications, 8(10), 369-378. https://europub.co.uk/articles/-A-262292