An Investigation on Topic Maps Based Document Classification with Unbalance Classes

Journal Title: Journal of Independent Studies and Research - Computing - Year 2015, Vol 13, Issue 1

Abstract

Classification of imbalanced data has become a widespread problem due to the fact that the most real world datasets are imbalanced. In a classification task, one of the challenges is to learn the feature-space of classification under class-imbalance setting. The majority classes generally have good representation of features in the learned classification function and the minority classes lack this representation; subsequently, the classification for these classes failed more often. In this paper, authors investigate the task of document classification with topic map based representation of documents under class imbalance setting. In order to measure of topic-map based representation for classification under imbalance data, authors compare three representations: Bag-ofWords, Phrases and Topic terms for three approaches (i) under-sampling, (ii) cost-adjusting, and (iii) cluster based sampling. A series of experiments are carried out and results are reported.

Authors and Affiliations

Keywords

Related Articles

Extracting a Graph Model by Mapping Two Heterogeneous Graphs

With the development of wireless communications, several studies have been performed on Location based Services due to their numerous applications. Amongst those recommendations, Travel Planning and Recommendations are f...

Acceptance of Internet Banking Services with Respect to Security and Privacy Perceptions: An Application of TAM

The internet is playing a major role in providing financial services in Banking, leading to competitive edge in gaining banking customers, who would like essential banking services to be availed anywhere and at any time....

Analytical Comparison of RSA and RSA with Chinese Remainder Theorem

RSA encryption algorithm is one of the most powerful public key encryption algorithm. The problem with RSA algorithm is that RSA decryption is relatively slow in comparison to RSA encryption. Chinese Remainder Theorem (C...

Improving ATM User Interface (UI) of Pakistani Banks Using Keystroke Level Modelling (KLM)

The ATM connotes as Automated Teller Machine or Cash Machine. This machine has earned its currency on a larger scale in our modern society. However, unfortunately, most users have met bad experiences. For instance, reins...

Ontology Driven Requirement Specification

Requirement engineering RE process is an important step of software development lifecycle and it includes a variety of activities starting with requirement elicitation to requirement documentation. This form of engineeri...

Download PDF file
  • EP ID EP643241
  • DOI 10.31645/jisrc/(2015).13.1.0007
  • Views 161
  • Downloads 0

How To Cite

(2015). An Investigation on Topic Maps Based Document Classification with Unbalance Classes. Journal of Independent Studies and Research - Computing, 13(1), 50-56. https://europub.co.uk/articles/-A-643241