Text Summarization versus CHI for Feature Selection

Journal Title: Journal of Advances in Mathematics and Computer Science - Year 2017, Vol 22, Issue 4

Abstract

Text classification is an important technique for handling the large and growing volume of text documents on the web. A central problem in text classification is feature selection. Many feature selection techniques, such as chi-square (CHI), have been used to address it. Instead of these techniques, this paper proposes a feature selection method based on text summarization and demonstrates it on Arabic text documents. A Support Vector Machine (SVM) is then used to classify both the summarized documents and the documents processed by CHI. The classification indicators (precision, recall, and accuracy) achieved with text summarization are higher than those achieved with CHI, although text summarization has a negligibly higher execution time.
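For readers unfamiliar with the CHI baseline mentioned in the abstract, the following is a minimal sketch of chi-square feature scoring. It assumes the commonly used 2x2 contingency-table formulation of the term/category chi-square statistic; the abstract does not state which variant the authors used. The function names (chi_square, select_features), the whitespace tokenization, the max-over-categories ranking, and the toy English documents are illustrative assumptions, not part of the paper.

```python
from collections import Counter


def chi_square(n_tc, n_t, n_c, n):
    """Chi-square score of a term for a category, from document counts.

    n_tc: documents in the category that contain the term
    n_t:  documents that contain the term
    n_c:  documents in the category
    n:    total number of documents
    """
    a = n_tc                      # term present, in category
    b = n_t - n_tc                # term present, other categories
    c = n_c - n_tc                # term absent, in category
    d = n - n_t - n_c + n_tc      # term absent, other categories
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        return 0.0                # degenerate table: no evidence either way
    return n * (a * d - c * b) ** 2 / denom


def select_features(docs, labels, k):
    """Return the k terms with the highest chi-square score.

    Assumption: each term is ranked by its maximum score over all
    categories, and ties are broken alphabetically.
    """
    n = len(docs)
    doc_sets = [set(doc.split()) for doc in docs]  # naive whitespace tokens
    class_counts = Counter(labels)
    term_counts = Counter(t for s in doc_sets for t in s)
    term_class = Counter((t, y) for s, y in zip(doc_sets, labels) for t in s)
    scores = {
        t: max(
            chi_square(term_class[(t, c)], term_counts[t], class_counts[c], n)
            for c in class_counts
        )
        for t in term_counts
    }
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy corpus (hypothetical data, for illustration only).
docs = [
    "goal match team",
    "match team win",
    "market stock price",
    "stock price bank",
]
labels = ["sport", "sport", "econ", "econ"]
top_terms = select_features(docs, labels, 4)
```

In a pipeline like the one the abstract describes, the selected terms would then define the feature vectors passed to the SVM classifier; that step is omitted here.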

Authors and Affiliations

R. S. Jabri, E. Al-Thwaib

  • EP ID EP322060
  • DOI 10.9734/BJMCS/2017/33615

How To Cite

R. S. Jabri, E. Al-Thwaib (2017). Text Summarization versus CHI for Feature Selection. Journal of Advances in Mathematics and Computer Science, 22(4), 1-8. https://europub.co.uk/articles/-A-322060