Text Summarization versus CHI for Feature Selection

Journal Title: Journal of Advances in Mathematics and Computer Science - Year 2017, Vol 22, Issue 4

Abstract

Text Classification is an important technique for handling the huge and increasing amount of text documents on the web. An important problem of text classification is features selection. Many feature selection techniques were used in order to solve this problem, such as chi-square (CHI). Rather than using these techniques, this paper proposes a method for feature selection based on text summarization. We demonstrate this method on Arabic text documents and use text summarization for feature selection. Support Vector Machine (SVM) is then used to classify the summarized documents and the ones processed by CHI. The classification indicators (precision, recall, and accuracy) achieved by text summarization are higher than the ones achieved by CHI. However, text summarization has negligible higher execution time.

Authors and Affiliations

R. S. Jabri, E. Al-Thwaib

Keywords

Related Articles

Properties of T–Anti-Fuzzy Ideals of a –Near-Ring

In this paper, we define Anti-fuzzy ideal of a -near-ring in and -anti-fuzzy ideal of a -near-ring in . we made an attempt to study the properties of -anti-fuzzy ideal of a -near-ring, union of -anti-fuzzy ideal...

An Efficient CRT Based Reverse Converter for {22n+1-1, 2n-1, 22n-1} Moduli Set

This paper presents a reverse converter for the moduli set {22n+1-1, 2n-1, 22n-1} using a Chinese Remainder Theorem (CRT) algorithm and reverse method of data conversion. We compare our result with other converters found...

Estimation of Community Views on Criminal Justice a Statistical Document Analysis Approach

The Community Views on Criminal Justice System (CVCJS) initiative was established to collect a city community's perceptions on experiences with local Police Departments and other agencies in the criminal justice system,...

Designing and Implementation of PIC Microcontroller Based Educational Kit

The microcontrollers are very common components in modern electronic systems. Their using is so widespread that it is almost impossible to work in electronics without coming across it. They are now providing us with a ne...

Empirical Performance of Internal Sorting Algorithm

Internal Sorting Algorithms are used when the list of records is small enough to be maintained entirely in primary memory for the duration of the sort, while External Sorting Algorithms are used when the list of records...

Download PDF file
  • EP ID EP322060
  • DOI 10.9734/BJMCS/2017/33615
  • Views 74
  • Downloads 0

How To Cite

R. S. Jabri, E. Al-Thwaib (2017). Text Summarization versus CHI for Feature Selection. Journal of Advances in Mathematics and Computer Science, 22(4), 1-8. https://europub.co.uk/articles/-A-322060