A Method for Chinese Short Text Classification Considering Effective Feature Expansion

Abstract

 This paper presents a Chinese short text classification method which considering extended semantic constraints and statistical constraints. This method uses “HowNet” tools to build the attribute set of concept. when coming to the part of feature expansion, we judge the collocation between the attribute words of original text and the characteristics before and after expansion as the semantic constraints, and calculate the ratio between the mutual information of the original contents and the features before expansion versus the mutual information of the original contents and the features after expansion as statistical constraints, so as to judge whether feature expansion is effective with this two constraints , then rationally use various semantic relation word-pairs in short text classification. Experiments show that this method can use semantic relations in Chinese short text classification effectively, and improve the classification performance.

Authors and Affiliations

Mingxuan liu , Xinghua Fan

Keywords

Related Articles

Analysis of Gumbel Model for Software Reliability Using Bayesian Paradigm

In this paper, we have illustrated the suitability of Gumbel Model for software reliability data. The model parameters are estimated using likelihood based inferential procedure: classical as well as Bayesian. The quasi...

 Category Decomposition Method for Un-Mixing of Mixels Acquired with Spaceborne Based Visible and Near Infrared Radiometers by Means of Maximum Entropy Method with Parameter Estimation Based on Simulated Annealing

 Category decomposition method for un-mixing of mixels (Mixed Pixels) which is acquired with spaceborne based visible to near infrared radiometers by means of Maximum Entropy Method (MEM) with parameter estimation b...

 Multiple-Language Translation System Focusing on Long-distance Medical and Outpatient Services

 For people living in the countryside, an effective long-distance medical and health service is very important. People living in western China, especially, require convenient communication in their native language w...

Local Feature based Gender Independent Bangla ASR

This paper presents an automatic speech recognition (ASR) for Bangla (widely used as Bengali) by suppressing the speaker gender types based on local features extracted from an input speech. Speaker-specific characteristi...

 Speech emotion recognition in emotional feedback for Human-Robot Interaction

 For robots to plan their actions autonomously and interact with people, recognizing human emotions is crucial. For most humans nonverbal cues such as pitch, loudness, spectrum, speech rate are efficient carriers of...

Download PDF file
  • EP ID EP130001
  • DOI -
  • Views 108
  • Downloads 0

How To Cite

Mingxuan liu, Xinghua Fan (2012).  A Method for Chinese Short Text Classification Considering Effective Feature Expansion. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 1(1), 1-5. https://europub.co.uk/articles/-A-130001