A Method for Chinese Short Text Classification Considering Effective Feature Expansion

Abstract

 This paper presents a Chinese short text classification method which considering extended semantic constraints and statistical constraints. This method uses “HowNet” tools to build the attribute set of concept. when coming to the part of feature expansion, we judge the collocation between the attribute words of original text and the characteristics before and after expansion as the semantic constraints, and calculate the ratio between the mutual information of the original contents and the features before expansion versus the mutual information of the original contents and the features after expansion as statistical constraints, so as to judge whether feature expansion is effective with this two constraints , then rationally use various semantic relation word-pairs in short text classification. Experiments show that this method can use semantic relations in Chinese short text classification effectively, and improve the classification performance.

Authors and Affiliations

Mingxuan liu , Xinghua Fan

Keywords

Related Articles

 Method and System for Human Action Detections with Acceleration Sensors for the Proposed Rescue System for Disabled and Elderly Persons Who Need a Help in Evacuation from Disaster Area

 Method and system for human action detections with acceleration sensors for the proposed rescue system for disabled and elderly persons who need a help in evacuation from disaster areas is proposed. Not only vital...

 Automatic Recognition of Human Parasite Cysts on Microscopic Stools Images using Principal Component Analysis and Probabilistic Neural Network

 Parasites live in a host and get its food from or at the expensive of that host. Cysts represent a form of resistance and spread of parasites. The manual diagnosis of microscopic stools images is time-consuming and...

 New concepts of fuzzy planar graphs

 Fuzzy planar graph is an important subclass of fuzzy graph. Fuzzy planar graphs and its several properties are presented. A very close association of fuzzy planar graph is fuzzy dual graph. This is also defined and...

A Cumulative Multi-Niching Genetic Algorithm for Multimodal Function Optimization

This paper presents a cumulative multi-niching genetic algorithm (CMN GA), designed to expedite optimization problems that have computationally-expensive multimodal objective functions. By never discarding individuals fr...

 Applying Inhomogeneous Probabilistic Cellular Au-tomata Rules on Epidemic Model

 This paper presents some of the results of our probabilis¬tic cellular automaton (PCA) based epidemic model. It is shown that PCA performs better than deterministic ones. We consider two possible ways of interactio...

Download PDF file
  • EP ID EP130001
  • DOI -
  • Views 110
  • Downloads 0

How To Cite

Mingxuan liu, Xinghua Fan (2012).  A Method for Chinese Short Text Classification Considering Effective Feature Expansion. International Journal of Advanced Research in Artificial Intelligence(IJARAI), 1(1), 1-5. https://europub.co.uk/articles/-A-130001