A Method for Chinese Short Text Classification Considering Effective Feature Expansion
Journal Title: International Journal of Advanced Research in Artificial Intelligence(IJARAI) - Year 2012, Vol 1, Issue 1
Abstract
This paper presents a Chinese short text classification method which considering extended semantic constraints and statistical constraints. This method uses “HowNet” tools to build the attribute set of concept. when coming to the part of feature expansion, we judge the collocation between the attribute words of original text and the characteristics before and after expansion as the semantic constraints, and calculate the ratio between the mutual information of the original contents and the features before expansion versus the mutual information of the original contents and the features after expansion as statistical constraints, so as to judge whether feature expansion is effective with this two constraints , then rationally use various semantic relation word-pairs in short text classification. Experiments show that this method can use semantic relations in Chinese short text classification effectively, and improve the classification performance.
Authors and Affiliations
Mingxuan liu , Xinghua Fan
Method and System for Human Action Detections with Acceleration Sensors for the Proposed Rescue System for Disabled and Elderly Persons Who Need a Help in Evacuation from Disaster Area
Method and system for human action detections with acceleration sensors for the proposed rescue system for disabled and elderly persons who need a help in evacuation from disaster areas is proposed. Not only vital...
Automatic Recognition of Human Parasite Cysts on Microscopic Stools Images using Principal Component Analysis and Probabilistic Neural Network
Parasites live in a host and get its food from or at the expensive of that host. Cysts represent a form of resistance and spread of parasites. The manual diagnosis of microscopic stools images is time-consuming and...
New concepts of fuzzy planar graphs
Fuzzy planar graph is an important subclass of fuzzy graph. Fuzzy planar graphs and its several properties are presented. A very close association of fuzzy planar graph is fuzzy dual graph. This is also defined and...
A Cumulative Multi-Niching Genetic Algorithm for Multimodal Function Optimization
This paper presents a cumulative multi-niching genetic algorithm (CMN GA), designed to expedite optimization problems that have computationally-expensive multimodal objective functions. By never discarding individuals fr...
Applying Inhomogeneous Probabilistic Cellular Au-tomata Rules on Epidemic Model
This paper presents some of the results of our probabilis¬tic cellular automaton (PCA) based epidemic model. It is shown that PCA performs better than deterministic ones. We consider two possible ways of interactio...