A Fuzzy Similarity Based Concept Mining Model for Text Classification
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2011, Vol 2, Issue 11
Abstract
Text Classification is a challenging and a red hot field in the current scenario and has great importance in text categorization applications. A lot of research work has been done in this field but there is a need to categorize a collection of text documents into mutually exclusive categories by extracting the concepts or features using supervised learning paradigm and different classification algorithms. In this paper, a new Fuzzy Similarity Based Concept Mining Model (FSCMM) is proposed to classify a set of text documents into pre - defined Category Groups (CG) by providing them training and preparing on the sentence, document and integrated corpora levels along with feature reduction, ambiguity removal on each level to achieve high system performance. Fuzzy Feature Category Similarity Analyzer (FFCSA) is used to analyze each extracted feature of Integrated Corpora Feature Vector (ICFV) with the corresponding categories or classes. This model uses Support Vector Machine Classifier (SVMC) to classify correctly the training data patterns into two groups; i. e., + 1 and – 1, thereby producing accurate and correct results. The proposed model works efficiently and effectively with great performance and high - accuracy results.
Authors and Affiliations
Shalini Puri
Undergraduate’s Perception on Massive Open Online Course (MOOC) Learning to Foster Employability Skills and Enhance Learning Experience
The Massive Open Online Course (MOOC) is a very recent development in higher education institutions in Malaysia. As in September 2015, Universiti Teknikal Malaysia Melaka (UTeM) has introduced Mandarin course under Malay...
Fault Attacks Resistant Architecture for KECCAK Hash Function
The KECCAK cryptographic algorithms widely used in embedded circuits to ensure a high level of security to any systems which require hashing as the integrity checking and random number generation. One of the most efficie...
Design of Wearable Patch Antenna for Wireless Body Area Networks
Wireless body area networks are being widely used due to the increase in the use of wireless networks and various electrical devices. A Wearable Patch antenna is used for enhancement of various applications for WBAN. In...
Strategic Framework and Maturity Index for Measuring Knowledge Management Practices in Government Organizations
Knowledge is considered as an intellectual asset of any Organization through which performance of the Organization could be enhanced exponentially. Harnessing of the Organization’s Tacit and Explicit knowledge and its Ma...
A Decision Tree Classification Model for University Admission System
Data mining is the science and techniques used to analyze data to discover and extract previously unknown patterns. It is also considered a main part of the process of knowledge discovery in databases (KDD). In this pape...