Self Appreciating Concept Based Model For Cross Domain Document Classification

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3

Abstract

 Abstract : In text mining, text categorization is an important technique for classifying the documents. Most of the times statistical approaches that are based on analysis of the term in the form of frequency of the term, that is the number of occurrences of one or more words in the document are used for classification. Even statistical analysis indicates the importance of the term, but it is hard to analyze when multiple terms have the same frequency value, but one term is more important in terms of meaning than the other. Also, there are a wide variety of documents being generated that belongs to different domains which differ in formats, writing styles, etc. These domains can be news articles, e-mails, online chats, blogs, wiki articles, twitter posts, message forums, speech transcripts, etc. Often a classification method that works well in one domain does not work as well in another. The proposed system tries to implement a concept based text classification model that classifies the cross-domain text data based on the semantics or theme of the text data. Also the proposed approach makes the training system stronger and stronger at all possible positive tests of the categorizer. This system is called as a Self Appreciating Concept Based Classifier (SACBC).

Authors and Affiliations

Dipak A. Sutar

Keywords

Related Articles

Soft Phone Support Voice and Video Calling Using Sip And Rtp Protocol

Soft Phone is a VoIP soft phone that uses the Session Initiation Protocol. It is a powerful and unique SIP software telephone that lets users make phone and video calls using single software application using any Voice o...

 Aisha Email System

 The Paper Entitle “Aisha Email System” deals with identifying the clients to send and receive mail with the same login. This utility will allow multiple clients to login under the same login name and still have &...

 Monitoring Wireless Sensor Network using Android based Smart Phone Application

 Abstract: Wireless Sensor Network application’s is use in detection of natural calamities like forest fire detection, flood detection, , earth quick early detection ,snow detection, traffic congestion and various o...

 Scalable Schedule Routing and Minimizing Delay in Tolerant Networks

 There are many routing strategies for message delivery in Delay Tolerant Networks. Among them Multicopying routing strategies have been considered the most applicable DTNs. Epidemic routing and two-hop forwarding r...

 A Novel Approach for Semi Supervised Document Clustering with Constraint Score based Feature Supervision

Abstract: Text document clustering provides an effective technique to manage a huge amount of retrieval outcome by grouping documents in a small number of meaningful classes. In unsupervised clustering method the unlabel...

Download PDF file
  • EP ID EP152520
  • DOI 10.9790/0661-16329095
  • Views 80
  • Downloads 0

How To Cite

Dipak A. Sutar (2014).  Self Appreciating Concept Based Model For Cross Domain Document Classification. IOSR Journals (IOSR Journal of Computer Engineering), 16(3), 90-95. https://europub.co.uk/articles/-A-152520