Self Appreciating Concept Based Model For Cross Domain Document Classification

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3

Abstract

 Abstract : In text mining, text categorization is an important technique for classifying the documents. Most of the times statistical approaches that are based on analysis of the term in the form of frequency of the term, that is the number of occurrences of one or more words in the document are used for classification. Even statistical analysis indicates the importance of the term, but it is hard to analyze when multiple terms have the same frequency value, but one term is more important in terms of meaning than the other. Also, there are a wide variety of documents being generated that belongs to different domains which differ in formats, writing styles, etc. These domains can be news articles, e-mails, online chats, blogs, wiki articles, twitter posts, message forums, speech transcripts, etc. Often a classification method that works well in one domain does not work as well in another. The proposed system tries to implement a concept based text classification model that classifies the cross-domain text data based on the semantics or theme of the text data. Also the proposed approach makes the training system stronger and stronger at all possible positive tests of the categorizer. This system is called as a Self Appreciating Concept Based Classifier (SACBC).

Authors and Affiliations

Dipak A. Sutar

Keywords

Related Articles

 I-ViDE: An Improved Vision-Based Approach for Deep Web Data Extraction

 Abstract: Deep Web contents are accessed by queries submitted to Web databases and the returned data records are enwrapped in dynamically generated Web pages (they will be called deep Web pages in this paper). Extr...

 Real Time Monitoring Of Urban Water Systems for Developing Countries

 Abstract: The safe and satisfactory provision of water related services in urban areas of developing countries remain a major challenge mainly attributed to limited resources and inability to promptly detect operat...

 Load Balancing In Public Cloud

 In present days cloud computing is one of the greatestplatform which provides storage of data in very lowercost and available for all time over the internet.But ithas more critical issue like security, load man...

 An Extended Approach for Online Testing of Reversible Circuits

 Reversible computing has tremendous benefits in terms of power consumption, less heat dissipation and packaging density. Because its applications are found in diverse fields including quantum computing, nanotech...

Adaptive Personalized Web Search with Safety Seclusion

Abstract: The Comprehensive research demonstrations the competence of our framework. We additionally give an online expectation component to choosing whether customizing a query is beneficial. Personalized web search (PW...

Download PDF file
  • EP ID EP152519
  • DOI 10.9790/0661-16329095
  • Views 75
  • Downloads 0

How To Cite

Dipak A. Sutar (2014).  Self Appreciating Concept Based Model For Cross Domain Document Classification. IOSR Journals (IOSR Journal of Computer Engineering), 16(3), 90-95. https://europub.co.uk/articles/-A-152519