Efficient Preprocessing and Patterns Identification Approach for Text Mining

Journal Title: INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY - Year 2013, Vol 6, Issue 2

Abstract

Due to the rapid expansion of digital data , knowledge discovery and data mining have attracted significant amount of a ttention for turning such data into helpful information and knowledge. Text categorization is continuing to become the most researched NLP problems on account of the ever-increasing levels of electronic documents and digital libraries. we present a novel text categorization method that puts together the decision on multiple attributes. Since the most of existing text mining methods adopted term-based approaches, all of these are affected by the difficulties of polysemy and synonymy. Existing pattern discovery technique includes the processes of pattern deploying and pattern evolving, to strengthen the impact of using and updating discovered patterns for looking for relevant and interesting information. But the current association Rules methods exist shortage in two aspects once it is used on patterns classification. a person is the strategy ignored the data about word's frequency in a text . The opposite happens to be the method need pruning rules whenever the mass rules are generated. Within this proposed work specific documents are preprocessed before placing patterns discovery. Preprocessing the document dataset using tokenization, stemming, and probability filtering approaches. Proposed approach gives better decision rules compare to existing approach.

Authors and Affiliations

Pattan Kalesha , M. Babu Rao , Ch. Kavitha

Keywords

Related Articles

IPv4 Mobility Support

Mobile computing offers mobile users anytime, anywhere bi-directional reliable access to the Internet. Mobile IP as a network layer routing protocol has been designed by the IETF (Internet Engineering Task Force) to prov...

Efficient Preprocessing and Patterns Identification Approach for Text Mining

Due to the rapid expansion of digital data , knowledge discovery and data mining have attracted significant amount of a ttention for turning such data into helpful information and knowledge. Text categorization is contin...

Study of Routing Protocols in Mobile Ad Hoc Networks

Mobile ad hoc networks (MANETs) are rapidly evolving as an important area of mobility. MANETs are infrastructure less autonomous collection of mobile users that communicate over relatively bandwidth constrained wireless...

Query Based Expert Search Based on Relevance Class and Web Page Quality Ranking

Expert search is mostly used in the areas of academic groups, organizations. The general expert search problem which is observed is searching experts on the web where lot of web pages and experts names are considered. It...

A Review on Impersonation Attack in Mobile Ad-Hoc Network

An ad hoc network is a collection of mobile nodes that dynamically form a temporary network and are capable of communicating with each other without the use of a network infrastructure or any centralized administration....

Download PDF file
  • EP ID EP146823
  • DOI -
  • Views 129
  • Downloads 0

How To Cite

Pattan Kalesha, M. Babu Rao, Ch. Kavitha (2013). Efficient Preprocessing and Patterns Identification Approach for Text Mining. INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY, 6(2), 124-129. https://europub.co.uk/articles/-A-146823