Efficient Preprocessing and Patterns Identification Approach for Text Mining

Journal Title: INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY - Year 2013, Vol 6, Issue 2

Abstract

Due to the rapid expansion of digital data , knowledge discovery and data mining have attracted significant amount of a ttention for turning such data into helpful information and knowledge. Text categorization is continuing to become the most researched NLP problems on account of the ever-increasing levels of electronic documents and digital libraries. we present a novel text categorization method that puts together the decision on multiple attributes. Since the most of existing text mining methods adopted term-based approaches, all of these are affected by the difficulties of polysemy and synonymy. Existing pattern discovery technique includes the processes of pattern deploying and pattern evolving, to strengthen the impact of using and updating discovered patterns for looking for relevant and interesting information. But the current association Rules methods exist shortage in two aspects once it is used on patterns classification. a person is the strategy ignored the data about word's frequency in a text . The opposite happens to be the method need pruning rules whenever the mass rules are generated. Within this proposed work specific documents are preprocessed before placing patterns discovery. Preprocessing the document dataset using tokenization, stemming, and probability filtering approaches. Proposed approach gives better decision rules compare to existing approach.

Authors and Affiliations

Pattan Kalesha , M. Babu Rao , Ch. Kavitha

Keywords

Related Articles

MIMO Schemes With Spatial Modulation in Wireless Communication

The combination of spatial modulation (SM) and space-time block coding (STBC) provides more advantages than other modulation techniques. In the MA-SM system, the transmitted symbols are mapped into a high dimensional con...

Performance Analysis of SEP and LEACH for Heterogeneous Wireless Sensor Networks

While wireless sensor networks are increasingly equipped to handle more complicated functions, these battery powered sensors which used in network processing, use their constrained energy to enhance the lifetime of the n...

WLan Architecture

It is the review paper of Architecture of Wireless local area networks. In this paper we are discussing the architecture of wlan. A wireless LAN (WLAN) is a local area network based on wireless technology. Most modern lo...

ACO, Its Modification and Variants

Ant colony optimization (ACO) is a P based metaheuristic algorithm which has been proven as a successful technique and applied to a number of combinatorial optimization problems and is also applied to the Traveling sales...

New Julia and Mandelbrot Sets for Jungck Ishikawa Iterates

The generation of fractals and study of the dynamics of polynomials is one of the emerging and interesting field of research nowadays. We introduce in this paper the dynamics of polynomials z n - z + c = 0 for n 2 and ap...

Download PDF file
  • EP ID EP146823
  • DOI -
  • Views 112
  • Downloads 0

How To Cite

Pattan Kalesha, M. Babu Rao, Ch. Kavitha (2013). Efficient Preprocessing and Patterns Identification Approach for Text Mining. INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY, 6(2), 124-129. https://europub.co.uk/articles/-A-146823