Stemmers for Tamil Language: Performance Analysis

Abstract

Stemming is the process of extracting root word from the given inflection word and also plays significant role in numerous application of Natural Language Processing (NLP). Tamil Language raises several challenges to NLP, since it has rich morphological patterns than other languages. The rule based approach light-stemmer is proposed in this paper, to find stem word for given inflection Tamil word. The performance of proposed approach is compared to a rule based suffix removal stemmer based on correctly and incorrectly predicted. The experimental result clearly show that the proposed approach light stemmer for Tamil language perform better than suffix removal stemmer and also more effective in Information Retrieval System (IRS).

Authors and Affiliations

M. Thangarasu , Dr. R. Manavalan

Keywords

Related Articles

A SURVEY ON NEW CRYPTOGRAPHIC SYSTEM TECHNIQUES FOR DATA SHARING IN CLOUD STORAGE

Cryptography is the art and science of achieving security by encoding the message or data to make them unreadable. It is related to the aspects of network security such as privacy, reliability and accessibility of the da...

Prime Generating Algorithms by Skipping Composite Divisors

Three elementary versions of simple prime generating sieves have already been improved by skipping even divisors other than 2. All composite integers are multiples of primes. Taking help of the transitivity property of d...

Requirement Engineering Research

The requirement validation is vital for every successful software development. In this process, the requirements from the users are checks and analyzed with its consistency, completeness and correctness. The validation o...

Improving the Network Lifetime of MANETs through CSP routing Algorithm

Cooperative Communication is a technique that allows multiple node to transmit the same data. In this paper propose a Cooperative Shortest Path (CSP) Routing Algorithm for Mobile ad hoc Network (MANETs).Cooperative Short...

A NEW STATISTICAL APPROACH FOR IMAGE FUSION TECHNIQUE

Image Fusion is an emerging area of research in image processing and computer vision. This paper proposes an algorithm which is statistical based and it overcomes the shortcomings of the traditional image fusion algorith...

Download PDF file
  • EP ID EP151532
  • DOI -
  • Views 108
  • Downloads 0

How To Cite

M. Thangarasu, Dr. R. Manavalan (2013). Stemmers for Tamil Language: Performance Analysis. International Journal of Computer Science & Engineering Technology, 4(7), 902-908. https://europub.co.uk/articles/-A-151532