MAULIK: An Effective Stemmer for Hindi Language

Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 5

Abstract

In this paper, a new stemmer has been proposed named as “Maulik” for Hindi Language. This stemmer is purely based on Devanagari script and it uses the Hybrid approach (combination of brute force and suffix removal approach). Stemming can be used to improve the effectiveness of information retrieval. The proposed stemmer is both computationally inexpensive and domain independent. The results are favorable and indicate that the proposed stemmer can be used effectively in Information Retrieval systems. This stemmer also reduces the problem of over-stemming and under-stemming which was found in A Light weight Stemmer for Hindi.

Authors and Affiliations

Upendra Mishra , Chandra Prakash

Keywords

Related Articles

WSLA Schema for Functionality Based Weight fixing of Non-Functional Parameters of Web Services

Recently Web services have evolved as a cost-effective solution for exchanging information between distributed applications over different operating system, platform, and software environment. The success of such a syste...

Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification

The phylogenomic classification of protein sequences attempts to categorize a given protein within the evolutionary context of the entire family. It involves mainly four steps: selection of homologous sequences, multiple...

Job-Oriented Monitoring of Clusters

There has been a lot of development in the field of clusters and grids. Recently, the use of clusters has been on rise in every possible field. This paper proposes a system that monitors jobs on large computational clust...

A Recent Survey on Bloom Filters in Network Intrusion Detection Systems

Computer networks are prone to hacking, viruses and other malware; a Network Intrusion Detection System (NIDS) is needed to protect the end-user machines from threats. An effective NIDS is therefore a network security sy...

SECURING WMN USING HONEYPOT TECHNIQUE

WMN has been a field of active research in the recent years. Lot of research has focused various routing mechanism but very little effort has been made towards attack detection or intrusion detection. In this paper, we p...

Download PDF file
  • EP ID EP150826
  • DOI -
  • Views 97
  • Downloads 0

How To Cite

Upendra Mishra, Chandra Prakash (2012). MAULIK: An Effective Stemmer for Hindi Language. International Journal on Computer Science and Engineering, 4(5), 711-717. https://europub.co.uk/articles/-A-150826