A Context-Sensitive Approach to Find Optimum Language Model for Automatic Bangla Spelling Correction

Abstract

Automated spelling correction is an important phenomenon in typing that has intense effect on aiding both literate and semi-literate people while using keyboard or other similar devices. Such automated spelling correction technique also helps students significantly in learning process through applying proper words during word processing. A lot of work has been conducted for English language, but for Bangla, it is still not adequate. All work done so far in Bangla is context-free. Bangla is one of the mostly spoken languages (3.05% of world population) and considered seventh language of all languages in the world. In this paper, we propose a context-sensitive approach for automated spelling correction in Bangla. We make combined use of edit distance and stochastic, i.e. N-gram language model. We use six N-gram models in total. A novel approach is deployed in order to find the optimum language model in terms of performance. In addition, for finding out better performance, a large Bangla corpus of different word types is used. We have achieved a satisfactory and promising accuracy of 87.58%.

Authors and Affiliations

Muhammad Ifte Khairul Islam, Md. Tarek Habib, Md. Sadekur Rahman, Md. Riazur Rahman, Farruk Ahmed

Keywords

Related Articles

Evaluating the Usability of Optimizing Text-based CAPTCHA Generation

A CAPTCHA is a test that can, automatically, tell human and computer programs apart. It is a mechanism widely used nowadays for protecting web applications, interfaces, and services from malicious users and automated spa...

A Proposed Peer Selection Algorithm for Transmission Scheduling in P2P-VOD Systems

Video transmission in peer-to-peer video-on-demand faces some challenges. These challenges include long transmission delay and poor quality of service. The peer selection plays an important role in enhancing transmission...

Multi- Spectrum Bands Allocation for Time-Varying Traffic in the Flexible Optical Network

The flexible optical networks are the promising solution to the exponential increase of traffic generated by telecommunications networks. They combine flexibility with the finest granularity of optical resources. Therefo...

Mutual Exclusion Principle for Multithreaded Web Crawlers

This paper describes mutual exclusion principle for multithreaded web crawlers. The existing web crawlers use data structures to hold frontier set in local address space. This space could be used to run more crawler thre...

Smart Rubric-based Systematic Model for Evaluating and Prioritizing Academic Practices to Enhance the Education Outcomes

Recently, the impact of free-market economy, globalization, and knowledge economy has become a challenging and focal to higher educational institutions, which resulted in radical change. Therefore, it became mandatory fo...

Download PDF file
  • EP ID EP417625
  • DOI 10.14569/IJACSA.2018.091126
  • Views 103
  • Downloads 0

How To Cite

Muhammad Ifte Khairul Islam, Md. Tarek Habib, Md. Sadekur Rahman, Md. Riazur Rahman, Farruk Ahmed (2018). A Context-Sensitive Approach to Find Optimum Language Model for Automatic Bangla Spelling Correction. International Journal of Advanced Computer Science & Applications, 9(11), 184-191. https://europub.co.uk/articles/-A-417625