A Context-Sensitive Approach to Find Optimum Language Model for Automatic Bangla Spelling Correction

Abstract

Automated spelling correction is an important phenomenon in typing that has intense effect on aiding both literate and semi-literate people while using keyboard or other similar devices. Such automated spelling correction technique also helps students significantly in learning process through applying proper words during word processing. A lot of work has been conducted for English language, but for Bangla, it is still not adequate. All work done so far in Bangla is context-free. Bangla is one of the mostly spoken languages (3.05% of world population) and considered seventh language of all languages in the world. In this paper, we propose a context-sensitive approach for automated spelling correction in Bangla. We make combined use of edit distance and stochastic, i.e. N-gram language model. We use six N-gram models in total. A novel approach is deployed in order to find the optimum language model in terms of performance. In addition, for finding out better performance, a large Bangla corpus of different word types is used. We have achieved a satisfactory and promising accuracy of 87.58%.

Authors and Affiliations

Muhammad Ifte Khairul Islam, Md. Tarek Habib, Md. Sadekur Rahman, Md. Riazur Rahman, Farruk Ahmed

Keywords

Related Articles

Vague Set Theory for Profit Pattern and Decision Making in Uncertain Data

Problem of decision making, especially in financial issues is a crucial task in every business. Profit Pattern mining hit the target but this job is found very difficult when it is depends on the imprecise and vague envi...

Workshare Process of Thread Programming and MPI Model on Multicore Architecture

Comparison between OpenMP for thread programming model and MPI for message passing programming model will be conducted on multicore shared memory machine architectures in order to find which has a better performance in t...

Sound user Interface with Touch Panel for Data and Information Expression and its Application to Meteorological Data Representation

Sound User Interface (SUI) with touch panel for representation of quantitative data and information together with its application to meteorological data representation is proposed. The proposed SUI is not a merely ear-co...

Toward an Effective Information Security Risk Management of Universities’ Information Systems Using Multi Agent Systems, Itil, Iso 27002,Iso 27005

Universities in the public and private sectors depend on information technology and information systems to successfully carry out their missions and business functions. Information systems are subject to serious threats...

Mining Opinion in Online Messages

The number of messages that can be mined from online entries increases as the number of online application users increases. In Malaysia, online messages are written in mixed languages known as ‘Bahasa Rojak’. Therefore,...

Download PDF file
  • EP ID EP417625
  • DOI 10.14569/IJACSA.2018.091126
  • Views 79
  • Downloads 0

How To Cite

Muhammad Ifte Khairul Islam, Md. Tarek Habib, Md. Sadekur Rahman, Md. Riazur Rahman, Farruk Ahmed (2018). A Context-Sensitive Approach to Find Optimum Language Model for Automatic Bangla Spelling Correction. International Journal of Advanced Computer Science & Applications, 9(11), 184-191. https://europub.co.uk/articles/-A-417625