Urdu to Punjabi Machine Translation: An Incremental Training Approach

Abstract

The statistical machine translation approach is highly popular in automatic translation research area and promising approach to yield good accuracy. Efforts have been made to develop Urdu to Punjabi statistical machine translation system. The system is based on an incremental training approach to train the statistical model. In place of the parallel sentences corpus has manually mapped phrases which were used to train the model. In preprocessing phase, various rules were used for tokenization and segmentation processes. Along with these rules, text classification system was implemented to classify input text to predefined classes and decoder translates given text according to selected domain by the text classifier. The system used Hidden Markov Model(HMM) for the learning process and Viterbi algorithm has been used for decoding. Experiment and evaluation have shown that simple statistical model like HMM yields good accuracy for a closely related language pair like Urdu-Punjabi. The system has achieved 0.86 BLEU score and in manual testing and got more than 85% accuracy.

Authors and Affiliations

Umrinderpal Singh, Vishal Goyal, Gurpreet Lehal

Keywords

Related Articles

Decision Making Systems for Managing Business Processes in Enterprises Groups

In the current economic realities, the forms of integration business entities through the creation of enterprise groups (EGs), reorganized from industry structures or created a new by acquiring existing companies, are be...

Personalized Semantic Retrieval and Summarization of Web Based Documents

The current retrieval methods are essentially based on the string-matching approach lacking of semantic information and can’t understand the user's query intent and interest very well. These methods do regard as the pers...

A Case Study for the IONEX CODE-Database Processing Tool Software: Ionospheric Anomalies before the Mw 8.2 Earthquake in Mexico on September 7, 2017

A software tool was developed in the Imaging Processing Research laboratory (INTI-Lab) that automatically downloads several IONEX files around a specific user input date and also performs statistical calculations to look...

Investigating Students’ Achievements in Computing Science Using Human Metric

This study investigates the role of personality traits, motivation for career choice and study habits in students’ academic achievements in the computing sciences. A quantitative research method was employed. Data was co...

Novel Causality in Consumer’s Online Behavior: Ecommerce Success Model

Online shopping (e-Shopping) has grown at a rapid pace with the advancement in modern web technologies, there are then socio and technical aspects (factors) in the mentioned e-shopping. The following research paper highl...

Download PDF file
  • EP ID EP128283
  • DOI 10.14569/IJACSA.2016.070428
  • Views 88
  • Downloads 0

How To Cite

Umrinderpal Singh, Vishal Goyal, Gurpreet Lehal (2016). Urdu to Punjabi Machine Translation: An Incremental Training Approach. International Journal of Advanced Computer Science & Applications, 7(4), 227-238. https://europub.co.uk/articles/-A-128283