Urdu to Punjabi Machine Translation: An Incremental Training Approach

Abstract

The statistical machine translation approach is highly popular in automatic translation research area and promising approach to yield good accuracy. Efforts have been made to develop Urdu to Punjabi statistical machine translation system. The system is based on an incremental training approach to train the statistical model. In place of the parallel sentences corpus has manually mapped phrases which were used to train the model. In preprocessing phase, various rules were used for tokenization and segmentation processes. Along with these rules, text classification system was implemented to classify input text to predefined classes and decoder translates given text according to selected domain by the text classifier. The system used Hidden Markov Model(HMM) for the learning process and Viterbi algorithm has been used for decoding. Experiment and evaluation have shown that simple statistical model like HMM yields good accuracy for a closely related language pair like Urdu-Punjabi. The system has achieved 0.86 BLEU score and in manual testing and got more than 85% accuracy.

Authors and Affiliations

Umrinderpal Singh, Vishal Goyal, Gurpreet Lehal

Keywords

Related Articles

Requirements Prioritization and using Iteration Model for Successful Implementation of Requirements

Requirements prioritization is ranking of software requirements in particular order. Prioritize requirements are easy to manage and implement while un-prioritized requirements are costly and consume much time as total es...

Autonomous Vehicle-to-Vehicle (V2V) Decision Making in Roundabout using Game Theory

Roundabout intersections promote a continuous flow of traffic. Roundabouts entry move traffic through an intersection more quickly, and with less congestion on approaching roads. With the introduction of smart vehicles a...

Improvement of Data Transmission Speed and Fault Tolerance over Software Defined Networking

Software Defined Networking (SDN) is a new networking paradigm where control plane is decoupled from the forwarding plane. Nowadays, for the development of information technology large number of data traffic has been add...

Universal Simplest possible PLC using Personal Computer

Need of industrial automation and control is not closed yet. PLC, the programmable logic controller as available in 2009 with all standardized possible features, discussed here concisely. This work on PLC gives a simples...

Minimizing Information Asymmetry Interference using Optimal Channel Assignment Strategy in Wireless Mesh Networks

Multi-radio multi-channel wireless mesh networks (MRMC-WMNs) in recent years are considered as the prioritized choice for users due to its low cost and reliability. MRMC-WMNs is recently been deployed widely across the w...

Download PDF file
  • EP ID EP128283
  • DOI 10.14569/IJACSA.2016.070428
  • Views 96
  • Downloads 0

How To Cite

Umrinderpal Singh, Vishal Goyal, Gurpreet Lehal (2016). Urdu to Punjabi Machine Translation: An Incremental Training Approach. International Journal of Advanced Computer Science & Applications, 7(4), 227-238. https://europub.co.uk/articles/-A-128283