Sentiment Analysis for Roman Urdu

Abstract

The majority of online comments/opinions are written in text-free format. Sentiment Analysis can be used as a measure to express the polarity (positive/negative) of comments/opinions. These comments/ opinions can be in different languages i.e. English, Urdu, Roman Urdu, Hindi, Arabic etc. Mostly, people have worked on the sentiment analysis of the English language. Very limited research work has been done in Urdu or Roman Urdu languages. Whereas, Hindi/Urdu is the third largest language in the world. In this paper, we focus on the sentiment analysis of comments/opinions in Roman Urdu. There is no publicly available Roman Urdu public opinion dataset. We prepare a dataset by taking comments/opinions of people in Roman Urdu from different websites. Three supervised machine learning algorithms namely NB (Naive Bayes), LRSGD (Logistic Regression with Stochastic Gradient Descent) and SVM (Support Vector Machine) have been applied on this dataset. From results of experiments, it can be concluded that SVM performs better than NB and LRSGD in terms of accuracy. In case of SVM, an accuracy of 87.22% is achieved.

Authors and Affiliations

A. Rafique, K. Malik, Z. Nawaz, F. Bukhari, A. H. Jalbani

Keywords

Related Articles

LIFEREC: A Framework for Recommending Users from Past Life Experiences

Life logging has been an eminent topic of concern in recent years with many researchers focusing on capturing daily life activities of human. With the proliferation of IoT (Internet of Things) domain, the devices are now...

Assurance due to the Usage of Two ERP Methods: Microsoft Dynamics AX and SAP

A speculation-based resource organising technique aids agencies in routing information across many industrial components. Organisation functions through IT (Information Technology) with the use of the latest technology h...

Exergy Analysis of a Subcritical Reheat Steam Power Plant with Regression Modeling and Optimization

In this paper, exergy analysis of a 210 MW SPP (Steam Power Plant) is performed. Firstly, the plant is modeled and validated, followed by a parametric study to show the effects of various operating parameters on the perf...

LabVIEW Based Simulator for Solar Cell Characteristics and MPPT Under Varying Atmospheric Conditions

Though intermittent, solar energy is a clean and eternal source of energy. PV (Photovoltaic) cell is one of the technology to harness the solar energy and use it as electricity. In recent years rising cost of electricity...

Factors Causing Health and Safety Hazards in Construction Projects in Pakistan

In spite of technical advancements, construction industry in developing countries, including Pakistan, heavily relies upon manual labor and orthodox methods of construction. Such practices then give rise to safety issues...

Download PDF file
  • EP ID EP557640
  • DOI 10.22581/muet1982.1902.20
  • Views 90
  • Downloads 0

How To Cite

A. Rafique, K. Malik, Z. Nawaz, F. Bukhari, A. H. Jalbani (2019). Sentiment Analysis for Roman Urdu. Mehran University Research Journal of Engineering and Technology, 38(2), 463-470. https://europub.co.uk/articles/-A-557640