Sentiment Summerization and Analysis of Sindhi Text

Abstract

Text corpus is important for assessment of language features and variation analysis. Machine learning techniques identify the language terms, features, text structures and sentiment from linguistic corpus. Sindhi language is one of the oldest languages of the world having proper script and complete grammar. Sindhi is remained less resourced language computationally even in this digital era. Viewing this problem of Sindhi language, Sindhi NLP toolkit is developed to solve the Sindhi NLP and computational linguistics problems. Therefore, this research work may be an addition to NLP. This research study has developed an own Sindhi sentimentally structured and analyzed corpus on the basis of accumulated results of Sindhi sentiment analysis tool. Corpus is normalized and analyzed for language features and variation analysis using DTM and TF-IDF techniques. DTM and TF-IDF analysis is performed using n-gram model. The supervised machine learning model is formulated using SVMs and K-NN techniques to perform analysis on Sindhi sentiment analysis corpus dataset. Precision, recall and f-score show better performance of machine learning technique than other techniques. Cross validation techniques is used with 10 folds to validate and evaluate data set randomly for supervised machine learning analysis. Research study opens doors for linguists, data analysts and decision makers to work more for sentiment summarization and visual tracking.

Authors and Affiliations

Mazhar Ali, Asim Imdad Wagan

Keywords

Related Articles

Mining Educational Data to Analyze Students Performance

The main objective of higher education institutions is to provide quality education to its students. One way to achieve highest level of quality in higher education system is by discovering knowledge for prediction regar...

BioPay: Your Fingerprint is Your Credit Card

In recent years, credit and debit cards have become a very convenient method of payment. The growing use of card payments, hereafter referred to as credit cards, is evident in the daily use with many applications, such a...

An Analysis on Host Vulnerability Evaluation of Modern Operating Systems

Security is a major concern in all computing environments. One way to achieve security is to deploy a secure operating system (OS). A trusted OS can actually secure all the resources and can resist the vulnerabilities an...

Image Blocks Model for Improving Accuracy in Identification Systems of Wood Type

Image-based recognition systems commonly use an extracted image from the target object using texture analysis. However, some of the proposed and implemented recognitionues systems of wood types up to this time have not b...

Impact of Story Point Estimation on Product using Metrics in Scrum Development Process

Agile Software Development techniques are worldwide accepted, regardless of the definition of agile we all must agree with the fact that agile is maturing day by day, suppliers of software systems are moving away from tr...

Download PDF file
  • EP ID EP262255
  • DOI 10.14569/IJACSA.2017.081038
  • Views 77
  • Downloads 0

How To Cite

Mazhar Ali, Asim Imdad Wagan (2017). Sentiment Summerization and Analysis of Sindhi Text. International Journal of Advanced Computer Science & Applications, 8(10), 296-300. https://europub.co.uk/articles/-A-262255