Feature-rich PoS Tagging through Taggers Combination : Experience in Arabic

Journal Title: Transactions on Machine Learning and Artificial Intelligence - Year 2017, Vol 5, Issue 4

Abstract

Since words can play different syntactic roles in different contexts, it is not trivial to assign the appropriate morphosyntactic category to each word according to the context. Part of Speech (PoS) tagging is the task which manage this issue. Several probabilistic methods have been adapted for PoS tagging such as Hidden Markov Models, Support Vector Machines, and Decision Tree. Based on these methods, languageindependent PoS taggers have been developed such as TnT, SVMTool, and Treetagger. The main purpose of this work is to combine automatically the output of these standard PoS taggers and investigate several options for how to do this combination. The experiments are applied to one of the morphologically complex languages, Arabic. In this paper, we highlight the use of these taggers via various experiments. In fact, the evaluations involve several tests on both Classical and Modern Standard Arabic, trained/untrained and tagged/untagged data. Finally, a deeper investigation of Arabic PoS tagging through these language-independent taggers combination is performed.

Authors and Affiliations

Imad Zeroual, Abdelhak Lakhouaja

Keywords

Related Articles

Data Editing for Semi-Supervised Co-Forest by the Local Cut Edge Weight Statistic Graph (CEWS-Co-Forest)

In order to address the large amount of unlabeled training data problem, many semisupervised algorithms have been proposed. The training data in semisupervised learning may contain much noise due to the insufficient numb...

Comparative Study of Harris and Active Contour Using Viola-Jones Algorithm for Facial Landmarks Detection

In this paper, we present a comparative study of two methods: Harris corners detector (H) and Active Contour detector (A.C) using ViolaJones algorithm (V.J) for facial landmarks (eyes, nose, mouth) corners detection. The...

Implementation and Comparison of Machine Learning Algorithms for Recognition of Fingerspelling in Indian Sign Language

Communication is the biggest hurdle faced by the hearing and speech impaired in leading a normal life. In this context, Sign Language is the most prominent means of communication. Machine learning and Computer Vision is...

E-CLONALG: A classifier based on Clonal Selection Algorithm

This paper proposes an improved version of CLONALG, Clone Selection Algorithm based on Artificial Immune System(AIS), that matches with the conventional classifiers in terms of accuracy tested on the same data sets. Clon...

Creatinine, Urea and Uric Acid in Hospitalized Patients with and Without Hyperglycemia Analysis using Generalized Additive Model

Hyperglycemia is an important risk factor for heart disease and premature mortality. In hospitalized patients, it is related to an increase in morbidity and development of other disease like kidney disease. To evaluate t...

Download PDF file
  • EP ID EP308390
  • DOI 10.14738/tmlai.54.2981
  • Views 67
  • Downloads 0

How To Cite

Imad Zeroual, Abdelhak Lakhouaja (2017). Feature-rich PoS Tagging through Taggers Combination : Experience in Arabic. Transactions on Machine Learning and Artificial Intelligence, 5(4), 112-122. https://europub.co.uk/articles/-A-308390