POS tagger based on second-order HMM

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2012, Vol 5, Issue 3

Abstract

Part-of-speech (POS) tagging is the process of labelling each word in a sentence, phrase or paragraph with its corresponding part of speech. Because it is a component of other natural language processing modules, its results should be as precise as possible. Once a part of speech has been identified, it provides supplementary information about the parts of speech that can appear in the same sentence. In POS tagging, ambiguities arise because a word may have multiple morphological values depending on context. This paper presents an experimental analysis of a POS tagger based on a second-order Hidden Markov Model, using the Brown corpus. The tests were conducted to obtain results under various parameters. We show how the accuracy of a POS tagger for English changes when, on the one hand, the training set size varies and, on the other hand, the domain of the texts being tagged differs from the domain of the training set. We also identify the categories of texts from the Brown corpus that, when used as the training corpus, yield higher and lower tagger accuracy, respectively.
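As a rough illustration of the technique named in the abstract, the Python sketch below shows how a second-order (trigram) HMM tagger can be trained and decoded with the Viterbi algorithm over pairs of tags. It is not the authors' implementation: the add-one smoothing, the function names `train` and `tag`, and the two-sentence toy corpus are assumptions made for this example, whereas the paper itself works with the Brown corpus.

```python
import math
from collections import Counter

START, STOP = "<s>", "</s>"  # hypothetical boundary markers


def train(tagged_sentences):
    """Count tag trigrams, their two-tag contexts, and word emissions."""
    tri, ctx, emit, tag_count = Counter(), Counter(), Counter(), Counter()
    vocab = set()
    for sent in tagged_sentences:
        tags = [START, START] + [t for _, t in sent] + [STOP]
        for i in range(2, len(tags)):
            tri[tuple(tags[i - 2:i + 1])] += 1
            ctx[tuple(tags[i - 2:i])] += 1
        for word, t in sent:
            emit[(t, word.lower())] += 1
            tag_count[t] += 1
            vocab.add(word.lower())
    return {"tri": tri, "ctx": ctx, "emit": emit,
            "tag_count": tag_count, "vocab": vocab, "tags": sorted(tag_count)}


def log_trans(m, u, v, t):
    """log P(t | u, v) with add-one smoothing (an assumption of this sketch)."""
    outcomes = len(m["tags"]) + 1  # every tag plus STOP
    return math.log((m["tri"][(u, v, t)] + 1) / (m["ctx"][(u, v)] + outcomes))


def log_emit(m, t, w):
    """log P(w | t) with add-one smoothing; one extra slot for unknown words."""
    return math.log((m["emit"][(t, w)] + 1)
                    / (m["tag_count"][t] + len(m["vocab"]) + 1))


def tag(m, words):
    """Viterbi decoding where each state is the pair of the last two tags."""
    words = [w.lower() for w in words]
    best = {(START, START): 0.0}  # state -> best log-probability so far
    back = []                     # per word: state -> tag two positions back
    for w in words:
        new_best, pointers = {}, {}
        for (u, v), score in best.items():
            for t in m["tags"]:
                s = score + log_trans(m, u, v, t) + log_emit(m, t, w)
                if s > new_best.get((v, t), -math.inf):
                    new_best[(v, t)] = s
                    pointers[(v, t)] = u
        best = new_best
        back.append(pointers)
    # Pick the final state, including the transition to STOP.
    state = max(best, key=lambda p: best[p] + log_trans(m, p[0], p[1], STOP))
    result = [None] * len(words)
    for i in range(len(words) - 1, -1, -1):
        result[i] = state[1]
        state = (back[i][state], state[0])
    return result


# Toy training data, invented for this example only.
toy_corpus = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
]
model = train(toy_corpus)
print(tag(model, ["a", "dog", "sleeps"]))  # ['DET', 'NOUN', 'VERB']
```

Conditioning each tag on the two preceding tags is what makes the model second-order; a first-order HMM would condition on only one. With a real corpus, the training set size and the text categories used for training could be varied by changing what is passed to `train`, which is the kind of experiment the abstract describes.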

Authors and Affiliations

Dumitru-Clementin Cercel, Stefan Trăuşan-Matu

How To Cite

Dumitru-Clementin Cercel, Stefan Trăuşan-Matu (2012). POS tagger based on second-order HMM. Romanian Journal of Human - Computer Interaction, 5(3), -. https://europub.co.uk/articles/-A-28915