Case Study: Automatic Identification of Romanian Suffixes

Journal Title: Romanian Journal of Human-Computer Interaction, 2011, Vol. 4, Issue 2

Abstract

With a view to automatically identifying derived words and their bases in the Romanian wordnet, and thereby enriching it with derivational relations and their associated semantic labels, this article presents a case study on automatically identifying the suffixes by which new words are formed in Romanian. The paper opens with a brief overview of derivation in Romanian and anticipates the challenges of the study. To identify suffixes automatically, we used generalized suffix trees to represent the lemmas of an electronic Romanian lexicon. We then applied a series of filters, derived from observation, which improved the results. The identified suffixes were evaluated against gold standard lists of suffixes grouped by the part of speech of the derived words. Evaluation proceeded in three stages, comparing the results with (1) the gold standard list of suffixes, (2) the gold standard list of suffixes and suffixoids, and (3) the unified gold standard list of suffixes and suffixoids for the parts of speech that display homonymy in Romanian. We show how the precision and accuracy of the algorithm change with the threshold imposed on suffix productivity. The study highlights the knowledge needed to recognize the morphological structure of words, the difficulties encountered in the process, and the relevance of this work for linguistic research and for Artificial Intelligence tasks such as information retrieval, summarization, question answering, and other tasks that rely on natural language processing.
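The pipeline described in the abstract (collect candidate suffixes from lemmas, filter by a productivity threshold, score against a gold standard list) can be illustrated with a minimal sketch. This is not the article's implementation. It replaces the generalized suffix tree with a plain count of word-final substrings, and it assumes that "productivity" means the number of distinct lemmas ending in a given string. The function names, the `min_stem` and `max_suffix` parameters, the toy lemma list, and the toy gold list are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def candidate_suffixes(lemmas, min_stem=2, max_suffix=5):
    """Count, for each word-final substring, how many distinct lemmas end in it.

    Simplified stand-in for a generalized suffix tree: only endings of
    length 1..max_suffix that leave at least min_stem characters are counted.
    """
    counts = Counter()
    for lemma in set(lemmas):
        for k in range(1, max_suffix + 1):
            if len(lemma) - k >= min_stem:
                counts[lemma[-k:]] += 1
    return counts


def productive_suffixes(counts, threshold):
    """Keep candidates whose productivity (assumed: lemma count) meets the threshold."""
    return {suffix for suffix, n in counts.items() if n >= threshold}


def precision(found, gold):
    """Share of proposed suffixes that also appear in the gold standard list."""
    return len(found & gold) / len(found) if found else 0.0


# Toy data, illustrative only (not taken from the article's lexicon or gold lists).
lemmas = ["muncitor", "cititor", "iubitor", "privitor", "frumos", "nervos", "lucra"]
toy_gold = {"itor", "os"}

counts = candidate_suffixes(lemmas)
found = productive_suffixes(counts, threshold=3)
score = precision(found, toy_gold)
print(sorted(found), score)
```

Even on this toy input, raw frequency over-generates: endings such as "r" and "or" pass the threshold alongside "itor", which suggests why filters beyond a frequency cutoff, like those the abstract mentions, would be needed.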

Authors and Affiliations

Verginica Barbu Mititelu

How To Cite

Verginica Barbu Mititelu (2011). Case Study: Automatic Identification of Romanian Suffixes. Romanian Journal of Human - Computer Interaction, 4(2), -. https://europub.co.uk/articles/-A-28852