RACAI-RoTb: A Core of a Romanian Treebank Syntactically Annotated with Dependency Relations

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2015, Vol 8, Issue 2

Abstract

This article presents the creation of a core treebank for Romanian, consisting of 5,000 sentences syntactically annotated within a dependency grammar framework. In the Introduction we argue for the need for such a language resource, given that Romanian is poorly represented in electronic form compared with widely used international languages. Internationally, large treebanks (containing hundreds of thousands of sentences) have been created since the 1990s (section 2.1), while the few initiatives dedicated to Romanian contain fewer than several thousand sentences (section 2.2). We present the methodology for selecting the sentences included in our corpus (section 3), the dependency grammar we used (section 4), the annotation methodology (section 5), and the results of evaluating the automatic annotation against its subsequent manual correction (section 6). To speed up the annotation process, we start from a statistical annotation (using a statistical model of syntactic analysis for Spanish), which is then manually corrected by two linguists.
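As an illustration of how an automatic annotation can be scored against its manually corrected version, the sketch below computes unlabeled and labeled attachment scores (UAS and LAS) over head/relation pairs. It is a minimal sketch, not the authors' evaluation procedure: the choice of UAS/LAS as metrics, the function name `attachment_scores`, the token representation, and the relation labels in the toy data are all assumptions made for the example.

```python
from typing import List, Tuple

# Hypothetical token representation: (head_index, relation_label).
# A head index of 0 stands for the artificial root of the sentence.
Token = Tuple[int, str]


def attachment_scores(
    auto: List[List[Token]], gold: List[List[Token]]
) -> Tuple[float, float]:
    """Return (UAS, LAS) of automatic parses measured against corrected ones."""
    if len(auto) != len(gold):
        raise ValueError("both annotations must cover the same sentences")
    total = uas_hits = las_hits = 0
    for auto_sent, gold_sent in zip(auto, gold):
        if len(auto_sent) != len(gold_sent):
            raise ValueError("sentences must share the same tokenization")
        for (a_head, a_rel), (g_head, g_rel) in zip(auto_sent, gold_sent):
            total += 1
            if a_head == g_head:
                uas_hits += 1          # head is correct
                if a_rel == g_rel:
                    las_hits += 1      # head and label are both correct
    if total == 0:
        raise ValueError("no tokens to evaluate")
    return uas_hits / total, las_hits / total


# Toy data invented for illustration; the labels are not the paper's tag set.
auto = [[(2, "nsubj"), (0, "root"), (2, "obj")]]
gold = [[(2, "nsubj"), (0, "root"), (2, "obl")]]
uas, las = attachment_scores(auto, gold)
print(f"UAS={uas:.3f} LAS={las:.3f}")
```

In the toy sentence all three heads agree but one label differs, so the labeled score is lower than the unlabeled one.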

Authors and Affiliations

Elena Irimia, Verginica Barbu Mititelu

How To Cite

Elena Irimia, Verginica Barbu Mititelu (2015). RACAI-RoTb: A Core of a Romanian Treebank Syntactically Annotated with Dependency Relations. Romanian Journal of Human - Computer Interaction, 8(2), -. https://europub.co.uk/articles/-A-28966