Romanian dependency parser developed based on parsers for other Romanic languages
Journal Title: Romanian Journal of Human - Computer Interaction - Year 2014, Vol 7, Issue 1
Abstract
Determining the syntactic dependencies is an important task in natural language processing, as it is useful for improving the results of a wide range of applications, such as machine translation, opinion mining, question-answering systems and others. This paper presents an initial step for semi-automatically building a corpus annotated with syntactic dependencies for Romanian and enriched with information about the type of the words and the relationships between them. As Romanian lacks an open-source syntactic or dependency parser specially trained on Romanian phrases, this corpus is necessary for obtaining better results for linguistic applications that are in need of dependency parse trees. To achieve this, we have started from two types of well-known parsers, the first trained for French syntactic parsing and the second for Spanish dependency parsing, that have been modified for analyzing phrases written in Romanian. The results we have obtained using these two methods are compared with the ones returned by the single existing dependency parser trained for Romanian on a medium-size corpus, which is available at this moment as a web service.
Authors and Affiliations
Iulia Maria Florea, Traian Rebedea, Costin-Gabriel Chiru
Paronym Generation Algorithms for Malapropism Correction
The Web pages have been intensively used lately for automatic or semiautomatic extraction of useful information. Because of the open nature of the Web, the texts that have no spelling errors are very rare exceptions. One...
Study on a Design Methodology for an Intelligent Interface of a Recomender System
This paper presents a study on a design methodology for the interface of a recommender system. A usability evaluation is designed for an intelligent interface of a recommender system that runs along Tesys e-Learning plat...
The Components Of A Text To Speech System
Converting words from written form into speakable forms strongly influences the performance of a text-to-speech (TTS) system. The text analysis component of a TTS system is responsible for parsing the language structure...
A Multidimensional Model of the Usefulness of Facebook for University Students
The popularity of social networking websites among university students stimulated the interest for studying the potential of use for educational purposes. The objective of this study is to test and validate a multidimens...
An Analysis Of The Quality And Accessibility Of Suicide Information Available To The Romanian-Speaking User
As the potential impact of Internet use on suicidal behaviour is currently under questioned, experts have yet not conclusively ruled on the extent of this problem. At the moment, no one really knows what kind of informat...