News Web Portal based on Natural Language Processing

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2008, Vol 1, Issue 3

Abstract

The paper presents an autonomous text classification module for a news web portal for the Romanian language. Statistical natural language processing techniques are combined in order to achieve a completely autonomous functionality of the portal. The news items are automatically collected from a large number of news sources using web syndication. Afterward, machine-learning techniques are used for achieving an automatic classification of the news stream. Firstly, the items are clustered using an agglomerative algorithm and the resulting groups correspond to the main news topics. Thus, more in-formation about each of the main topics is acquired from various news sources. Secondly, text classification algorithms are applied to automatically label each cluster of news items in a predetermined number of classes. More than a thou-sand news items were employed for both the training and the evaluation of the classifiers. The paper presents a complete comparison of the results obtained for each method.

Authors and Affiliations

Traian Rebedea, Costin-Gabriel Chiru, Ştefan Trăuşan-Matu

Keywords

Related Articles

New Technologies and Children’s Cognitive Development: Some Guidelines and Recommendations for Design

Most HCI principles and design recommendations have been tested and refined in the process of developing computer interfaces for the adult user. During the last few years, a growing demand for new sets of recommendations...

Non-conventional User-Interaction. General Considerations and Case Studies

The paper presents several aspects of interest regarding the current non-conventional user-interaction methods. The conducted experiments were focused on using specific hardware devices – e.g., sensor gloves, mobile term...

Applications on Touchscreen Mobile Devices for Visually Impaired People

Nowadays, the mobile devices are used for accomplishing a wide range of tasks and activities. Nonetheless, interacting with them represents a considerable challenge for the visually impaired people, especially in what co...

A Multidimensional Model of the Usefulness of Facebook for University Students

The popularity of social networking websites among university students stimulated the interest for studying the potential of use for educational purposes. The objective of this study is to test and validate a multidimens...

Model-Driven Engineering of User Interfaces: Promises, Successes, Failures, and Challenges

Model-driven engineering (MDE) of user interfaces consists in describing a user interface and aspects involved in it (e.g., task, domain, context of use) in models from which a final interface is produced. With one bi...

Download PDF file
  • EP ID EP28767
  • DOI -
  • Views 412
  • Downloads 10

How To Cite

Traian Rebedea, Costin-Gabriel Chiru, Ştefan Trăuşan-Matu (2008). News Web Portal based on Natural Language Processing. Romanian Journal of Human - Computer Interaction, 1(3), -. https://europub.co.uk/articles/-A-28767