An Intra and Inter-Topic Evaluation and Cleansing Method

Journal Title: Romanian Journal of Human - Computer Interaction - Year 2010, Vol 3, Issue 2

Abstract

Topic modeling is a growing research field, and novel ways of interpreting and evaluating its results are needed. We propose a method, relying on WordNet data, for evaluating and improving the performance of algorithms that generate topic models. We first propose a measure of a topic model’s fitness that factors in its broadness and redundancy. Then, for each individual topic, we determine the amount of relevant information it provides, along with its most important words and related concepts, by defining a cohesion function based on the topic’s projection onto WordNet concepts. The model as a whole is improved by eliminating each topic’s outliers with respect to the ontology projection. We define an inter-topic, ontology-based distance and use it to investigate the impact of removing redundant topics from a model with regard to the overlap between topics’ ontological projections. As an alternative to pruning less relevant topics, we also try clustering similar topics into conceptually cohesive groups. Results show that evaluating and improving statistical models with WordNet is a promising research track that leads to more coherent topic models.
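
The abstract does not state the actual definitions of the cohesion function or the inter-topic distance, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a tiny hand-made ontology (`TOY_HYPERNYMS`, a hypothetical stand-in for WordNet hypernyms), a cohesion score equal to the share of topic words covered by the best-covering concept, and a Jaccard distance between the concept sets of two topics. All names in it are hypothetical.

```python
from collections import Counter

# Hypothetical toy ontology: word -> ancestor concepts.
# A stand-in for WordNet hypernym lookups, used only for illustration.
TOY_HYPERNYMS = {
    "dog": {"canine", "animal"},
    "cat": {"feline", "animal"},
    "wolf": {"canine", "animal"},
    "car": {"vehicle", "artifact"},
    "truck": {"vehicle", "artifact"},
    "bus": {"vehicle", "artifact"},
}


def project(topic):
    """Project a topic (list of words) onto concepts: concept -> covered word count."""
    counts = Counter()
    for word in topic:
        counts.update(TOY_HYPERNYMS.get(word, ()))
    return counts


def cohesion(topic):
    """Assumed cohesion: fraction of topic words under the best-covering concept."""
    counts = project(topic)
    if not topic or not counts:
        return 0.0
    return max(counts.values()) / len(topic)


def remove_outliers(topic):
    """Keep only the words that fall under the dominant concept."""
    counts = project(topic)
    if not counts:
        return list(topic)
    # Sorting first makes the tie-break between equally good concepts deterministic.
    dominant = max(sorted(counts), key=counts.get)
    return [w for w in topic if dominant in TOY_HYPERNYMS.get(w, ())]


def topic_distance(a, b):
    """Assumed inter-topic distance: Jaccard distance between concept sets."""
    ca, cb = set(project(a)), set(project(b))
    if not ca and not cb:
        return 0.0
    return 1.0 - len(ca & cb) / len(ca | cb)


if __name__ == "__main__":
    mixed = ["dog", "cat", "wolf", "car"]
    print(cohesion(mixed))
    print(remove_outliers(mixed))
    print(topic_distance(["dog", "cat"], ["car", "truck"]))
```

In this sketch, a topic whose words share one concept scores close to 1, and two topics whose projections do not overlap are at distance 1, which is the kind of signal that could drive pruning or clustering of redundant topics.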

Authors and Affiliations

Claudiu Muşat, Marian-Andrei Rizoiu, Ştefan Trauşan-Matu

How To Cite

Claudiu Muşat, Marian-Andrei Rizoiu, Ştefan Trauşan-Matu (2010). An Intra and Inter-Topic Evaluation and Cleansing Method. Romanian Journal of Human - Computer Interaction, 3(2), -. https://europub.co.uk/articles/-A-28813