A Knowledge-based Topic Modeling Approach for Automatic Topic Labeling

Abstract

Probabilistic topic models, which aim to discover latent topics in text corpora define each document as a multinomial distributions over topics and each topic as a multinomial distributions over words. Although, humans can infer a proper label for each topic by looking at top representative words of the topic but, it is not applicable for machines. Automatic Topic Labeling techniques try to address the problem. The ultimate goal of topic labeling techniques are to assign interpretable labels for the learned topics. In this paper, we are taking concepts of ontology into consideration instead of words alone to improve the quality of generated labels for each topic. Our work is different in comparison with the previous efforts in this area, where topics are usually represented with a batch of selected words from topics. We have highlighted some aspects of our approach including: 1) we have incorporated ontology concepts with statistical topic modeling in a unified framework, where each topic is a multinomial probability distribution over the concepts and each concept is represented as a distribution over words; and 2) a topic labeling model according to the meaning of the concepts of the ontology included in the learned topics. The best topic labels are selected with respect to the semantic similarity of the concepts and their ontological categorizations. We demonstrate the effectiveness of considering ontological concepts as richer aspects between topics and words by comprehensive experiments on two different data sets. In another word, representing topics via ontological concepts shows an effective way for generating descriptive and representative labels for the discovered topics.

Authors and Affiliations

Mehdi Allahyari, Seyedamin Pouriyeh, Krys Kochut, Hamid Reza Arabnia

Keywords

Related Articles

Real-Time Concept Feedback in Lectures for Botho University Students

This is a mixed methodology study which focused on developing a real-time concept feedback system for Botho University students. The study takes advantage of the tablets distributed freely by the institution to ameliorat...

Multi-Depots Vehicle Routing Problem with Simultaneous Delivery and Pickup and Inventory Restrictions: Formulation and Resolution

Reverse logistics can be defined as a set of practices and processes for managing returns from the consumer to the manufacturer, simultaneously with direct flow management. In this context, we have chosen to study an imp...

Generating a Domain Specific Inspection Evaluation Method through an Adaptive Framework

The electronic information revolution and the use of computers as an essential part of everyday life are now more widespread than ever before, as the Internet is exploited for the speedy transfer of data and business. So...

Segmentation and Recognition of Handwritten Kannada Text Using Relevance Feedback and Histogram of Oriented Gradients – A Novel Approach

India is a multilingual country with 22 official languages and more than 1600 languages in existence. Kannada is one of the official languages and widely used in the state of Karnataka whose population is over 65 million...

A Mixed Method Study for Investigating Critical Success Factors (CSFs) of E-Learning in Saudi Arabian Universities

Electronic Learning (E-Learning) in the education system has become the obvious choice of the community over the globe because of its numerous advantages. The main aim of the present study is to identify Critical Success...

Download PDF file
  • EP ID EP261201
  • DOI 10.14569/IJACSA.2017.080947
  • Views 102
  • Downloads 0

How To Cite

Mehdi Allahyari, Seyedamin Pouriyeh, Krys Kochut, Hamid Reza Arabnia (2017). A Knowledge-based Topic Modeling Approach for Automatic Topic Labeling. International Journal of Advanced Computer Science & Applications, 8(9), 335-349. https://europub.co.uk/articles/-A-261201