A Hybrid Multi-Word Terms Extraction System Applied to Topic Detection

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2014, Vol 13, Issue 10

Abstract

Mutli-word Terms extraction plays an important role in many Natural Language Processing (NLP) tasks. Despite their major importance, few works were dedicated to Arabic multi-word terms extraction. This paper proposes an automatic Arabic multi-word terms (MWTs) extraction system based on two major filtering steps: linguistics filter using a part-of-speech tagger along with morphological patterns and statistical filter based on probabilistic methods, namely: Log-Likelihood Ratio (LLR) and C-value. We evaluate the performances of the realized systems on Wattan; an Arabic oriented topic newspaper corpus. Our system manages to achieve 90.23% in term of multi-word extraction precision. We also study the use of MWTs as features in Arabic Topic Detection. The conducted experiments show good results.

Authors and Affiliations

Rim Koulali, Abdelouafi Meziane

Keywords

Related Articles

TIFIM: Tree based Incremental Frequent Itemset Mining over Streaming Data

Data Stream Mining algorithms performs under constraints called space used and time taken, which is due to the streaming property. The relaxation in these constraints is inversely proportional to the streaming speed of t...

ENLIGHTENING THE CLOUD COMPUTING DOMAIN

Cloud computing is an emerging model of “computing as utility” to provide convenient, on demand access to shared pool of resources. In this paper, this grooming technology is presented in terms of its basic character...

AN IMPLEMENTATION OF LOAD BALANCING ALGORITHM IN CLOUD ENVIRONMENT

Cloud Computing is an emerging computing paradigm. It aims to share data, calculations, and service transparently overascalable network of nodes. Since Cloud computing stores the data and disseminated resources in the op...

A REVIEW OF ELEMENTS FOR ASSESSMENT OFE-LEARNING READINESS IN LIBYA UNIVERSITIES

The e-learning is developing rapidly in the world nowadays, where technology has replaced the traditional educational approach in many universities around the world. This is because e-learning can save time and cost in e...

Integration of Human Resource Information System to DSS, CMS and other applications to increase productivity

Human Resources Management also deal with the facilities and requirements the Human Workforce are availing and need for their working process and carrier growth. It used to act as a bidirectional process flow which inc...

Download PDF file
  • EP ID EP650603
  • DOI 10.24297/ijct.v13i10.2333
  • Views 90
  • Downloads 0

How To Cite

Rim Koulali, Abdelouafi Meziane (2014). A Hybrid Multi-Word Terms Extraction System Applied to Topic Detection. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 13(10), 5105-5112. https://europub.co.uk/articles/-A-650603