Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Arabic multi-word expressions are the combinations of two or more terms that associated with each other as one concept. The process of extracting such expressions is challenging especially when the length of the combination is getting longer. Recently, researchers attempted to extract nested noun compounds whichconsists of two or more combinations of nouns. Such extraction process requires comprehensive analysis using linguistic and statistical approaches. However, the process of extraction in the state of the art have extended to include 4-gra and 5-gram candidates. This paper aims to combine the extraction of nested noun compound andcollocation in order to extend the process of extraction to include 6-gram and 7-gram. For this manner, a linguistic approach comprises of various kinds of pattern has been used, as well as, three statistical measures have been utilized including NC-value, LLR and PMI. Results shown that the proposed method has the ability to extend the extraction to include longer candidates.

Authors and Affiliations

Maryam Yaseen Al-Mashhadani , Luma Adnan Al-Sagban

Keywords

Related Articles

An Optimized and Secured Ranking Approach for Retrieving Cloud Data Using Keyword Search

Abstract: Cloud computing is a versatile technology that emerged as a solution to reduce costs in organizations by providing on-demand high quality applications and services from a centralized pool of configurable comput...

Design and Implementation of Thresholding Algorithm based on MFR for Retinal Fundus Images

Abstract: In this paper, the entropy of maximum filter response (MFR) is applied followed by normalization and thresholding for retinal fundus image is used. The performance of our proposed method has been assessed on 23...

 Grid Computing- An Emerging Technology that enables large-scale resource sharing

 Abstract: In the last few years there has been a rapid exponential increase in computer processing power, data storage and communication. But still there are many complex and computation intensive problems, which c...

Mobility Management Schemes for WMNS Using Pointer Forwarding Techniques

Abstract: The efficient mobility management schemes based on pointer forwarding for wireless mesh networks (WMNs) with the objective to reduce the overall network traffic incurred by mobility management and packet delive...

 “An efficient IP trace back using packetizing logging and preshared key exchange”

 Abstract: Here in this work we presented a hybrid model of packet marking and logging for the IP trace backfor the node that wants to attack any node in the network. The main idea is to detect the DOS attacks in th...

Download PDF file
  • EP ID EP154478
  • DOI -
  • Views 76
  • Downloads 0

How To Cite

Maryam Yaseen Al-Mashhadani, Luma Adnan Al-Sagban (2016). Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 64-69. https://europub.co.uk/articles/-A-154478