Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Arabic multi-word expressions are the combinations of two or more terms that associated with each other as one concept. The process of extracting such expressions is challenging especially when the length of the combination is getting longer. Recently, researchers attempted to extract nested noun compounds whichconsists of two or more combinations of nouns. Such extraction process requires comprehensive analysis using linguistic and statistical approaches. However, the process of extraction in the state of the art have extended to include 4-gra and 5-gram candidates. This paper aims to combine the extraction of nested noun compound andcollocation in order to extend the process of extraction to include 6-gram and 7-gram. For this manner, a linguistic approach comprises of various kinds of pattern has been used, as well as, three statistical measures have been utilized including NC-value, LLR and PMI. Results shown that the proposed method has the ability to extend the extraction to include longer candidates.

Authors and Affiliations

Maryam Yaseen Al-Mashhadani , Luma Adnan Al-Sagban

Keywords

Related Articles

 Decisive Role Model for Data Association

 Abstract : This paper focuses on defined rule based on the itemsets appearing in the database and their relationship among themselves. Features are extracted leading to data trends, patterns and associations. Const...

 A Smart and Wearable Cardiac Healthcare System with Monitoring of Sudden Fall for Elderly and Post-Operative Patients

Abstract: The dominance of chronic diseases, driven by an increasingly aging population with a new health paradigm that emphasizes early finding, early diagnosis and early treatment, is highly recommended. Especially, Ca...

Energy Secure Dynamic Source Routing (ESDSR) Protocol For (MANET)

Abstract : MANET (Mobile Ad-hoc Network) is an unstructured, self-organized and self-deployment network. It can be set up anywhere, anytime because there is no need of centralize base station. Nodes in MANET are connecte...

 Big Data: The Future of Data Storage

 Abstract: According to Internet World statistics, todayInternet has 1.7 Billion users, compared with the population of 6.7 billion people.Around 40% of the world population is connected via internet across the gl...

 Improved Technique for Gait Reconigition & Performing Feature Extraction Using Videos of Different Forms

 Abstract: This paper describes the importance of GAIT Recognition where identification of human being from far distance without any sort of co-operation from his side is performed. Motive to develop automatic ident...

Download PDF file
  • EP ID EP154478
  • DOI -
  • Views 103
  • Downloads 0

How To Cite

Maryam Yaseen Al-Mashhadani, Luma Adnan Al-Sagban (2016). Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 64-69. https://europub.co.uk/articles/-A-154478