Combining Arabic Nested Noun Compound and Collocation Extraction Using Linguistic and Statistical Approach
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5
Abstract
Arabic multi-word expressions are combinations of two or more terms that are associated with each other as one concept. Extracting such expressions is challenging, especially as the combination grows longer. Recently, researchers have attempted to extract nested noun compounds, which consist of two or more combinations of nouns. Such extraction requires comprehensive analysis using linguistic and statistical approaches. However, state-of-the-art extraction has so far been extended only to 4-gram and 5-gram candidates. This paper aims to combine the extraction of nested noun compounds and collocations in order to extend the process to 6-gram and 7-gram candidates. To this end, a linguistic approach comprising various kinds of patterns is used, together with three statistical measures: NC-value, LLR and PMI. Results show that the proposed method is able to extend the extraction to longer candidates.
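As a rough illustration of one of the statistical measures named in the abstract, the sketch below scores adjacent word pairs with pointwise mutual information (PMI), using the standard definition log2(P(x, y) / (P(x) P(y))). It is not the authors' implementation: the function name `pmi_scores`, the restriction to 2-grams, and the maximum-likelihood probability estimates are all assumptions made for the example, and the abstract does not say how the measures are applied to longer candidates or combined with the linguistic patterns.

```python
import math
from collections import Counter


def pmi_scores(tokens):
    """Return {(w1, w2): PMI} for every adjacent word pair in tokens.

    Hypothetical helper for illustration only; probabilities are plain
    relative frequencies estimated from the token list itself.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)  # number of adjacent pairs
    scores = {}
    for (w1, w2), count in bigrams.items():
        p_xy = count / n_bi
        p_x = unigrams[w1] / n_uni
        p_y = unigrams[w2] / n_uni
        # PMI is high when the pair co-occurs more often than chance predicts.
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))
    return scores
```

Calling `pmi_scores` on a tokenized corpus returns a dictionary that can be sorted by value to rank candidate pairs. In practice a minimum frequency threshold is usually applied first, because PMI tends to overrate pairs that occur only once.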
Authors and Affiliations
Maryam Yaseen Al-Mashhadani, Luma Adnan Al-Sagban
A Genetic Algorithm For Scheduling Jobs With Burst Time And Priorities
Abstract: Scheduling plays an extremely important role in our day-to-day life; likewise, the performance of a system is strongly affected by CPU scheduling. For better scheduling, the performance depends upon the paramete...
Empirical Coding for Curvature Based Linear Representation in Image Retrieval System
Abstract: Image retrieval systems are finding applications in all automation systems in which automated decisions need to be taken based on image contents. The prime requirement of such systems is to develop av...
A novel semantic level text classification by combining NLP and Thesaurus concepts
Abstract: Text categorization (also known as text classification or topic spotting) is the task of automatically sorting a set of documents into categories from a predefined set. Automated text classification is at...
ID3 Derived Fuzzy Rules for Predicting the Students Acedemic Performance
Abstract: This paper presents a technique that uses ID3 decision rules to produce fuzzy rules in order to optimize the prediction of students' academic performance. In this paper, the student administrative data for a...
Risk Minimization in Agribusiness using Soft Computing Technique
Abstract: India is an agriculture-based country, and the farmer community is the backbone of the agriculture sector. Agribusiness is one of the important segments of the agriculture sector. This paper aims to minimize agribusine...