A PROPOSED MODEL FOR EXTRACTING INFORMATION FROM ARABIC-BASED CONTROLLED TEXT DOMAINS, DISCUSSING THE INITIAL MODEL STEPS

Journal Title: International Journal of Applied and Natural Sciences - Year 2018, Vol 7, Issue 2

Abstract

Information extraction from Arabic as well as other languages text is commonly implemented over restricted text domains. Approaching open text domains is challenging, because of the syntactic, semantic and pragmatics ambiguities and variations in text. For the purpose of approaching more relaxed versions of Arabic text domains, Fasha et al. (Fasha et al. 2017) presented a high-level description fora proposed work methodology that can establish a model for extracting information from controlled text domains. In that work, controlled text domains were defined as the text domains that are not restricted in their linguistic features or their knowledge types yet they are not very unanticipated in these respects. In this paper, we discuss that work methodology and its implementation in more detail. Our discussion includes the initial phases of the methodology which covers the corpus preparation processes including its selection, analysis and annotation using a custom morpho-syntactic Part-of-Speech tagging scheme, we also discuss the designing of the supporting knowledge-base model which will be used to represent a

Authors and Affiliations

Mohammad Fasha, Nadim Obeid, Bassam Hammo

Keywords

Related Articles

Effect of Different Insecticides on Solenopsis Mealybug Parasitoid, A. Bambawalei (Hayat)

Inspite of the success of the biological control, chemical control is still being largely used as an important component of integrated pest management (IPM) and is used in conjunction with the biological control. But pes...

DISILLUSIONMENT, DISSONANCE AND ENTROPY AMIDST INDIAN AGRICULTURE: THE REFLECTION AND REFRACTION

According to primer and various propositions, the farmers and the farm economy of India have adequate reasons to be called fatigued and disillusioned. A scenario where more than 2.5 lakh farmers committed suicide, outnum...

Primary Signet-Ring Cell Carcinoma of the Urinary Bladder Literature Review

Primary signet ring cell carcinoma of the urinary bladder is very rare variant of mucus-producing adenocarcinoma, which has poor outcome. This tumor mainly occurs in the middle age patient with slight male predominance,...

ASSESSMENT OF SOME CHEMICAL AND SENSORY PROPERTIES OF DONKWA PRODUCED FROM THE BLEND OF MAIZE AND BAMBARA GROUNDNUT

This work investigated the compositional characteristics of donkwa produced from maize and bambara groundnut blends at 10%, 20%, 30%, and 50% of substitution levels of bambara groundnut. Samples were compared with donkwa...

EVALUATION OF ANTIFUNGAL ACTIVITY AND FORMULATION OF HERBAL HAIR OIL FROM Phyllanthusniruri

Phyllanthusniruriis a widespread tropical herb which is well known for its medicinal properties. In the present study, we evaluated the antifungal activity of acetone, hexane, chloroform and methanolic extract of leaves...

Download PDF file
  • EP ID EP275625
  • DOI -
  • Views 119
  • Downloads 0

How To Cite

Mohammad Fasha, Nadim Obeid, Bassam Hammo (2018). A PROPOSED MODEL FOR EXTRACTING INFORMATION FROM ARABIC-BASED CONTROLLED TEXT DOMAINS, DISCUSSING THE INITIAL MODEL STEPS. International Journal of Applied and Natural Sciences, 7(2), 65-86. https://europub.co.uk/articles/-A-275625