Extracting Topics from the Holy Quran Using Generative Models

Abstract

The Holy Quran is one of the Holy Books of God. It is considered one of the main references for an estimated 1.6 billion Muslims around the world. The language of the Holy Quran is Arabic. Both specialists and non-specialists in religion need to search for and look up information in the Holy Quran. Most research projects concentrate on translations of the Holy Quran into different languages; few pay attention to its original Arabic text. Keyword search is one Information Retrieval (IR) method, but it retrieves only exact matches. Semantic search aims at finding deeper meanings of a text, and it is an active field of study in Natural Language Processing (NLP). In this paper, topic modeling techniques are explored to set up a framework for semantic search in the Holy Quran. As the Holy Quran is the word of God, its meanings are unlimited. As a case study, the words of the chapter Joseph (Peace Be Upon Him (PBUH)) from the Holy Quran are analyzed using topic modeling techniques. The Latent Dirichlet Allocation (LDA) topic modeling technique is applied to two structures of the Joseph chapter (Hizb quarters and verses), each represented as words, roots, and stems. The log-likelihood is calculated for the two structures of the chapter. Results show that the best structure to use is verses, which gives the least energy for the data. Some of the attained topics are shown. These results suggest that topic modeling techniques failed to capture the coherent topics of the chapter accurately.
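
The comparison described in the abstract, fitting LDA to the same text under two different segmentations and comparing log-likelihoods, can be illustrated with a minimal sketch. This is not the paper's implementation: the collapsed Gibbs sampler, the hyperparameters (alpha, beta, iters), the helper names (fit_lda, log_likelihood, chunk), and the toy English tokens are all assumptions made for illustration. Short chunks loosely stand in for verses and long chunks for Hizb quarters.

```python
import math
import random


def fit_lda(docs, num_topics, alpha=0.1, beta=0.01, iters=100, seed=0):
    """Fit LDA by collapsed Gibbs sampling; docs is a list of token lists."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    corpus = [[word_id[w] for w in doc] for doc in docs]
    V, K = len(vocab), num_topics

    doc_topic = [[0] * K for _ in corpus]      # topic counts per document
    topic_word = [[0] * V for _ in range(K)]   # word counts per topic
    topic_total = [0] * K                      # token counts per topic
    assign = []                                # topic assigned to each token
    for d, doc in enumerate(corpus):
        zs = [rng.randrange(K) for _ in doc]   # random initial assignment
        for w, z in zip(doc, zs):
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(corpus):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Resample its topic from the conditional distribution.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta) / (topic_total[k] + V * beta)
                    for k in range(K)
                ]
                z = rng.choices(range(K), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return corpus, doc_topic, topic_word, topic_total


def log_likelihood(corpus, doc_topic, topic_word, topic_total,
                   alpha=0.1, beta=0.01):
    """Sum of log p(w | d) over all tokens, using smoothed count estimates."""
    K, V = len(topic_total), len(topic_word[0])
    total = 0.0
    for d, doc in enumerate(corpus):
        n_d = len(doc)
        for w in doc:
            p = sum(
                (doc_topic[d][k] + alpha) / (n_d + K * alpha)
                * (topic_word[k][w] + beta) / (topic_total[k] + V * beta)
                for k in range(K)
            )
            total += math.log(p)
    return total


def chunk(tokens, size):
    """Split one token stream into consecutive units of the given size."""
    return [tokens[i:i + size] for i in range(0, len(tokens), size)]


# Toy placeholder tokens (hypothetical; not text from the Holy Quran).
tokens = ("river water boat fish river water "
          "mountain rock snow peak mountain rock ").split() * 4

for name, size in [("short units", 6), ("long units", 24)]:
    model = fit_lda(chunk(tokens, size), num_topics=2)
    print(name, round(log_likelihood(*model), 2))
```

Because both segmentations cover exactly the same tokens, their total log-likelihoods can be compared directly; a higher (less negative) value indicates the better-fitting structure under this estimate. Applying the same idea to Arabic text would additionally require tokenization and, for the root and stem representations, a morphological analyzer, which this sketch does not cover.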

Authors and Affiliations

Mohammad Alhawarat

  • DOI 10.14569/IJACSA.2015.061238

How To Cite

Mohammad Alhawarat (2015). Extracting Topics from the Holy Quran Using Generative Models. International Journal of Advanced Computer Science & Applications, 6(12), 288-294. https://europub.co.uk/articles/-A-95819