Document Grouping by Using Meronyms and Type-2 Fuzzy Association Rule Mining

Journal Title: Journal of ICT Research and Applications - Year 2017, Vol 11, Issue 3

Abstract

The growth of the number of textual documents in the digital world, especially on the World Wide Web, is incredibly fast. This causes an accumulation of information, so we need efficient organization to manage textual documents. One way to accurately classify documents is using fuzzy association rules. The quality of the document clustering is affected by phase extraction of key terms and type of fuzzy logic system (FLS) used for clustering. The use of meronyms in the extraction of key terms to obtain cluster labels helps obtaining meaningful cluster labels and in addition ambiguities and uncertainties that occur in the rules of type-1 fuzzy logic systems can be overcome by using type-2 fuzzy sets. This study proposes a method of key term extraction based on meronyms with an initialization cluster using fuzzy association rule mining for document clustering. This method consists of four stages, i.e. preprocessing of the document, extraction of key terms with meronyms, extraction of candidate clusters, and cluster tree construction. Testing of this method was done with three different datasets: classic, Reuters, and 20 Newsgroup. Testing was done by comparing the overall F-measure of the method without meronyms and with meronyms. Based on the testing, the method with meronyms in the extraction of keywords produced an overall F-measure of 0.5753 for the classic dataset, 0.3984 for the Reuters dataset, and 0.6285 for the 20 Newsgroup dataset.

Authors and Affiliations

Fahrur Rozi, Farid Sukmana

Keywords

Related Articles

Dynamic Path Planning for Mobile Robots with Cellular Learning Automata

In this paper we propose a new approach to path planning for mobile robots with cellular automata and cellular learning automata. We divide the planning into two stages. In the first stage, global path planning is perfor...

Rainfall Prediction in Tengger, Indonesia Using Hybrid Tsukamoto FIS and Genetic Algorithm Method

Countries with a tropical climate, such as Indonesia, are highly dependent on rainfall prediction for many sectors, such as agriculture, aviation, and shipping. Rainfall has now become increasingly unpredictable due to c...

Automatic Title Generation in Scientific Articles for Authorship Assistance: A Summarization Approach

This paper presents a study on automatic title generation for scientific articles considering sentence information types known as rhetorical categories. A title can be seen as a high-compression summary of a document. A...

Efficient CFO Compensation Method in Uplink OFDMA for Mobile WiMax

Mobile WiMax uses Orthogonal Frequency Division Multiple Access (OFDMA) in uplink where synchronization is a complex task as each user presents a different carrier frequency offset (CFO). In the Data Aided Phase Incremen...

A Printed PAW Image Database of Arabic Language for Document Analysis and Recognition

Document image analysis and recognition are important topics in the field of artificial intelligence. In this context, the availability of a database with good script samples is an important requirement for machine-learn...

Download PDF file
  • EP ID EP326327
  • DOI 10.5614/itbj.ict.res.appl.2017.11.3.4
  • Views 83
  • Downloads 0

How To Cite

Fahrur Rozi, Farid Sukmana (2017). Document Grouping by Using Meronyms and Type-2 Fuzzy Association Rule Mining. Journal of ICT Research and Applications, 11(3), 268-283. https://europub.co.uk/articles/-A-326327