Mining High Utility Itemsets with Regular Occurrence

Journal Title: Journal of ICT Research and Applications - Year 2016, Vol 10, Issue 2

Abstract

High utility itemset mining (HUIM) plays an important role in the data mining community and in a wide range of applications. For example, in retail business it is used to find sets of sold products that yield high profit, low cost, etc. These itemsets can help improve marketing strategies, design promotions and advertisements, etc. However, since HUIM only considers the utility values of items and itemsets, it may not be sufficient for observing the product-buying behavior of customers, such as information related to “regular purchases of sets of products having a high profit margin”. To address this issue, the occurrence behavior of itemsets (in terms of regularity) was investigated together with their utility values. The problem of mining high utility itemsets with regular occurrence (MHUIR) was then considered: finding sets of co-occurring items that have high utility values and occur regularly in a database. An efficient single-pass algorithm, called MHUIRA, was introduced. A new modified utility-list structure, called NUL, was designed to efficiently maintain utility values and occurrence information and to speed up the computation of itemset utility. Experimental studies on real and synthetic datasets, together with complexity analyses, are provided to show the efficiency of MHUIRA combined with NUL in terms of time and space usage for mining interesting itemsets under regularity and utility constraints.
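
To make the MHUIR problem statement concrete, the following is a minimal brute-force sketch in Python. It is not MHUIRA and does not use the NUL structure, neither of which is specified in this abstract. It assumes that the utility of an itemset is the sum of quantity times unit profit over the transactions containing it, and that regularity is the largest gap (in transaction ids) between consecutive occurrences, counting the start and end of the database; both definitions, the toy data, and the names `utility_and_regularity` and `mine` are assumptions made for illustration only.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"a": 2, "c": 1},
    {"a": 1, "b": 1, "c": 3},
    {"b": 4},
    {"a": 1, "b": 1},
]
# Hypothetical per-unit profit of each item.
unit_profit = {"a": 5, "b": 2, "c": 1}


def utility_and_regularity(itemset, transactions, unit_profit):
    """Return (total utility, largest occurrence gap) for an itemset.

    Assumed definitions, not taken from the paper:
    utility = sum of quantity * unit profit over containing transactions;
    regularity = maximum gap between consecutive occurrences, including
    the gaps to the start and to the end of the database.
    """
    total = 0
    last_tid = 0  # transaction ids start at 1; 0 marks the database start
    max_gap = 0
    for tid, tx in enumerate(transactions, start=1):
        if all(item in tx for item in itemset):
            total += sum(tx[item] * unit_profit[item] for item in itemset)
            max_gap = max(max_gap, tid - last_tid)
            last_tid = tid
    # Gap between the last occurrence and the end of the database.
    max_gap = max(max_gap, len(transactions) - last_tid)
    return total, max_gap


def mine(transactions, unit_profit, min_utility, max_regularity):
    """Enumerate every itemset and keep those meeting both thresholds."""
    items = sorted(unit_profit)
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            u, r = utility_and_regularity(itemset, transactions, unit_profit)
            if u >= min_utility and r <= max_regularity:
                result[itemset] = (u, r)
    return result


found = mine(transactions, unit_profit, min_utility=15, max_regularity=2)
for itemset, (u, r) in found.items():
    print(itemset, "utility:", u, "regularity:", r)
```

Exhaustive enumeration like this grows exponentially with the number of items, which is the kind of cost a single-pass, list-based approach is meant to avoid; the sketch only illustrates the two constraints being combined.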

Authors and Affiliations

Komate Amphawan

  • EP ID EP331711
  • DOI 10.5614/itbj.ict.res.appl.2016.10.2.5

How To Cite

Komate Amphawan (2016). Mining High Utility Itemsets with Regular Occurrence. Journal of ICT Research and Applications, 10(2), 153-176. https://europub.co.uk/articles/-A-331711