Mining High Utility Itemsets with Regular Occurrence
Journal Title: Journal of ICT Research and Applications - Year 2016, Vol 10, Issue 2
Abstract
High utility itemset mining (HUIM) plays an important role in the data mining community and in a wide range of applications. For example, in retail business it is used to find sets of sold products that give high profit, low cost, etc. These itemsets can help improve marketing strategies, design promotions and advertisements, etc. However, since HUIM considers only the utility values of items and itemsets, it may not be sufficient for observing the product-buying behavior of customers, such as information related to “regular purchases of sets of products having a high profit margin”. To address this issue, the occurrence behavior of itemsets (in terms of regularity) was investigated together with their utility values. The problem of mining high utility itemsets with regular occurrence (MHUIR) was then considered, that is, finding sets of co-occurring items with high utility values and regular occurrence in a database. An efficient single-pass algorithm, called MHUIRA, was introduced. A new modified utility-list structure, called NUL, was designed to maintain utility values and occurrence information efficiently and to speed up computing the utility of itemsets. Experimental studies on real and synthetic datasets and complexity analyses are provided to show the efficiency of MHUIRA combined with NUL, in terms of time and space usage, for mining interesting itemsets under regularity and utility constraints.
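To make the two constraints concrete, the following is a minimal brute-force sketch in Python. It is not MHUIRA and does not use the NUL structure, neither of which is specified in the abstract. It assumes definitions commonly used in this line of work, which the abstract itself does not state: the utility of an itemset in a transaction is the sum of quantity times unit profit over its items, and its regularity is the largest gap (in transaction ids) between consecutive occurrences, counting the gap before the first occurrence and after the last one. The function names, thresholds, and toy data are all hypothetical.

```python
from itertools import combinations


def utility_and_regularity(itemset, transactions, profit):
    """Return (total utility, regularity) of `itemset` over `transactions`.

    Assumed definitions (not taken from the abstract):
      - utility in one transaction = sum of quantity * unit profit
      - regularity = largest gap between consecutive occurrences,
        including the gaps at the start and end of the database
    """
    total_utility = 0
    max_gap = 0
    last_tid = 0  # transaction ids start at 1, so 0 marks "before the database"
    for tid, tx in enumerate(transactions, start=1):
        if all(item in tx for item in itemset):
            total_utility += sum(tx[item] * profit[item] for item in itemset)
            max_gap = max(max_gap, tid - last_tid)
            last_tid = tid
    # Gap between the last occurrence and the end of the database.
    max_gap = max(max_gap, len(transactions) - last_tid)
    return total_utility, max_gap


def mine_naive(transactions, profit, min_utility, max_regularity):
    """Enumerate every itemset and keep those meeting both thresholds.

    Exponential in the number of items; for illustration only.
    """
    items = sorted(profit)
    results = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            u, r = utility_and_regularity(itemset, transactions, profit)
            if u >= min_utility and r <= max_regularity:
                results[itemset] = (u, r)
    return results


# Toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2},
    {"b": 1, "c": 3},
    {"a": 2, "b": 1, "c": 1},
    {"c": 2},
    {"a": 1, "b": 1},
]
profit = {"a": 5, "b": 2, "c": 1}  # unit profit per item

found = mine_naive(transactions, profit, min_utility=15, max_regularity=2)
print(found)  # {('a',): (20, 2), ('a', 'b'): (28, 2)}
```

In this toy database, {a, b} appears in transactions 1, 3 and 5, so it is both profitable (utility 28) and regular (never absent for more than two consecutive transactions). The itemset {a, b, c} has a utility of 13 but occurs only once, so it fails both thresholds used here.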
Authors and Affiliations
Komate Amphawan