Feature Selection Optimization using Hybrid Relief-f with Self-adaptive Differential Evolution
Journal Title: International Journal of Intelligent Engineering and Systems - Year 2017, Vol 10, Issue 2
Abstract
In many classification domains, the curse of dimensionality is a major challenge for researchers, and feature selection plays an important role in overcoming it. Relief-f is a filter method that ranks features by their relevance. Although relief-f has proved to be a powerful filter technique, it only ranks the features by their significance level. Hence, a feature selection step is embedded to select the most meaningful features based on their rank. Differential evolution (DE) is an evolutionary algorithm that is widely used in various classification domains. Because DE is simple to implement yet powerful, we combine relief-f with DE in our proposed feature selection method to solve the optimization problem. In this work, the population size and the number of generations are adaptively determined from the number of features returned by relief-f. The performance of the proposed method is compared with several feature selection techniques on ten datasets obtained from the UCI machine learning repository in order to demonstrate its superiority.
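The pipeline described in the abstract (rank features with a filter, then let DE search for a subset, with the population size and number of generations tied to the feature count) can be sketched roughly as follows. This is an illustrative sketch, not the authors' implementation. It makes several assumptions the abstract does not state: a simplified two-class Relief with one nearest hit and one nearest miss stands in for full relief-f; fitness is leave-one-out 1-nearest-neighbour accuracy; and the sizing rules `2 * d` and `3 * d` are placeholders. The names `relief_scores`, `loo_accuracy`, `de_select`, and `keep` are hypothetical.

```python
import random

def relief_scores(X, y):
    """Simplified Relief (not full relief-f): one nearest hit and miss per sample.
    Assumes at least two classes and at least two samples per class."""
    n, d = len(X), len(X[0])
    # Per-feature ranges scale differences into [0, 1]; constant features use 1.0.
    span = [(max(r[j] for r in X) - min(r[j] for r in X)) or 1.0 for j in range(d)]

    def diff(a, b, j):
        return abs(a[j] - b[j]) / span[j]

    def dist(a, b):
        return sum(diff(a, b, j) for j in range(d))

    w = [0.0] * d
    for i in range(n):
        hits = [k for k in range(n) if k != i and y[k] == y[i]]
        misses = [k for k in range(n) if y[k] != y[i]]
        h = min(hits, key=lambda k: dist(X[i], X[k]))
        m = min(misses, key=lambda k: dist(X[i], X[k]))
        for j in range(d):
            # Reward features that differ on the miss and agree on the hit.
            w[j] += (diff(X[i], X[m], j) - diff(X[i], X[h], j)) / n
    return w

def loo_accuracy(X, y, feats):
    """Leave-one-out 1-NN accuracy using only the feature indices in feats."""
    if not feats:
        return 0.0
    correct = 0
    for i in range(len(X)):
        nearest = min((k for k in range(len(X)) if k != i),
                      key=lambda k: sum((X[i][j] - X[k][j]) ** 2 for j in feats))
        correct += y[nearest] == y[i]
    return correct / len(X)

def de_select(X, y, keep, rng, F=0.5, CR=0.9):
    """DE/rand/1/bin over real vectors in [0, 1]; a gene > 0.5 selects a feature."""
    scores = relief_scores(X, y)
    ranked = sorted(range(len(scores)), key=lambda j: -scores[j])[:keep]
    d = len(ranked)
    pop_size = max(4, 2 * d)      # assumption: placeholder sizing rule
    generations = max(5, 3 * d)   # assumption: placeholder sizing rule

    def decode(v):
        return [ranked[j] for j in range(d) if v[j] > 0.5]

    pop = [[rng.random() for _ in range(d)] for _ in range(pop_size)]
    fit = [loo_accuracy(X, y, decode(v)) for v in pop]
    for _ in range(generations):
        for i in range(pop_size):
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            j_rand = rng.randrange(d)  # guarantees at least one mutated gene
            trial = [
                min(1.0, max(0.0, pop[a][j] + F * (pop[b][j] - pop[c][j])))
                if (rng.random() < CR or j == j_rand) else pop[i][j]
                for j in range(d)
            ]
            f = loo_accuracy(X, y, decode(trial))
            if f >= fit[i]:  # greedy one-to-one replacement
                pop[i], fit[i] = trial, f
    best = max(range(pop_size), key=lambda i: fit[i])
    return decode(pop[best]), fit[best]

rng = random.Random(0)
# Toy data: feature 0 separates the two classes, features 1-3 are noise.
y = [i % 2 for i in range(20)]
X = [[label + rng.gauss(0, 0.05)] + [rng.random() for _ in range(3)] for label in y]
selected, acc = de_select(X, y, keep=3, rng=rng)
print("selected features:", selected, "LOO accuracy:", acc)
```

In this sketch the filter stage only prunes and orders the candidates, while DE decides which of the retained features to keep, mirroring the division of labour the abstract describes.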
Authors and Affiliations
M Zainudin
Optimal Decision Tree Based Unsupervised Learning Method for Data Clustering
Clustering is an investigative data analysis task. It aims to find the intrinsic structure of data by organizing data objects into similarity groups or clusters. Our investigation using a pattern based clustering on nume...
Classification of Imbalanced Data Using a Modified Fuzzy-Neighbor Weighted Approach
Classification of imbalanced datasets is one of the widely explored challenges of the decade. The imbalance occurs in many real world datasets due to uneven distribution of data into classes, i.e. one class has more inst...
Reliable and Efficient Distribution of Multicast Session Key for Deduplicated Data in Cloud Computing
Data deduplication is one of the fascinating features of any cloud computing storage service which is generally realized as Cross User Data Deduplication (CUDD). Although it provides optimization which is challenging to...
Multi Agent Based Diabetes Diagnosing and Classification with the Aid of Hybrid Firefly-Neural Network
A multi agent distributed data mining system for diagnosing diabetes and classification is proposed. Here we are introducing four agents namely user agent, connection agent, updation agent, and security agent. In which e...