FEATURE SELECTION METHODS AND ALGORITHMS
Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 5
Abstract
Feature selection is an important topic in data mining, especially for high dimensional datasets. Feature selection (also known as subset selection) is a process commonly used in machine learning, wherein subsets of the features available from the data are selected for application of a learning algorithm. The best subset contains the least number of dimensions that most contribute to accuracy; we discard the remaining, unimportant dimensions. This is an important stage of preprocessing and is one of two ways of avoiding the curse of dimensionality (the other is feature extraction). There are two approaches in Feature selection known as Forward selection and backward selection. Feature selection has been an active research area in pattern recognition, statistics, and data mining communities. The main idea of feature selection is to choose a subset of input variables by eliminating features with little or no predictive information. Feature selection methods can be decomposed into three broad classes. One is Filter methods and another one is Wrapper method and the third one is Embedded method. This paper presents an empirical comparison of feature selection methods and its algorithms. In view of the substantial number of existing feature selection algorithms, the need arises to count on criteria that enable to adequately decide which algorithm to use in certain situations. This work reviews several fundamental algorithms found in the literature and assesses their performance in a controlled scenario.
Authors and Affiliations
L. Ladha , T. Deepa,
Quantum Computation and Consciousness in Cyclic and Mythological Models of Universe
Cyclic models such as Steinhardt-Turok model, Baum-Frampton model, and CCC models have been proposed for the universe. It has been postulated that the value of the physical constants in different aeons may possibly be di...
Chairperson and Secretarius Meeting Guides for Electronic Meeting Direction
Facilitation and guidance of computer-supported meetings is a well-known activity that can be supported electronically. Various forms of facilitator support have been developed over the years. This paper presents a uniqu...
A Survey of QOS with IEEE 802.11e
IP is the fundamental protocol of Internet. It provides best efforts service. It has no in-built mechanisms to provide Quality of service. Some of the applications that are being used in Internet require Quality of servi...
AN EFFECTIVE RETRIVAL SCHEME FOR SOFTWARE COMPONENT REUSE
Software component reuse has become of much interest in the software community due to its potential benefits, cost benefit, time saving, etc. which include increased product quality and decreased product development cost...
A Review of Feature Selection Algorithms for Data Mining Techniques
Feature selection is a pre-processing step, used to improve the mining performance by reducing data dimensionality. Even though there exists a number of feature selection algorithms, still it is an active research area i...