Relevant Feature Selection from High-Dimensional Data Using MST Based Clustering
Journal Title: International journal of Emerging Trends in Science and Technology - Year 2015, Vol 2, Issue 3
Abstract
Feature selection is the process of identifying a subset of the most useful features that produces compatible results as the original entire set of features. Features provide the information about the data set. In Highdimensional data representation each sample is described by many features. The data sets are typically not task-specific, many features are irrelevant or redundant and should be pruned out or filtered for the purpose of classifying target objects. Given a set of features the feature selection problem is to find a subset of features that “maximizes the learner’s ability to classify patterns”. A graph theoretic clustering algorithm based on boruvka’s algorithm is implemented and experimentally evaluated in this paper. The proposed algorithm works in two steps. In the first step, features are divided into clusters by using graphtheoretic clustering methods. In the second step, the most representative feature that is strongly related to target classes is selected from each cluster to form a subset of features. All the representative features from different clusters form the final feature subset. After finding feature subset accuracy of a classifier, time required for classification and proportion of features selected can be calculated
Authors and Affiliations
Yaswanth Kumar Alapati
Efficient scheduling Algorithm for Satisfying SLA between Cloud and Users
Testing assume a vital part in programming improvement cycle. In the interim, distributed computing has gotten to be one of the important patterned at data innovation. Founded on top of virtualization innovation, distrib...
Design and Performance Analysis for Welding Fumes Extraction System
In a technical institute the welding is basic labs which learn the joining of two similar metals and also a common industrial process. During Welding operation, the fumes and metallic gases are emitted from the work piec...
Study of the Effect of Amlodipine, Atenolol, Enalapril and Losartan in Patients with Mild to Moderate Hypertension
Objectives: To evaluate and compare the efficacy and tolerability of amlodipine, atenolol, enalapril and losartan in patients with mild to moderate hypertension. Research design and method: This study was conducted in me...
Contour Based Real Time Multiple Object Tracking
Contour based real time multiple object tracking is an important task in computer vision. contour based object tracking is useful in many areas such as motion based recognition, automated surveillance, human computer int...
Stature from Human Sternum in Females
An essential aspect of medico-legal proceedings is stature estimation. Securing the regression formula for calculation of stature of adult females by assessing the length of manubrium and meso-sternum is the objective of...