Feature subset selection for high dimensional data with domain analysis using Semantic Mining

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 5

Abstract

Feature subset selection involves identifying a subset of the most useful features that produces results compatible with those of the original, entire set of features. A feature selection algorithm may be evaluated from both the efficiency and the effectiveness points of view. Efficiency concerns the time required to find a subset of features, while effectiveness relates to the quality of that subset. Existing feature subset selection algorithms rely only on statistical tests, such as the Pearson test or the symmetric uncertainty test, to measure the correlation between features, and then apply a threshold to filter out redundant and irrelevant features. FAST, proposed by Qinbao Song [9], uses the symmetric uncertainty test for feature subset selection. In this work we extend the FAST algorithm by applying domain analysis using semantic mining to improve the relevance of the selected feature subset.
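
The threshold-based filtering mentioned in the abstract can be illustrated with a minimal sketch, assuming discrete-valued features and the standard definition of symmetric uncertainty, SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)). The function names, the toy data, and the threshold value are hypothetical choices for illustration. The sketch covers only the relevance filter, not the FAST algorithm itself or the semantic mining extension proposed in the paper.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def symmetric_uncertainty(x, y):
    """SU(X, Y) = 2 * I(X; Y) / (H(X) + H(Y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0:
        # Both variables are constant, so there is nothing to share.
        return 0.0
    # Mutual information via the joint entropy: I = H(X) + H(Y) - H(X, Y).
    mutual_info = hx + hy - entropy(list(zip(x, y)))
    return 2.0 * mutual_info / (hx + hy)


def select_relevant(features, target, threshold):
    """Keep the names of features whose SU with the target exceeds the threshold.

    `features` maps a feature name to its column of discrete values.
    The threshold is an assumed, user-chosen cut-off.
    """
    return [
        name
        for name, column in features.items()
        if symmetric_uncertainty(column, target) > threshold
    ]


# Toy data (hypothetical): feature "a" mirrors the class, "b" is independent of it.
toy_features = {"a": [0, 0, 1, 1], "b": [0, 1, 0, 1]}
toy_target = [0, 0, 1, 1]
selected = select_relevant(toy_features, toy_target, threshold=0.5)
```

In this toy case, feature "a" is kept because it fully determines the class, while "b" is filtered out as irrelevant.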

Authors and Affiliations

Abdul Majeed K. M, Pallavi K. N, Tanvir Habib Sardar

  • EP ID EP110723
  • DOI 10.9790/0661-1651108111

How To Cite

Abdul Majeed K. M, Pallavi K. N, Tanvir Habib Sardar (2014). Feature subset selection for high dimensional data with domain analysis using Semantic Mining. IOSR Journals (IOSR Journal of Computer Engineering), 16(5), 108-111. https://europub.co.uk/articles/-A-110723