Effect of Principle Component Analysis and Support Vector Machine in Software Fault Prediction
Journal Title: INTERNATIONAL JOURNAL OF COMPUTER TRENDS & TECHNOLOGY - Year 2014, Vol 7, Issue 3
Abstract
Machine Learning (ML) approaches have a great impact in fault prediction. Demand for producing quality assured software in an organization has been rapidly increased during the last few years. This leads to increase in development of machine learning algorithms for analyzing and classifying the data sets, which can be used in constructing models for predicting the important quality attributes such as fault proneness. Defective modules in software project have a considerable risk which reduces the quality of the software. This paper mainly addresses the software fault prediction using hybrid Support Vector Machine (SVM) classifier. We conduct a comparative study using the WEKA tool for three different levels of software metrics (package level, class level and method level) with hybrid SVM classifiers using feature selection techniques such as Principle Component Analysis (PCA). The experiments are carried out on the datasets such as NASA KC1 method level data set, NASA KC1 class level dataset and Eclipse dataset for package level metrics. The feature selection techniques evolved by experiments shows that Principle Component Analysis (PCA) with hybrid SVM performs better than other feature selection techniques.
Authors and Affiliations
A. Shanthini , G. Vinodhini , RM. Chandrasekaran
A Novel Technique in Cryptography for Data Hiding i n Digital Images
Visual cryptography [1],[6] an emerging technology used in the purpose of data hiding and other specific purposes, uses the characteristics of normal encryption[13] and decryption levels of many digital images. Generally...
Implementation of Minutiae Based Fingerprint Identification System using Crossing Number Concept
Biometric system is essentially a pattern recognition system which recognizes a person by determining the authenticity of a specific physiological (e.g., fingerprints, face, retina, iris) or behavioral (e.g., gait, signa...
Study of TCP Packet Labeling to Alleviate Time-Out
Many applications (e.g., cluster based storage and Map Reduce) in modern data centers require a high fan-in, many- to-one type of data communication (known as TCP in cast), which could cause severe in cast congestion in...
Survey on Security Issues and Solutions in Cloud Computing
Cloud computing is a combination of several key technologies that have evolved and matured over the years. Cloud computing has a potential for cost savings to the enterprises but the security risk are also enormous. Clou...
Categorical Heuristic for Attribute Based Encryption in the Cloud Server
Attribute-based encryption (ABE) is a public-key based one-to-many encryption that allows users to encrypt and decrypt data based on user attributes. A promising application of ABE is flexible access control of encrypted...