Statistical Testing on Prediction of Software Defects
Journal Title: EAI Endorsed Transactions on Energy Web - Year 2018, Vol 5, Issue 20
Abstract
Statistical tests are used to make inferences from data: they indicate whether an observed pattern is real or merely due to chance. The choice of test depends on the research design, the distribution of the data, and the types of the variables. In this paper, we address the high-dimensionality problem in software defect prediction using statistical tests. We first determined the distribution of the data in order to choose an appropriate test. We observed that most of the variables follow a gamma distribution and therefore applied the Wilcoxon rank-sum test, a nonparametric test, to assess the association between each input variable and the outcome variable. We retained the variables most strongly associated with the outcome. The performance of the classifier improved once the high-dimensionality problem was addressed with the Wilcoxon rank-sum test.
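The selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the metric names (`loc`, `comment_ratio`), the synthetic gamma-distributed data, the 0.05 significance threshold, and the use of a normal approximation without tie correction are all assumptions made for illustration. Each metric is compared between defective and clean modules with a two-sided rank-sum test, and only metrics with a small p-value are kept.

```python
import math
import random
from statistics import NormalDist


def average_ranks(values):
    """Return 1-based ranks; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank-sum test (normal approximation, no tie correction)."""
    n1, n2 = len(x), len(y)
    ranks = average_ranks(list(x) + list(y))
    w = sum(ranks[:n1])                      # rank sum of the first sample
    mean_w = n1 * (n1 + n2 + 1) / 2          # expected rank sum under H0
    sd_w = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (w - mean_w) / sd_w
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p


def select_features(columns, labels, alpha=0.05):
    """Keep metrics whose values differ between defective (1) and clean (0) modules."""
    selected = {}
    for name, values in columns.items():
        defective = [v for v, y in zip(values, labels) if y == 1]
        clean = [v for v, y in zip(values, labels) if y == 0]
        _, p = rank_sum_test(defective, clean)
        if p < alpha:
            selected[name] = p
    return selected


# Synthetic, hypothetical data: 100 defective modules followed by 100 clean ones.
random.seed(0)
labels = [1] * 100 + [0] * 100
columns = {
    # Assumed to be larger for defective modules (gamma scale 30 vs. 10).
    "loc": [random.gammavariate(2, 30) for _ in range(100)]
           + [random.gammavariate(2, 10) for _ in range(100)],
    # Assumed to have the same distribution in both groups.
    "comment_ratio": [random.gammavariate(2, 10) for _ in range(200)],
}

selected = select_features(columns, labels)
print(selected)
```

In practice a library routine such as `scipy.stats.ranksums` would normally replace the hand-written test; the standard-library version is used here only to keep the sketch self-contained.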
Authors and Affiliations
Satya Srinivas Maddipati, Malladi Srinivas