Predicting software aging related bugs from imbalanced datasets by using data mining techniques
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 1
Abstract
Abstract: Software aging bugs are related with the life-span of the software. Rebooting is one of the solutions of this problem, however, it is time consuming and causes resources loss. It is difficult to detect these bugsduring the time-limited software testing process. Data mining techniques can be useful to predict whether a piece of software has aging related bugs or not. The available datasets of software aging bugs present a challenge as they are imbalanced datasets. In these datasets, the number of data points with bugs is very small as compared to the number of data points with no bugs. It is important to predict the rare class (Bugs). In this paper we carried out experiment with a dataset containing data points related to aging-related bugs found in an open-source project MySQL DBMS. Data mining techniques developed for imbalanced datasets were compared with general data mining techniques. Various performance measures were used for the comparative study. The results suggest that data mining techniques developed for imbalanced datasets are more useful for correct prediction of data points related to aging related bugs. Data mining techniques developed for imbalanced datasets performed better than general data mining techniques on G-mean measure which is an important performance measure for imbalanced datasets
Authors and Affiliations
Amir Ahmad
Soft Phone Support Voice and Video Calling Using Sip And Rtp Protocol
Soft Phone is a VoIP soft phone that uses the Session Initiation Protocol. It is a powerful and unique SIP software telephone that lets users make phone and video calls using single software application using any Voice o...
Implementation of Point of Sale Software in Mobile Shop
At present,Point of Sale (POS) software is used widely in retail business. It has changed the manual system of business to computerized system. The main goal of this paper is to implement point of sale software which is...
Touchless Palmprint Verification using Shock Filter,SIFT, I-RANSAC, and LPD
Abstract: Palmprint have some basic features. These basic features are unique and unchangeable in one’s life.It is constant and not easy to fake. A palmprint contains three major lines that are called principal line, se...
A Review onImage Mining Techniques and its application on asoftware BOND
Abstract: Image processing is one of the most researched areas in computer science and it finds numerousapplications in various fields like, medical research and diagnosis, geological research, crime investigation,and so...
Towards a new ontology of the Moroccan Post-baccalaureatelearner profile for the E-orientation system “MMSyOrientation”
Abstract: Today E-orientation systems are interested in helping learners to choose a suitable branch to theirskills and preferences. In this context the research center within the University Hassan II Mohammedia AinChock...