ECLogger: Cross-Project Catch-Block Logging Prediction Using Ensemble of Classifiers

Journal Title: e-Informatica Software Engineering Journal - Year 2017, Vol 11, Issue 1

Abstract

Background: Software developers insert log statements in the source code to record program execution information. However, optimizing the number of log statements in the source code is challenging. Machine learning based within-project logging prediction tools, proposed in previous studies, may not be suitable for new or small software projects. For such software projects, we can use cross-project logging prediction. Aim: The aim of the study presented here is to investigate cross-project logging prediction methods and techniques. Method: The proposed method is ECLogger, which is a novel, ensemble-based, cross-project, catch-block logging prediction model. In the research We use 9 base classifiers were used and combined using ensemble techniques. The performance of ECLogger was evaluated on on three open-source Java projects: Tomcat, CloudStack and Hadoop. Results: ECLogger Bagging, ECLogger AverageVote, and ECLogger MajorityVote show a considerable improvement in the average Logged F-measure ($LF$) on 3, 5, and 4 source$rightarrow $target project pairs, respectively, compared to the baseline classifiers. ECLogger AverageVote performs best and shows improvements of 3.12% (average $LF$) and 6.08% (average $ACC$ -- Accuracy). Conclusion: The classifier based on ensemble techniques, such as bagging, average vote, and majority vote outperforms the baseline classifier. Overall, the ECLogger AverageVote model performs best. The results show that the CloudStack project is more generalizable than the other projects.

Authors and Affiliations

Sangeeta Lal, Neetu Sardana, Ashish Sureka

Keywords

Related Articles

Construction of Variable Strength Covering Array for Combinatorial Testing Using a Greedy Approach to Genetic Algorithm

The limitation of time and budget usually prohibits exhaustive testing of interactions between components in a component based software system. Combinatorial testing is a software testing technique that can be used to de...

Highly Automated Agile Testing Process: An Industrial Case Study

This paper presents a description of an agile testing process in a medium size software project that is developed using Scrum. The research methods used is the case study were as follows: surveys, quantifiable project da...

Data Flow Approach to Testing Java Programs Supported with DFC

Code based (``white box'') approach to testing can be divided into two main types: control flow coverage and data flow coverage. The data flow testing was introduced to structural programming languages and later adopted...

Experience Report: Introducing Kanban into Automotive Software Project

The boundaries between traditional and agile approach methods are disappearing. A significant number of software projects require a continuous implementation of tasks without dividing them into sprints or strict project...

A Graphical Modelling Editor for STARSoC Design Flow Tool Based on Model Driven Engineering Approach

Background : Due to the increasing complexity of embedded systems, system designers use higher levels of abstraction in order to model and analyse system performances. STARSoC (Synthesis Tool for Adaptive and Reconfigura...

Download PDF file
  • EP ID EP200430
  • DOI 10.5277/e-Inf170101
  • Views 124
  • Downloads 0

How To Cite

Sangeeta Lal, Neetu Sardana, Ashish Sureka (2017). ECLogger: Cross-Project Catch-Block Logging Prediction Using Ensemble of Classifiers. e-Informatica Software Engineering Journal, 11(1), 7-38. https://europub.co.uk/articles/-A-200430