ECLogger: Cross-Project Catch-Block Logging Prediction Using Ensemble of Classifiers

Journal Title: e-Informatica Software Engineering Journal - Year 2017, Vol 11, Issue 1

Abstract

Background: Software developers insert log statements in the source code to record program execution information. However, optimizing the number of log statements in the source code is challenging. Machine-learning-based within-project logging prediction tools proposed in previous studies may not be suitable for new or small software projects. For such projects, cross-project logging prediction can be used instead. Aim: This study investigates cross-project logging prediction methods and techniques. Method: The proposed method, ECLogger, is a novel, ensemble-based, cross-project, catch-block logging prediction model. Nine base classifiers were used and combined using ensemble techniques. The performance of ECLogger was evaluated on three open-source Java projects: Tomcat, CloudStack, and Hadoop. Results: ECLogger Bagging, ECLogger AverageVote, and ECLogger MajorityVote show a considerable improvement in the average Logged F-measure ($LF$) on 3, 5, and 4 source$\rightarrow$target project pairs, respectively, compared to the baseline classifiers. ECLogger AverageVote performs best and shows improvements of 3.12% (average $LF$) and 6.08% (average accuracy, $ACC$). Conclusion: Classifiers based on ensemble techniques such as bagging, average vote, and majority vote outperform the baseline classifier. Overall, the ECLogger AverageVote model performs best. The results show that the CloudStack project is more generalizable than the other projects.
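The abstract names average vote and majority vote as ensemble techniques but does not spell out how they combine base-classifier outputs. The following is a minimal sketch of the usual definitions, not the authors' implementation. The function names, the 0.5 threshold, and the nine probabilities are hypothetical values chosen for illustration, and `f_measure` assumes the standard harmonic mean of precision and recall computed on the "logged" class.

```python
from typing import List


def average_vote(probabilities: List[float], threshold: float = 0.5) -> bool:
    """Predict 'logged' when the mean base-classifier probability reaches the threshold."""
    return sum(probabilities) / len(probabilities) >= threshold


def majority_vote(probabilities: List[float], threshold: float = 0.5) -> bool:
    """Predict 'logged' when more than half of the base classifiers vote 'logged'."""
    votes = sum(1 for p in probabilities if p >= threshold)
    return votes > len(probabilities) / 2


def f_measure(predicted: List[bool], actual: List[bool]) -> float:
    """Harmonic mean of precision and recall for the positive ('logged') class."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical 'logged' probabilities from nine base classifiers for one catch block.
example = [0.9, 0.9, 0.9, 0.9, 0.4, 0.4, 0.4, 0.4, 0.4]
print(average_vote(example), majority_vote(example))
```

In this made-up case, four confident classifiers pull the mean above the threshold while still being outnumbered five to four, so the two schemes disagree on the same inputs.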

Authors and Affiliations

Sangeeta Lal, Neetu Sardana, Ashish Sureka

  • EP ID EP200430
  • DOI 10.5277/e-Inf170101

How To Cite

Sangeeta Lal, Neetu Sardana, Ashish Sureka (2017). ECLogger: Cross-Project Catch-Block Logging Prediction Using Ensemble of Classifiers. e-Informatica Software Engineering Journal, 11(1), 7-38. https://europub.co.uk/articles/-A-200430