Machine Learning or Information Retrieval Techniques for Bug Triaging: Which is Better?

Journal Title: e-Informatica Software Engineering Journal - Year 2017, Vol 11, Issue 1

Abstract

Bugs are the inevitable part of a software system. Nowadays, large software development projects even release beta versions of their products to gather bug reports from users. The collected bug reports are then worked upon by various developers in order to resolve the defects and make the final software product more reliable. The high frequency of incoming bugs makes the bug handling a difficult and time consuming task. Bug assignment is an integral part of bug triaging that aims at the process of assigning a suitable developer for the reported bug who corrects the source code in order to resolve the bug. There are various semi and fully automated techniques to ease the task of bug assignment. This paper presents the current state of the art of various techniques used for bug report assignment. Through exhaustive research, the authors have observed that machine learning and information retrieval based bug assignment approaches are most popular in literature. A deeper investigation has shown that the trend of techniques is taking a shift from machine learning based approaches towards information retrieval based approaches. Therefore, the focus of this work is to find the reason behind the observed drift and thus a comparative analysis is conducted on the bug reports of the Mozilla, Eclipse, Gnome and Open Office projects in the Bugzilla repository. The results of the study show that the information retrieval based technique yields better efficiency in recommending the developers for bug reports.

Authors and Affiliations

Anjali Goyal, Neetu Sardana

Keywords

Related Articles

A literature review on the effectiveness and efficiency of business modeling

Background: Achieving and maintaining a strategic competitive advantage through business and technology innovation via continually improving effectiveness and efficiency of the operations are the critical survival factor...

Data Flow Approach to Testing Java Programs Supported with DFC

Code based (``white box'') approach to testing can be divided into two main types: control flow coverage and data flow coverage. The data flow testing was introduced to structural programming languages and later adopted...

Systematic Literature Review on Search Based Mutation Testing

Search based techniques have been widely applied in the domain of software testing. This Systematic Literature Review aims to present the research carried out in the field of search based approaches applied particularly...

Construction of Variable Strength Covering Array for Combinatorial Testing Using a Greedy Approach to Genetic Algorithm

The limitation of time and budget usually prohibits exhaustive testing of interactions between components in a component based software system. Combinatorial testing is a software testing technique that can be used to de...

Generating Graphical User Interfaces from Precise Domain Specifications

Turning requirements into working systems is the essence of software engineering. This paper proposes automation of one of the aspects of this vast problem: generating user interfaces directly from requirements models. I...

Download PDF file
  • EP ID EP200435
  • DOI 10.5277/e-Inf170106
  • Views 96
  • Downloads 0

How To Cite

Anjali Goyal, Neetu Sardana (2017). Machine Learning or Information Retrieval Techniques for Bug Triaging: Which is Better?. e-Informatica Software Engineering Journal, 11(1), 117-141. https://europub.co.uk/articles/-A-200435