Software Bug Reports: Automatic Keyword and Sentence-Based Text Summarization Using Artificial Intelligence

Abstract

The purpose of text summarization is to quickly and accurately extract the most important data from papers. The proposed unsupervised method seeks to synthesise complete and informative bug reports (software artefacts). The suggested approach employs Rapid Auto- matic Keyword Extraction and the term frequency-inverse document frequency method to identify applicable keywords and phrases. During the sentence extraction procedure, fuzzy C-means clustering is used to prioritise sentences that have a high degree of membership in each cluster (beyond a predefined threshold). The selection of sentences is performed by a rule-engine. Information is extracted using keywords and sentences chosen by the clustering process, and the rules are developed using domain knowledge. The proposed method produces a logical and well-organized summary of apache bug reports. The retrieval summary is improved with the help of hierarchical clustering by removing unnecessary details and rearranging them. The Apache Project Bug Report Corpus (APBRC) and the original Bug Report Corpus are used to evaluate the effectiveness of the proposed method. Measures of performance such as precision, recall, pyramid precision, and F-score are used to evaluate the results. Experiment results demonstrate that our proposed method significantly outperforms the state-of-the-art baseline methods like BRC and LRCA. In addition, it achieves substantial gains compared to prior art unsupervised methods as Hurried and centroid. It extracts the most relevant keyword phrases and sentences from each cluster to offer comprehensive coverage and a coherent summary. The average values for precision, recall, f-score, and pyramid precision on the APBRC corpus are 78.22%, 82.18%, 80.10%, and 81.66%, respectively.

Authors and Affiliations

Zaid Altaf, and Ashish Oberoi

Keywords

Related Articles

Depression Identification Using Machine Learning Classifiers

Depression is a mental condition that indicates emotional issues, including anger issues, unhappiness, boredom, appetite loss, lack of concentration, anxiety, etc. The quality of life of an individual may be negatively i...

Brain Haemorrhage Detection using LSTM, Convolution Neural Network and CT Scan Images

A brain hemorrhage is an eruption of the brain's arteries brought on by either excessive blood pressure or blood coagulation, which may result in fatalities or serious injuries. It is the kind of medical emergency that r...

Design and Analysis of Composite Propeller Blade for Aircraft

The work in this paper primarily focuses on the modelling and analysis of a plane's propeller blade for strength. The geometry of a propeller blade is a sophisticated 3D model. CATIA V5 R20 is utilised to generate the bl...

IoT Based Smart Alert Network Security System Using Machine Learning

The increasing security threats in public places such as airports, train stations, and shopping malls require the development of smart security systems that can detect potential threats and provide timely alerts to secur...

Effect of addition of Alccofine on Coal Bottom Ash Concrete Properties

The effect of addition of ultra-fine material i.e. Alccofine on the properties of coal bottom ash-assisted concrete has been studied in this study. Alccofine is added in steps in the 40% bottom ash concrete to revive the...

Download PDF file
  • EP ID EP746018
  • DOI 10.55524/ijircst.2022.10.6.18
  • Views 1
  • Downloads 0

How To Cite

Zaid Altaf, and Ashish Oberoi (2022). Software Bug Reports: Automatic Keyword and Sentence-Based Text Summarization Using Artificial Intelligence. International Journal of Innovative Research in Computer Science and Technology, 10(6), -. https://europub.co.uk/articles/-A-746018