Software Bug Reports: Automatic Keyword and Sentence-Based Text Summarization Using Artificial Intelligence
Journal Title: International Journal of Innovative Research in Computer Science and Technology - Year 2022, Vol 10, Issue 6
Abstract
The purpose of text summarization is to quickly and accurately extract the most important data from papers. The proposed unsupervised method seeks to synthesise complete and informative bug reports (software artefacts). The suggested approach employs Rapid Auto- matic Keyword Extraction and the term frequency-inverse document frequency method to identify applicable keywords and phrases. During the sentence extraction procedure, fuzzy C-means clustering is used to prioritise sentences that have a high degree of membership in each cluster (beyond a predefined threshold). The selection of sentences is performed by a rule-engine. Information is extracted using keywords and sentences chosen by the clustering process, and the rules are developed using domain knowledge. The proposed method produces a logical and well-organized summary of apache bug reports. The retrieval summary is improved with the help of hierarchical clustering by removing unnecessary details and rearranging them. The Apache Project Bug Report Corpus (APBRC) and the original Bug Report Corpus are used to evaluate the effectiveness of the proposed method. Measures of performance such as precision, recall, pyramid precision, and F-score are used to evaluate the results. Experiment results demonstrate that our proposed method significantly outperforms the state-of-the-art baseline methods like BRC and LRCA. In addition, it achieves substantial gains compared to prior art unsupervised methods as Hurried and centroid. It extracts the most relevant keyword phrases and sentences from each cluster to offer comprehensive coverage and a coherent summary. The average values for precision, recall, f-score, and pyramid precision on the APBRC corpus are 78.22%, 82.18%, 80.10%, and 81.66%, respectively.
Authors and Affiliations
Zaid Altaf, and Ashish Oberoi
A Review on Trademark and Its Enforcement Provisions under TRIPS
The Agreement on Trade Related Aspects of Intellectual Property Rights (TRIPS) for the protection and promotion of Copyright Protection (IP) rights in World Trade Organization member nations includes trademark provisions...
Reducing Complexity of Graph Isomorphism Problem
Graph isomorphism has been discussed in the literature as NP-hard problem. It has applications in various areas. Work done earlier in this area employs backtracking for identifying isomorphism between given two graphs as...
Digital Marketing Analysis on Social Media Using Machine Learning
The field of machine learning has received insufficient attention. Because of their superior Artificial Intelligence, machines that are capable of deep learning have the potential to push the boundaries of what is possib...
A Better Solution Towards Microservices Communication In Web Application: A Survey
Most of the software applications are configured as client-server fashion in the network in order to enable communication among them. But sometimes these applications must be able to communicate or exchange data between...
COMPARATIVE PERFORMANCE STUDY OF HYBRID ROUTING PROTOCOL OVER REAL TIME DATA IN MANET
MANET is a collection of wireless nodes that can dynamically form a network to exchange information without using any pre-existing fixed network infrastructure with or without centralized network controller. MANETs are b...