RIN-Sum: A System for Query-Specific Multi-Document Extractive Summarization

Abstract

In paper, we have proposed a novel summarization framework to generate a quality summary by extracting Relevant-Informative-Novel (RIN) sentences from topically related document collection called as RIN-Sum. In the proposed framework, with the aim to retrieve user's relevant informative sentences conveying novel information, ranking of structured sentences has been carried out. For sentence ranking, Relevant-Informative-Novelty (RIN) ranking function is formulated in which three factors, i.e., the relevance of sentence with input query, informativeness of the sentence and the novelty of the sentence have been considered. For relevance measure instead of incorporating existing metrics, i.e., Cosine and Overlap which have certain limitations, a new relevant metric called as C-Overlap has been formulated. RIN ranking is applied on document collection to retrieve relevant sentences conveying significant and novel information about the query. These retrieved sentences are used to generate query-specific summary of multiple documents. The performance of proposed framework have been investigated using standard dataset, i.e., DUC2007 documents collection and summary evaluation tool, i.e., ROUGE.

Authors and Affiliations

Rajesh Wadhvani, Rajesh Kumar Pateriya, Manasi Gyanchandani, Sanyam Shukla

Keywords

Related Articles

Medical Image Retrieval based on the Parallelization of the Cluster Sampling Algorithm

Cluster sampling algorithm is a scheme for sequential data assimilation developed to handle general non-Gaussian and nonlinear settings. The cluster sampling algorithm can be used to solve a wide spectrum of problems tha...

A Machine Learning Approach towards Detecting Dementia based on its Modifiable Risk Factors

Dementia is considered one of the greatest global health and social care challenges in the 21st century. Fortunately, dementia can be delayed or possibly prevented by changes in lifestyle as dictated through known modifi...

Antenna Performance Improvement Techniques for Energy Harvesting: A Review Study

The energy harvesting is defined as using energy that is available within the environment to increase the efficiency of any application. Moreover, this method is recognized as a useful way to break down the limitation of...

Teen’s Social Media Adoption: An Empirical Investigation in Indonesia

Social media has reached their popularity in the past decade. Indonesia has more than 63 million social media users who are accessing their account through mobile phone and therefore Indonesia is the third largest users...

A Convolutional Neural Network for Automatic Identification and Classification of Fall Army Worm Moth

To combat the problem caused by the Fall Army Worm in the country there is a need to come up with robust early warning and monitoring systems as the current manual system is labor intensive and time consuming. The automa...

Download PDF file
  • EP ID EP249580
  • DOI 10.14569/IJACSA.2017.080317
  • Views 78
  • Downloads 0

How To Cite

Rajesh Wadhvani, Rajesh Kumar Pateriya, Manasi Gyanchandani, Sanyam Shukla (2017). RIN-Sum: A System for Query-Specific Multi-Document Extractive Summarization. International Journal of Advanced Computer Science & Applications, 8(3), 106-112. https://europub.co.uk/articles/-A-249580