MULTI-DOCUMENT TEXT SUMMARIZATION USING CLUSTERING TECHNIQUES AND LEXICAL CHAINING

Journal Title: ICTACT Journal on Soft Computing - Year 2010, Vol 1, Issue 1

Abstract

This paper investigates the use of clustering and lexical chains to produce coherent summaries of multiple documents in text format to generate an indicative, less redundant summary. The summary is designed as per user’s requirement of conciseness i.e., the documents are summarized according to the percentage input by the user. For achieving the above, various clustering techniques are used. Clustering is done at two levels, one at single document level and then at multi-document level. The clustered sentences are scored based on five different methods and lexically linked to produce the final summary in a text document.

Authors and Affiliations

Saraswathi S, Arti R

Keywords

Related Articles

DETERMINATION OF QUICK SWITCHING DOUBLE SAMPLING SYSTEM BY ATTRIBUTES UNDER FUZZY BINOMIAL DISTRIBUTION – SAMPLE SIZE TIGHTENING

Acceptance sampling is concerned with norms of deciding about the acceptance or rejection of the lots based on the quality of the product during inspection. Dodge and Romig popularized the acceptance sampling as a major...

FEATURE BASED COMMUNITY DETECTION BY EXTRACTING FACEBOOK PROFILE DETAILS

FEATURE BASED COMMUNITY DETECTION BY EXTRACTING FACEBOOK PROFILE DETAILS

SARCASM DETECTION IN ONLINE REVIEW TEXT

Sarcasm is a type of sentiment where people express negative sentiment using positive connotation words in text and vice-versa. In this work, we propose a cross-domain sarcasm detection framework that allows acquisition,...

A STATE OF THE ART SURVEY ON POLYMORPHIC MALWARE ANALYSIS AND DETECTION TECHNIQUES

Nowadays, systems are under serious security threats caused by malicious software, commonly known as malware. Such malwares are sophisticatedly created with advanced techniques that make them hard to analyse and detect,...

INTERPRETATION OF ECG USING MODIFIED INTUITIONISTIC FUZZY C-MEANS CLUSTERING FOR ARRHYTHMIA DATA

An electrocardiogram (ECG) is defined as a measure of variation in the electrical activity of the heart and is broadly used in detection and classification of heart-related diseases. The abnormalities present in the hear...

Download PDF file
  • EP ID EP198979
  • DOI 10.21917/ijsc.2010.0004
  • Views 79
  • Downloads 0

How To Cite

Saraswathi S, Arti R (2010). MULTI-DOCUMENT TEXT SUMMARIZATION USING CLUSTERING TECHNIQUES AND LEXICAL CHAINING. ICTACT Journal on Soft Computing, 1(1), 23-29. https://europub.co.uk/articles/-A-198979