Text Summarization of Multi-Aspect Comments in Social Networks in Persian Language

Abstract

Now-a-days, there are increasingly huge amount of user generated comments on the web. The user generated comments usually contains useful and essential information reflecting public’s or customers’ opinions. Since the information in the comments could be used for decision making, production or service improvement, and achieving user satisfaction, the systematic analysis of these comments is an essential need in so many domains including e-commerce, production, and social network analysis. However, the analysis of large volume of comments is a difficult and time-consuming task. Therefore, the need for a system which can convert this massive volume of comments to a useful and efficient summary is felt more and more. Text summarization leads to using more resources at higher speeds and getting richer information. According to numerous studies conducted in the field of multi-document summarization, few studies can be found that have been focused on the user generated comments in Persian language. In this paper, we propose a novel approach to summarize huge amount of comments in Persian, which is enough close to a human summarization. Our approach is based on semantic and lexical similarities and uses a graph-based summarization. We also propose a clustering to deal with multiple aspects (subjects) in a corpus of comments. According to the experiments, the summaries extracted by the proposed approach reached an average score of 8.75 out of 10, which improves the state-of-the-art summarizer’s score about 14 percent.

Authors and Affiliations

Hossein Shahverdian, Hassan Saneifar

Keywords

Related Articles

The Adoption of Software Process Improvement in Saudi Arabian Small and Medium Size Software Organizations: An Exploratory Study

Quite a lot of attention has been paid in the literature on “how to adopt” software process improvement (SPI) in Small and Medium Size (SME) software organization in several countries. This has resulted in limited improv...

 Contextual Modelling of Collaboration System

 Faced with new environmental constraints, firms decide to collaborate in collective entities and adopt new patterns of behavior. So, this firms’ collaboration becomes an unavoidable approach. Indeed, our aim intere...

Efficient Algorithm for Maximal Clique Size Evaluation

A large dataset network is considered for computation of maximal clique size (MC). Additionally, its link with popular centrality metrics to decrease uncertainty and complexity and for finding influential points of any n...

Image Denoising using Adaptive Thresholding in Framelet Transform Domain

Noise will be unavoidable during image acquisition process and denosing is an essential step to improve the image quality. Image denoising involves the manipulation of the image data to produce a visually high quality im...

Design of A high performance low-power consumption discrete time Second order Sigma-Delta modulator used for Analog to Digital Converter

This paper presents the design and simulations results of a switched-capacitor discrete time Second order Sigma-Delta modulator used for a resolution of 14 bits Sigma-Delta analog to digital converter. The use of operati...

Download PDF file
  • EP ID EP259140
  • DOI 10.14569/IJACSA.2017.081248
  • Views 76
  • Downloads 0

How To Cite

Hossein Shahverdian, Hassan Saneifar (2017). Text Summarization of Multi-Aspect Comments in Social Networks in Persian Language. International Journal of Advanced Computer Science & Applications, 8(12), 362-368. https://europub.co.uk/articles/-A-259140