Text Summarization and Discovery of Frames and Relationship from Natural Language Text - A R&D Methodology

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 3

Abstract

The paper deals with the concept of data mining whereby the data resources can be fetched and accessed accordingly with reduced time complexity. Resource sharing is an important aspect in the field of information science. The retrieval techniques are pointed out based on the ideas of binary search tree, Gantt chart, text summarization. A heorem has been cited regarding the summation of total length of codes of each leaf search term. Summarization is a hard problem of Natural Language Processing because, to do it properly, one has to really understand the point of a text. This requires semantic analysis, discourse processing, and inferential interpretation (grouping of the content using world knowledge). The last step, especially, is complex, because systems without a great deal of world knowledge simply cannot do it. Therefore, attempts so far of performing true abstraction--creating abstracts as summaries--have not been very successful. Fortunately, however, an approximation called extraction is more feasible today. To create an extract, a system need simply to identify the most important/topical/central topic(s) of the text, and return them to the reader. Although the summary is not necessarily coherent, the reader can form an opinion of the content of the original. Most automated summarization systems today produce extracts only. Another purpose of this paper is to addresses the problem of information discovery in large collections of text. For users, one of the key problems in working with such collections is determining where to focus their attention. Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use for answering precise queries or for running data mining tasks. We explore a technique for extracting such tables from document collections that requires only a handful of training examples from users. In this paper we have tried to explain how to extract the different kind of relationship between the words with the help of a frame net analysis diagram of an annotation layer software.

Authors and Affiliations

P. Chakrabarti , J. K. Basu

Keywords

Related Articles

UEP based on Proximity Pilot Subcarriers with QAM in OFDM

A novel UEP (Unequal Error Protection) method is proposed that utilizes the subcarrier positions relative to pilot subcarriers in an OFDM multicarrier frame along with QAM (Quadrature Amplitude Modulation) schemes. With...

Augmentation of Block Truncation Coding based Image Retrieval by using Even and Odd Images with Sundry Colour Spaces

The augmentation to block truncation coding (BTC) based image retrieval techniques using Even and Odd images with ten different colour spaces is the theme of work given in the paper. Here the original image is reflected...

Towards a participatory E-learning 2.0<br /> A new E-learning focused on learners and validation of the content

Our aim is to propose a collaborative methodological approach centred on learners and based on the Web 2.0 tools in order to make E-learning 2.0. It is based on a process consisting of four iterative steps which are: gro...

Modified OCABTR based Hierarchical Two Level Data Aggregation in Wireless Sensor Networks

In Wireless Sensor Network (WSN), it is necessary to optimize the energy of the sensor node. Each sensor node is equipped with a battery which is used for sensing and processing the data. Cluster based data aggregation a...

Global Chaos Synchronization of Four-Scroll and Four-Wing Attractors by Active Nonlinear Control

This paper investigates the global chaos synchronization of identical four-scroll attractors (Liu and Chen, 2004), identical four-wing attractors (Liu, 2009) and non-identical four-scroll and four-wing attractors by acti...

Download PDF file
  • EP ID EP102696
  • DOI -
  • Views 138
  • Downloads 0

How To Cite

P. Chakrabarti, J. K. Basu (2010). Text Summarization and Discovery of Frames and Relationship from Natural Language Text - A R&amp;D Methodology. International Journal on Computer Science and Engineering, 2(3), 487-492. https://europub.co.uk/articles/-A-102696