Text Summarization and Discovery of Frames and Relationship from Natural Language Text - A R&D Methodology
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 3
Abstract
The paper deals with the concept of data mining whereby the data resources can be fetched and accessed accordingly with reduced time complexity. Resource sharing is an important aspect in the field of information science. The retrieval techniques are pointed out based on the ideas of binary search tree, Gantt chart, text summarization. A heorem has been cited regarding the summation of total length of codes of each leaf search term. Summarization is a hard problem of Natural Language Processing because, to do it properly, one has to really understand the point of a text. This requires semantic analysis, discourse processing, and inferential interpretation (grouping of the content using world knowledge). The last step, especially, is complex, because systems without a great deal of world knowledge simply cannot do it. Therefore, attempts so far of performing true abstraction--creating abstracts as summaries--have not been very successful. Fortunately, however, an approximation called extraction is more feasible today. To create an extract, a system need simply to identify the most important/topical/central topic(s) of the text, and return them to the reader. Although the summary is not necessarily coherent, the reader can form an opinion of the content of the original. Most automated summarization systems today produce extracts only. Another purpose of this paper is to addresses the problem of information discovery in large collections of text. For users, one of the key problems in working with such collections is determining where to focus their attention. Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use for answering precise queries or for running data mining tasks. We explore a technique for extracting such tables from document collections that requires only a handful of training examples from users. In this paper we have tried to explain how to extract the different kind of relationship between the words with the help of a frame net analysis diagram of an annotation layer software.
Authors and Affiliations
P. Chakrabarti , J. K. Basu
An Integer Programming-based Local Search for Large-scale Maximal Covering Problems
Maximal covering problem (MCP) is classified as a linear integer optimization problem which can be effectively solved by integer programming technique. However, as the problem size grows, integer programming requires exc...
WEIGHTAGE FACTOR BASED SCHEME FOR PERFORMANCE IMPROVEMENT IN AD HOC NETWORKS
Ad-hoc networks are a new wireless networking paradigm for mobile hosts. Ad hoc networks don’t require any fixed infrastructure such as base stations or mobile switching centers. Node mobility in an ad-hoc network causes...
Mathematical algorithms for determination of mixed layer height from laser radar signals
This paper describes different mathematical algorithms used in the determination of mixed layer height (MLH) from the laser radar (lidar) signals. These methods are successfully applied to the indigenously developed port...
Image Retrieval using DWT with Row and Column Pixel Distributions of BMP Image
With the rapid development of technology of multimedia, the traditional information retrieval techniques based on keywords are not sufficient, content - based image retrieval (CBIR) has been an active research topic. The...
Global Chaos Synchronization of Four-Scroll and Four-Wing Attractors by Active Nonlinear Control
This paper investigates the global chaos synchronization of identical four-scroll attractors (Liu and Chen, 2004), identical four-wing attractors (Liu, 2009) and non-identical four-scroll and four-wing attractors by acti...