Time Reduction Mechanism in Information Extraction Using Parse Tree Query Language
Journal Title: International Journal of Innovative Research in Computer Science and Technology - Year 2014, Vol 2, Issue 5
Abstract
Information extraction (IE) is the task of automatically extracting structured information from unstructured and semi-structured machinereadable document. In this paper, we propose a new paradigm for information extraction. In this extraction framework, intermediate output of each text processing component is stored so that only the improved component has to be deployed to the entire corpus. Extraction is then performed on both the previously processed data from the unchanged components as well as the updated data generated by the improved component. Performing such kind of incremental extraction can result in a tremendous reduction of processing time. To realize this new information extraction framework, we propose to choose database management systems over filebased storage systems to address the dynamic extraction needs. To demonstrate the feasibility of incremental extraction approach, experiments are performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results.
Authors and Affiliations
K. Venkatesh, Mr. B. Vijaya Bhaskar Reddy
Constraints over Greenhouse Detection using Wireless Sensor Networks
Due to uneven natural distribution of rain water it is very crucial for farmers to monitor and control the equal distribution of water to all crops in the whole farm or as per the requirement of the crop. There is no ide...
A Framework for Modeling Non-Functional Requirements for Business-Critical Systems
Proper definition and implementation of NFRs is critical. In case they are Over-specify, then the solution may be too costly to be viable; in case they are underspecified or underachieve them, the system will be inadequa...
Application of IoT in Healthcare
There has been a lot of research into medical facilities and technological developments during the last ten years. To be more specific, the Internet (IoT) has shown promise in connecting a range of healthcare gear, monit...
GPU-Graphics Processing Unit
In this paper we describe GPU and its computing. GPU (Graphics Processing Unit) is an extremely multi-threaded architecture and then is broadly used for graphical and now nongraphical computations. The main advantage of...
Knowledge Representation for Legal Document Summarization
This paper presents a novel approach for legal document summarization. Proposed approach is based on Ripple-Down Rules (RDR). It is an incremental knowledge acquisition method. RDR allows us to quickly build an extendabl...