Time Reduction Mechanism in Information Extraction Using Parse Tree Query Language

Abstract

Information extraction (IE) is the task of automatically extracting structured information from unstructured and semi-structured machinereadable document. In this paper, we propose a new paradigm for information extraction. In this extraction framework, intermediate output of each text processing component is stored so that only the improved component has to be deployed to the entire corpus. Extraction is then performed on both the previously processed data from the unchanged components as well as the updated data generated by the improved component. Performing such kind of incremental extraction can result in a tremendous reduction of processing time. To realize this new information extraction framework, we propose to choose database management systems over filebased storage systems to address the dynamic extraction needs. To demonstrate the feasibility of incremental extraction approach, experiments are performed to highlight two important aspects of an information extraction system: efficiency and quality of extraction results.

Authors and Affiliations

K. Venkatesh, Mr. B. Vijaya Bhaskar Reddy

Keywords

Related Articles

New Strategies for Boosting Localization Accuracy in Wireless Sensor Nodes

Wireless Sensor Networks (WSNs), accurate and energy-efficient localization of sensor nodes remains a challenging task despite significant advancements. Current geolocation algorithms often struggle with scalability, ada...

Exploring the Efficacy of Basketball Shooting: A Comprehensive Analysis of Success Rates

The present study evaluated success rate of basketball shooting for different skill levels. A total of 10 subjects (5 skilled and 5 unskilled) participated in this study. The main goal of the study was to provide the pre...

Performance on Stabilization of Soils Using Geosynthetics

A Large variety of reinforcing materials emerged and have been developed for construction purposes, including: Metal strips, bar mats, Geotextile sheets, Geo Grids, and other reinforcing materials have emerged and been...

Measuring Maintainability of Object Oriented Design: A Revisit

Maintainability has always been an elusive concept. Software maintainability is an external software quality attributes that estimate the complexity and effort required for maintaining software. The key concern of this r...

Walking Kinematics Approaching Stairs

Walkers encounter surfaces of varying angles every day, from shallow accessible ramps to steep outdoor hills. These individuals must constantly alter their joint mechanics to safely transition between surfaces and avoid...

Download PDF file
  • EP ID EP749041
  • DOI -
  • Views 52
  • Downloads 0

How To Cite

K. Venkatesh, Mr. B. Vijaya Bhaskar Reddy (2014). Time Reduction Mechanism in Information Extraction Using Parse Tree Query Language. International Journal of Innovative Research in Computer Science and Technology, 2(5), -. https://europub.co.uk/articles/-A-749041