A Survey of Information Extraction Using Different Databases

Apply

A Survey of Information Extraction Using Different Databases

Journal Title: GRD Journal for Engineering - Year 2016, Vol 2, Issue 1

Abstract

Information extraction is one time process for extraction of a particular kind of relationships of interest from a document collection. Information Extraction is the task of automatically extracting structured information from unstructured or semi-structured machine readable documents. A pipeline of special-purpose processing modules is implemented by Information extraction systems. And a pipeline of special-purpose processing modules targeting the extraction of a particular kind of information. But this kind of extraction of information is not enough because there is some disadvantages occurs i.e. when the information have to be modified or improved, here only small part of the corpus might be affected. In this seminar we proposed the new extraction technique in which extraction needs are expressed in the form of database queries, which are evaluated and optimized by database systems. Furthermore, our approach provides automated query generation components so that casual users do not have to learn the query language in order to perform extraction. “Efficiency and quality of extraction “are the two things in which we highlighted in the information extraction system. In this, we propose a new paradigm for information extraction. In this extraction framework, intermediate output of each text processing component is stored so that only the improved component has to be deployed to the entire corpus. Extraction is then performed on both the previously processed data from the unchanged components as well as the updated data generated by the improved component. Performing such kind of incremental extraction can result in a tremendous reduction of processing time. To realize this new information extraction framework, we propose to choose database management systems over file-based storage systems to address the dynamic extraction needs. Our proposed information extraction is composed of two phase’s i.e. initial phase and extraction phase. Here we use different types of DBMS and its comparison with conclusion that, which database is efficient for incremental information extraction.

Authors and Affiliations

Miss. Aparna M. Bagde, Prof. D. C. Mehetre

Keywords

EP ID EP224297
DOI -
Views 123
Downloads 0