Searching Relevant Documents from Large Volume of Unstructured Database

Abstract

In large organizations managing of data is very tedious task. these includes unstructured data such as images,videos,MP3 files, emails etc. The central aspect of research is to identify right document from unstructured documents. It refers Tf-Idf technology, clustering mechanism, similarity measure etc. When multiple document contains same data as input then document which is most similar to input query it should be display first. For that we can use Stemming Algorithm.

Authors and Affiliations

Sarika Kolhe, Varsha Tambe, Gayatri Pawar, Priyanka Ubale, Prof. Nihar Ranjan

Keywords

Related Articles

Practical and FEA Result Analysis of Selected Casement and Sliding Windows

Glass is used for huge number of applications. But normally glass is the main part of the windows. Nowadays the walls of the building consists lot of glasses on the behalf of the packed walls. There are two types of win...

A Novel Approach for Face Detection and Recognization with Multi Scale Color Restoration Technique Using Combination of Knowledge Based and Feature Segmentation

Pattern recognition is the fastest growing area in digital world. Face Detection and recognization technique help this area make it more powerful. In this paper we have introduce an algorithm in which we have combined t...

A Study on E-Commerce Usage in Indian FMCG Companies

The main objective of this study during the summer internship was to discover the efficient ways to promote the sale of FMCG products- Confectioneries like, ORBIT and its followed versions using Ecommerce. Ecommerce bus...

The State – of the – Art of Library Resource Sharing Activities of the Rizal Technological University

In the emergence and integration of information technology, it is rarely possible for a library or information center to have enough resources to fulfill the needs of its clients. What is being delivered is only a porti...

Low Cost Multiple Output Concentrated Solar Energy System (CSES)

Low cost multiple output concentrated solar energy system is centred towards the maximum utilization of solar energy, both direct and diffused radiations producing output as a combination of Electricity, Hot water and s...

Download PDF file
  • EP ID EP20053
  • DOI -
  • Views 312
  • Downloads 5

How To Cite

Sarika Kolhe, Varsha Tambe, Gayatri Pawar, Priyanka Ubale, Prof. Nihar Ranjan (2015). Searching Relevant Documents from Large Volume of Unstructured Database. International Journal for Research in Applied Science and Engineering Technology (IJRASET), 3(4), -. https://europub.co.uk/articles/-A-20053