A New Web Document Retrieval Method Using Extended-IOWA (Extended-Induced Ordered Weighted Averaging) Operator on HTML Tags

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3

Abstract

Abstract: A new scenario has arisen into the information retrieval (IR) field with the increase in the use of mark-up languages. This paper targets structured IR and is focused on documents with structure. This assumption forces us to estimate the different weights which are applied to every field of structured web documents (designed using HTML). In this work a new ranking function based on fuzzy logic called Extended-IOWA operator for structured IR has proposed. Its purpose is to develop a competent IR system through Extended-IOWA operator with weighted HTML tags. We prioritized HTML tags into four classes and assign fuzzy weights to these classes according to their significance in text retrieval. Document weights are based on tags, which contain query terms. Consequently each class generates a matrix which describes document-document relationship using Linguistic terms which we represent using Trapezoidal Fuzzy Numbers. Document score is calculated in different classes and finally scores of documents are aggregated by Extended-IOWA which in turn returns result in the form of final ranked list of relevant documents.

Authors and Affiliations

Sukrati Pathak , Sakshi Mitra

Keywords

Related Articles

 Object Elimination and Reconstruction Using an Effective  Inpainting Method

 Three major problems have been found in the existing algorithms of image inpainting: Reconstruction of large regions, Preference of filling-in and Choice of best exemplars to synthesize the missing  reg...

 Robust Digital Image Watermarking based on spread spectrum  and convolutional coding

 Digital watermarking is a promising technology to embed information as unperceivable signals in digital contents. A copyright protection method for digital image with convolutional coding is proposed in this &nbs...

Optimal Planning of the Production of Corpus Details on Metal Cutting Machines with the Help of Computer Numeric Control

Abstract: The optimal planning of details mechanical processing is a key problem, directly affecting the productivity and efficiency of the activity of a machine building company. The combinatorial character of the prob...

Review of Evolutionary Algorithms in Wsn

Abstract: Diverse issues related to wireless sensor networks like energy minimization (optimization), compression schemes, network algorithms which are self-organizing, routing protocols, management of quality of service...

Download PDF file
  • EP ID EP94345
  • DOI 10.9790/0661-16346574
  • Views 135
  • Downloads 0

How To Cite

Sukrati Pathak, Sakshi Mitra (2014).  A New Web Document Retrieval Method Using Extended-IOWA (Extended-Induced Ordered Weighted Averaging) Operator on HTML Tags. IOSR Journals (IOSR Journal of Computer Engineering), 16(3), 65-74. https://europub.co.uk/articles/-A-94345