A New Web Document Retrieval Method Using Extended-IOWA (Extended-Induced Ordered Weighted Averaging) Operator on HTML Tags

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3

Abstract

Abstract: A new scenario has arisen into the information retrieval (IR) field with the increase in the use of mark-up languages. This paper targets structured IR and is focused on documents with structure. This assumption forces us to estimate the different weights which are applied to every field of structured web documents (designed using HTML). In this work a new ranking function based on fuzzy logic called Extended-IOWA operator for structured IR has proposed. Its purpose is to develop a competent IR system through Extended-IOWA operator with weighted HTML tags. We prioritized HTML tags into four classes and assign fuzzy weights to these classes according to their significance in text retrieval. Document weights are based on tags, which contain query terms. Consequently each class generates a matrix which describes document-document relationship using Linguistic terms which we represent using Trapezoidal Fuzzy Numbers. Document score is calculated in different classes and finally scores of documents are aggregated by Extended-IOWA which in turn returns result in the form of final ranked list of relevant documents.

Authors and Affiliations

Sukrati Pathak , Sakshi Mitra

Keywords

Related Articles

 Enhancement in Elimination of Security Threads using Trusted Proactive Routing

 Ad hoc networks have been used in many applications which mandate a dynamic setup in the absence of fixed infrastructure. The design of ad hoc network has been mainly focuses on proper operation. It is possible t...

 Gain Comparison between NIFTY and Selected Stocks identified by SOM using Technical Indicators

 The main aim of every investor is to identify a stock that has potential to go up so that the investor can maximize possible returns on investment. After identification of stock the second important point of deci...

 Protection of Direct and Indirect Discrimination using Prevention  Methods

 Along with privacy, discrimination is a very important issue when considering the legal and ethical aspects of data mining. It is more than observable that the majority people do not want to be discriminated &nbs...

 Name Entity Recognition by New Framework Using Machine Learning Algorithm

 Abstract: The amount of textual information available electronically has made it difficult for many users to find and access the right information within acceptable time. Research communities in the natural languag...

 Sliced Ridgelet Transform for Image Denoising

 Image denoising based on ridgelet transforms gives better result in image denoising than standard wavelet transforms. In this research work, the researcher introduces a new approach for image denoising that is ba...

Download PDF file
  • EP ID EP94345
  • DOI 10.9790/0661-16346574
  • Views 121
  • Downloads 0

How To Cite

Sukrati Pathak, Sakshi Mitra (2014).  A New Web Document Retrieval Method Using Extended-IOWA (Extended-Induced Ordered Weighted Averaging) Operator on HTML Tags. IOSR Journals (IOSR Journal of Computer Engineering), 16(3), 65-74. https://europub.co.uk/articles/-A-94345