QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES

Journal Title: World Journal of Engineering Research and Technology - Year 2018, Vol 4, Issue 1

Abstract

When search results against digital libraries and web resources have limited metadata, augmenting them with meaningful and stable category information can enable better overviews and support user exploration. This paper proposes six “fast-feature” techniques that use only features available in the search result list, such as title, snippet, and URL, to categorize results into meaningful categories. They use credible knowledge resources, including a US government organizational hierarchy, a thematic hierarchy from the Open Directory Project (ODP) web directory and personal browse histories, to add valuable metadata to search results. In three tests the percent of results categorized for five representative queries was high enough to suggest practical benefits: general web search (76-90%), government web search (39-100%), and the Bureau of Labor Statistics website (48-94%). An additional test submitted 250 TREC queries to a search engine and successfully categorized 66% of the top 100 using the ODP and 61% of the top 350. Fast-feature techniques have been implemented in a prototype search engine. We propose research directions to improve categorization rates and make suggestions about how web site designers could re-organize their sites to support fast categorization of search results. Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; H.3.7 [Information Storage and Retrieval]: Digital Libraries General Terms Measurement, Design, Experimentation, Human Factors.

Authors and Affiliations

Pravendra Singh Chauhan

Keywords

Related Articles

THE NOVEL METHOD FOR RECOGNITION OF AMERICAN SIGN LANGUAGE WITH RING PROJECTION AND DISCRETE WAVELET TRANSFORM

Sign Language is a language that allows individuals with hearing or speech impairment to communicate with themselves and their surroundings and has the feature of not being a universal language. This language, which is n...

GABOR FILTER-BASED FEATURE LEVEL FUSION OF PALM VEIN AND FINGERPRINT RECOGNITION SYSTEM

Biometrics fusion entails using two or more physiological or behavioral traits to improve the performance of biometric systems. Most existing works investigated effects of fusion of multiple features at image, matching s...

AN AUTOMATED RECOGNITION OF FAKE OR DESTROYED INDIAN CURRENCY NOTES (RESULT)

Automatic method for detection of fake currency note is very important in every country. In this project we have made fake currency note detection technique using MATLAB and feature extraction with HSV color space and ot...

APPLICATION OF NUMERICAL OPTIMIZATION AS A TOOL FOR VALIDATION OF OPTIMIZED RESPONSE OF FLEXURAL STRENGTH OF WOOD ASH (HARDWOOD) PARTICLES REINFORCED POLYPROPYLENE WARPP

In this study, we investigated the adequacy of approximation of fitted model to the real system. To meet this objective, the numerical optimization method was applied as an alternative to the conventional method by Sures...

DOME CONDITIONS IN COASTAL CITY IN TROPICAL CLIMATE

The paper describes the long-term monitoring of the finishes performance of the building with dome structure located in coastal area. The theoretical proposal is based on relevant literature and was applied and adjusted...

Download PDF file
  • EP ID EP660551
  • DOI -
  • Views 200
  • Downloads 0

How To Cite

Pravendra Singh Chauhan (2018). QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES. World Journal of Engineering Research and Technology, 4(1), 256-278. https://europub.co.uk/articles/-A-660551