QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES

Journal Title: World Journal of Engineering Research and Technology - Year 2018, Vol 4, Issue 1

Abstract

When search results from digital libraries and web resources carry limited metadata, augmenting them with meaningful and stable category information can enable better overviews and support user exploration. This paper proposes six “fast-feature” techniques that use only features available in the search result list, such as title, snippet, and URL, to categorize results into meaningful categories. The techniques draw on credible knowledge resources, including a US government organizational hierarchy, a thematic hierarchy from the Open Directory Project (ODP) web directory, and personal browse histories, to add valuable metadata to search results. In three tests, the percentage of results categorized for five representative queries was high enough to suggest practical benefits: general web search (76-90%), government web search (39-100%), and the Bureau of Labor Statistics website (48-94%). An additional test submitted 250 TREC queries to a search engine and successfully categorized 66% of the top 100 results using the ODP and 61% of the top 350. Fast-feature techniques have been implemented in a prototype search engine. We propose research directions to improve categorization rates and make suggestions about how web site designers could re-organize their sites to support fast categorization of search results.

Categories and Subject Descriptors: H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; H.3.7 [Information Storage and Retrieval]: Digital Libraries

General Terms: Measurement, Design, Experimentation, Human Factors.
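As a rough illustration of the fast-feature idea described in the abstract, the Python sketch below assigns a category to a search result using only its URL, title, and snippet, and then reports the percentage of results categorized. It is not the paper's implementation: the lookup tables `DOMAIN_CATEGORIES` and `KEYWORD_CATEGORIES`, the category labels, and the function names are hypothetical stand-ins for the much larger knowledge resources (such as the ODP and a government organizational hierarchy) that the abstract mentions.

```python
from urllib.parse import urlparse

# Hypothetical lookup tables for illustration only; real knowledge resources
# such as the ODP would hold far more entries and different labels.
DOMAIN_CATEGORIES = {
    "bls.gov": "Government/Labor/Bureau of Labor Statistics",
    "nasa.gov": "Government/Independent Agencies/NASA",
}
KEYWORD_CATEGORIES = {
    "recipe": "Home/Cooking",
    "unemployment": "Society/Work",
}


def categorize(result):
    """Return a category for one search result dict, or None if no rule matches."""
    host = (urlparse(result["url"]).hostname or "").lower()
    parts = host.split(".")
    # Try the full host first, then each parent domain (stats.bls.gov -> bls.gov -> gov).
    for i in range(len(parts)):
        candidate = ".".join(parts[i:])
        if candidate in DOMAIN_CATEGORIES:
            return DOMAIN_CATEGORIES[candidate]
    # Fall back to simple keyword matching on the title and snippet.
    text = (result["title"] + " " + result["snippet"]).lower()
    for keyword, category in KEYWORD_CATEGORIES.items():
        if keyword in text:
            return category
    return None


def categorization_rate(results):
    """Percentage of results that received a category."""
    if not results:
        return 0.0
    categorized = sum(1 for r in results if categorize(r) is not None)
    return 100.0 * categorized / len(results)


sample_results = [
    {"url": "https://stats.bls.gov/cpi/", "title": "Consumer Price Index", "snippet": "CPI data"},
    {"url": "https://example.com/pie", "title": "Apple pie recipe", "snippet": "Easy dessert"},
    {"url": "https://example.org/misc", "title": "Untitled", "snippet": "No useful terms"},
]
rate = categorization_rate(sample_results)
```

Because every rule reads only fields already present in the result list, no page needs to be fetched at query time; results that match no rule stay uncategorized and lower the reported rate.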

Authors and Affiliations

Pravendra Singh Chauhan


How To Cite

Pravendra Singh Chauhan (2018). QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES. World Journal of Engineering Research and Technology, 4(1), 256-278. https://europub.co.uk/articles/-A-660551