QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES

Journal Title: World Journal of Engineering Research and Technology - Year 2018, Vol 4, Issue 1

Abstract

When search results against digital libraries and web resources have limited metadata, augmenting them with meaningful and stable category information can enable better overviews and support user exploration. This paper proposes six “fast-feature” techniques that use only features available in the search result list, such as title, snippet, and URL, to categorize results into meaningful categories. They use credible knowledge resources, including a US government organizational hierarchy, a thematic hierarchy from the Open Directory Project (ODP) web directory and personal browse histories, to add valuable metadata to search results. In three tests the percent of results categorized for five representative queries was high enough to suggest practical benefits: general web search (76-90%), government web search (39-100%), and the Bureau of Labor Statistics website (48-94%). An additional test submitted 250 TREC queries to a search engine and successfully categorized 66% of the top 100 using the ODP and 61% of the top 350. Fast-feature techniques have been implemented in a prototype search engine. We propose research directions to improve categorization rates and make suggestions about how web site designers could re-organize their sites to support fast categorization of search results. Categories and Subject Descriptors H.3.3 [Information Storage and Retrieval]: Information Search and Retrieval; H.3.7 [Information Storage and Retrieval]: Digital Libraries General Terms Measurement, Design, Experimentation, Human Factors.

Authors and Affiliations

Pravendra Singh Chauhan

Keywords

Related Articles

DEMONETIZATION: IMPACT OF DIGITAL WALLETS

The government has executed a paramount change in the economic environment by demonetizing the high value currency notes - of Rs. 500 and Rs 1000 denomination. These ceased to be legal tender from the midnight of 8th of...

COMPRESSIVE SENSING BASED IMAGE RECONSTRUCTION

Compressive sensing is a technique of image acquisition and reconstruction from a relatively fewer measurements than what the Nyquist theorem suggests; the sampling rate must be greater than twice the highest frequency i...

DEVELOPING AN INFORMATICS MODEL FOR EFFECTIVE HEALTHCARE IN MILITARY HEALTH FACILITIES IN NIGERIA

Management, exchange and control of clinical information flow, and decision support have remained a challenge in most secondary/tertiary healthcare institutions in Nigeria amidst the continued advancement in Information...

TESTING AND PERFORMANCE OF SINGLE CYLINDER CI ENGINE WITH USING RICE BRAN BIODIESEL

The consumption of fuels in the world is increasing rapidly and it affects the global economy of all the countries so this factor forced all the countries to find the alternative fuel to reduce and even replace the usage...

IMPACT OF INTERNET OF THINGS (IOT) IN TERMS OF GUEST SERVICE SATISFACTION IN HOTEL INDUSTRY

Internet of Things (IoT) is current talk of the town which is widely effecting hotel industry these days, it is very important to focus out its practical implementations. This study features the multidimensional tasking...

Download PDF file
  • EP ID EP660551
  • DOI -
  • Views 199
  • Downloads 0

How To Cite

Pravendra Singh Chauhan (2018). QUERY CATEGORIZATION WEB SEARCH RESULTS INTO MEANINGFUL AND STABLE CATEGORIES USING FAST-FEATURE TECHNIQUES. World Journal of Engineering Research and Technology, 4(1), 256-278. https://europub.co.uk/articles/-A-660551