The Anatomy of Web Search Result Clustering and Search Engines

Journal Title: Indian Journal of Computer Science and Engineering - Year 2010, Vol 1, Issue 4

Abstract

World Wide Web is a very large distributed digital information space. The ability to search and retrieve information from the Web efficiently and effectively is an enabling technology for realizing its full potential. Current search tools retrieve too many documents, of which only a small fraction are relevant to the user query. Furthermore, the most relevant documents do not necessarily appear at the top of the query output order. Clustering Techniques are now being used to give a meaningful search result on web. Text document clustering has been traditionally investigated as a means of improving the performance of search engines. We present a thorough comparison of the algorithms based on the various facets of their features and functionality. Furthermore, we highlight the main characteristics of a number of existing Web clustering engines and also discuss how to evaluate their retrieval performance.

Authors and Affiliations

R. SUBHASHINI , V. JAWAHAR SENTHIL KUMAR

Keywords

Related Articles

GROUPING WEB ACCESS SEQUENCES USING SEQUENCE ALIGNMENT METHOD

In web usage mining grouping of web access sequences can be used to determine the behavior or intent of a set of users. Grouping web sessions is how to measure the similarity between web sessions. There are many shortcom...

An Extended Model Driven Framework for End-to-End Consistent Model Transformation

Model Driven Development (MDD) results in quick transformation from models to corresponding systems. Forward engineering features of modelling tools can help in generating source code from models. To build a robust syste...

Identification of Images Using Digital Image Processing

General image identification is essential in many applications. Defects identification in industries is mostly manual and time consuming. To reduce error in detecting defects, image identification can be used in industri...

PROGRAMMED TEST CASE GENERATION FROM SIMULINK/STATEFLOW MODEL

Matlab, Simulink/Stateflow Model is the most extensively used industrial tools that include system development that allows models to be developed, visualized and exercised. Matlab, Simulink/Stateflow (SL/SF) is used part...

DETECTING THE USEFUL ELECTROMYOGRAM SIGNALS–EXTRACTING, CONDITIONING & CLASSIFICATION

Surface EMG is an important signal containing the information in form of electrical signals referred as myoelectric signals, used in designing & development of many prosthesis and clinical researches applications .Va...

Download PDF file
  • EP ID EP160575
  • DOI -
  • Views 125
  • Downloads 0

How To Cite

R. SUBHASHINI, V. JAWAHAR SENTHIL KUMAR (2010). The Anatomy of Web Search Result Clustering and Search Engines. Indian Journal of Computer Science and Engineering, 1(4), 392-401. https://europub.co.uk/articles/-A-160575