“Mining the web data using data mining techniques for identifying and classifying the user access behavioral patterns”
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2
Abstract
Abstract: The one of the largest and most widely used document repository is worldwide web. It has been used for mining data since many decades. It has been proved as one of the most helping platform to assimilate, disseminate and retrieve information. But unfortunately its success has only become its enemy. It seems like an ocean of information in which users are drowning not sailing. The information is so huge, diverse, dynamic and unstructured natured that users face the problems of information overloaded while interacting with the web. Here the issue of QoS cops up. It’s needed for web developer to know what the user really wants to do, predict which pages the user is interested in and provide the user the WebPages by knowledge of users navigational patterns to improve QoS. This project mainly focuses on cleaning the data i.e. sever web log file, processing the data according to some specific strategy, identifying the users using maximal forward reference algorithms and classifying them into predefined classes. Here supervised learning is used to train the classifier. We have carried out this project using an educational institute’s log file as input data. Our project work has been used to implement the model for providing the desired information to the user if the data at the backend can be maintained appropriately and grouped into different classes. We have done the implementation using the corresponding data can be given to the user, ie instead of giving hundreds of links related topic to the user, more appropriate links pertaining to the user can be given. Hence the prevision rate can be increased.
Authors and Affiliations
Shruthi C Kamoji , Praveen Naik
Six Sigma and CMMI
Abstract: Quality is essential in every walks of life and in all business for their success and profitability. There are many quality improvement initiatives and they are doing their part well. The startups and ini...
An Intelligent Meta Search Engine for Efficient Web DocumentRetrieval
Abstract: In daily use of internet, when searching information we face lots of difficulty due to the rapid growthof Information Resources. This is because of the fact that a single search engine cannot index the en...
An Extended Approach for Online Testing of Reversible Circuits
Reversible computing has tremendous benefits in terms of power consumption, less heat dissipation and packaging density. Because its applications are found in diverse fields including quantum computing, nanotech...
Processing of Top-k Selection Queries in Relational Database System
Abstract: In many applications, users specify target values for certain attributes, without requiring exact matches to these values in return. Instead, the result to such queries is typically a rank of the top-k tu...
Texture Based Approach For Face Image Recognition Using Low Resolution Images
Face recognition based on Euclidean distance and texture feature. A method for face recognition by using the GLCM (Gray Level Co-occurrence Matrix) and texture features. Euclidean distance classifier is used for the matc...