Log Mining Based on Hadoop’s Map and Reduce Technique
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 4
Abstract
In cloud and grid computing, Virtual Database (VDB) technology is an effective way to integrate data from heterogeneous sources. Hadoop is a large-scale distributed batch-processing infrastructure designed to distribute large amounts of work efficiently across a set of machines, and it provides an implementation of MapReduce. This paper proposes an application that identifies the area in which to open a new pizza branch, based on the number of hits from customers in each area. The log files of the website are stored on a web mining server. This data is passed to a cloud server, which distributes it region-wise across virtual servers. Map and reduce operations are then performed on the region-wise data, and the final output is sent back to the server and the client. The paper uses the parallel and distributed processing capability of Hadoop MapReduce to handle heterogeneous query execution on large datasets. A Virtual Database Engine built on top of it should therefore provide effective, high-performance distributed data integration.
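The region-wise map and reduce steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: it runs in a single Python process without Hadoop and only mimics the map, shuffle, and reduce dataflow. The log line format ("<timestamp> <region> <url>"), the sample data, and all function names are assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical sample log; the real log format is not given in the abstract.
SAMPLE_LOG = [
    "2013-04-01T10:00:00 north /menu",
    "2013-04-01T10:01:00 south /order",
    "2013-04-01T10:02:00 north /order",
    "malformed",
]


def map_hits(log_line: str) -> Iterator[Tuple[str, int]]:
    """Map step: emit (region, 1) for one well-formed log line."""
    fields = log_line.split()
    if len(fields) >= 3:  # skip lines that do not match the assumed format
        yield fields[1], 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for region, count in pairs:
        grouped[region].append(count)
    return grouped


def reduce_hits(region: str, counts: List[int]) -> Tuple[str, int]:
    """Reduce step: sum the hit counts collected for one region."""
    return region, sum(counts)


def count_hits_by_region(log_lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the log lines."""
    pairs = (pair for line in log_lines for pair in map_hits(line))
    return dict(reduce_hits(r, c) for r, c in shuffle(pairs).items())


def busiest_region(hits: Dict[str, int]) -> str:
    """Return the region with the most hits (candidate area for a new branch)."""
    return max(hits, key=hits.get)
```

Calling `count_hits_by_region(SAMPLE_LOG)` gives the hit count per region, and `busiest_region` picks the area with the most hits. On a real cluster the map and reduce functions would run in parallel across machines, with the framework performing the shuffle.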
Authors and Affiliations
Anuja Pandit, Amruta Deshpande, Prajakta Karmarkar