DISTRIBUTED DATA MINING AND MINING MULTI-AGENT DATA
Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4
Abstract
The problem of distributed data mining isvery important in network problems. Ina distributed environment (such as a sensor or IP network), one has distributed probes placed at strategic locations within the network. The problem here is to be able to correlate the data seen at the various probes, and discover patterns in the global data seen at all the different probes. There could be different models of distributed data mining here, but one could involve a NOC that collects data from the distributed sites, and another in which all sites are treated equally. The goal here obviously would be to minimize the amount of data shipped between the various sites — ssentially, to reduce the communication overhead. In distributed mining, one problem is how to mine across multiple heterogeneous data sources: multi-database and ultirelational mining. Another important new area is adversary data mining. In a growing number of domains — email spam, counter-terrorism, intrusion detection/computer security, click spam, search engine spam, surveillance, fraud detection, shop bots, file sharing, etc. — data mining systems face adversaries that deliberately anipulate the data to sabotage them (e.g. make them produce false negatives). In this paper need to develop systems that explicitly take this into account, by combining data mining with game theory.
Authors and Affiliations
Vuda Sreenivasa Rao , Dr. S Vidyavathi
WSLA Schema for Functionality Based Weight fixing of Non-Functional Parameters of Web Services
Recently Web services have evolved as a cost-effective solution for exchanging information between distributed applications over different operating system, platform, and software environment. The success of such a syste...
“Spotting the techniques on OPENMP Compilers and its Optimization”
OPENMP is a parallel programming technique which is employed in order to improve the optimization. The research paper proposes a number of techniques which can be used to enhance the performance and execution of parallel...
An Implementation of Semantic Web System for Information retrieval using J2EE Technologies.
Accessing web resources (Information) is an essential facility provided by web applications to every body. Semantic web is one of the systems that provide a facility to access the resources through web service applicatio...
Association Rules Extraction from Incremental Databases through ICPT
Association Rule Mining is an important task in data mining. This paper proposes two stage ICPT (Incremental Compact Pattern Tree) Construction Methodology. This methodology facilitates to obtain incidental association r...
Intelligent and Effective Heart Disease Prediction System using Weighted Associative Classifiers
The healthcare environment is still ‘information rich’ But ‘knowledge poor’. There is a wealth of data available within the health care systems. However, there is a lack of effective analysis tools to discover hidden rel...