DISTRIBUTED DATA MINING AND MINING MULTI-AGENT DATA

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 4

Abstract

The problem of distributed data mining isvery important in network problems. Ina distributed environment (such as a sensor or IP network), one has distributed probes placed at strategic locations within the network. The problem here is to be able to correlate the data seen at the various probes, and discover patterns in the global data seen at all the different probes. There could be different models of distributed data mining here, but one could involve a NOC that collects data from the distributed sites, and another in which all sites are treated equally. The goal here obviously would be to minimize the amount of data shipped between the various sites — ssentially, to reduce the communication overhead. In distributed mining, one problem is how to mine across multiple heterogeneous data sources: multi-database and ultirelational mining. Another important new area is adversary data mining. In a growing number of domains — email spam, counter-terrorism, intrusion detection/computer security, click spam, search engine spam, surveillance, fraud detection, shop bots, file sharing, etc. — data mining systems face adversaries that deliberately anipulate the data to sabotage them (e.g. make them produce false negatives). In this paper need to develop systems that explicitly take this into account, by combining data mining with game theory.

Authors and Affiliations

Vuda Sreenivasa Rao , Dr. S Vidyavathi

Keywords

Related Articles

A Parallel Access Method for Spatial Data Using GPU

Spatial access methods (SAMs) are used for information retrieval in large spatial databases. Many of the SAMs use sequential tree structures to search the result set of the spatial data which are contained in the given q...

Design and Implementation of Neural Processor for Parsing Manufacturing Query Language 

Practically, all the approaches employed for parsing with natural languages use some or other type of neural network architecture and some typical statistical function for obtaining a parsing decision. In parsing with ne...

Fault Tolerance in Real Time Distributed System

In this paper we investigate the different techniques of fault tolerance which are used in many real time distributed systems. The main focus is on types of fault occurring in the system, fault detection techniques and t...

Cost Analysis of a Three Layered MIPv6 (TLMIPv6) Mobility Model and HMIPv6

In this paper cost analysis of a three-layer hierarchical model and HMIPv6 is done. The objective of this work is to examine the signaling cost, tunneling cost and packet dropping probability at top level anchor agents o...

Saturation Adaptive Quantizer Design for Synthetic Aperture Radar Data Compression

The essence of remote sensing resides in the acquisition of information about remote targets for further processing. As a high resolution microwave remote sensing instrument, the Synthetic Aperture Radar (SAR) has been m...

Download PDF file
  • EP ID EP124237
  • DOI -
  • Views 129
  • Downloads 0

How To Cite

Vuda Sreenivasa Rao, Dr. S Vidyavathi (2010). DISTRIBUTED DATA MINING AND MINING MULTI-AGENT DATA. International Journal on Computer Science and Engineering, 2(4), 1237-1244. https://europub.co.uk/articles/-A-124237