Anomaly Detection in Data with Extremely High Dimensional Space via Online Oversampling Principal Component Analysis
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 3
Abstract
Anomaly detection is an important research topic in data mining and machine learning. Many real-world applications, such as intrusion detection and credit card fraud detection, need an effective and efficient framework to identify deviating data instances. A good anomaly detection method should accurately identify many types of anomalies, be robust, require comparatively few resources, and perform detection in real time. In this paper we propose combining two algorithms, median-based outlier detection and online oversampling PCA, to detect anomalies effectively in an online updating mode. Median-based outlier detection uses the interquartile range, a measure of statistical dispersion equal to the difference between the upper and lower quartiles. Oversampling PCA does not need to store the entire covariance matrix or data matrix, which makes the approach more useful for online or large-scale problems. Our experimental results, compared against other anomaly detection algorithms, verify the feasibility of the proposed method.
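The interquartile-range idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `iqr_outliers`, the fence multiplier `k = 1.5` (the conventional Tukey rule), the quartile interpolation method, and the sample readings are all assumptions made for illustration.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR].

    k = 1.5 is an assumed, conventional choice; the abstract does not
    state which multiplier or quartile definition the paper uses.
    """
    # Lower and upper quartiles (the median in the middle is not needed here).
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1  # difference between the upper and lower quartiles
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < lower or v > upper]


# Hypothetical one-dimensional readings with one obvious deviation.
readings = [10, 12, 11, 13, 12, 11, 95, 12, 10, 13]
flagged = iqr_outliers(readings)
print(flagged)
```

In this sketch only the reading 95 falls outside the fences. For high-dimensional data such a rule would have to be applied per feature or to a derived score, which is where a technique such as oversampling PCA would come in.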
Authors and Affiliations
Swapnil S. Raut, Sachin N. Deshmukh