FEATURE CLUSTERING USING SUBSELECTION ALGORITHM IN BIG DATA USING FIDOOP
Journal Title: International Research Journal of Computer Science - Year 2016, Vol 0, Issue 0
Abstract
Big data processing is a high demand area which imposes a heavy burden on computation, communication, storage in data centers, which incurs considerable operational cost to data center provider. So minimizing cost has become an issue for the upcoming big data. Different from conventional cloud service one of the main feature of the big data service is the tight coupling between data and computation, as computation task can be conducted only when the corresponding data are available. As a result, three factors that is communicational cost, computational cost, operational cost effects the expenditure cost of data centers. So in order to minimize the cost clustering is used. Clustering groups a selected objects into classes of similar objects. Feature Selection Removes Irrelevant Features- it occurs in the batch processing (scheduling algorithm) Redundant Features its occurs in the cluster formation (data-centric algorithm) jointoptimization– 2 steps Features divided into clusters(subsets) MST Cluster representatives are selected Efficient, Effective, Independent. Based on these criteria, a feature clustering based on selection algorithm is proposed and experimentally evaluated for a sample cancer dataset. This work finds the effective attributes used and removes redundancy.
Authors and Affiliations
Surekha C. , Vijayalakshmi S. , Natteshan N. V. S
Information System FIFO for Publish Journal in Information System Department Faculty of Computer Science University of Mercu Buana
Information technology has a huge impact in the growth of information in Indonesia, all activities performed using information technology. In activities Department Information System Faculty of Computer Science will cert...
Cooperative Receiving Scheme for Down link of Cellular Systems
In wireless mobile cellular systems, it has been popular for mobiles to support some short-range wireless communication protocols (SRWCPs)−Bluetooth, Zigbee, NFC (near field communication), etc. In this paper, we introdu...
An Epidemic Surveillance System for Digital India
The paper presents a conceptual model for creating an epidemic surveillance system. The System will help in predicting the outbreak of epidemics and diseases. A working model can be created by implementing the terminolog...
ARDUINO-BASED AUTOMATIC MOTORCYCLE CHAIN LUBRICATION DESIGN
In recent years, we can see the increasing growth of the motorcycles. The motorcycle riders frequently forget and ignore chain maintenance problems due to their daily busyness and routines thereby making them forget and...
ARABIC Cryptography Technique Using Neural Network and Genetic Algorithm
Cryptography is the science of Encrypting / Decryption information. The goals of cryptography is to keep message confidentiality, message integrity and sender authentication. The techniques used to encrypt information in...