FEATURE CLUSTERING USING SUBSELECTION ALGORITHM IN BIG DATA USING FIDOOP

Journal Title: International Research Journal of Computer Science - Year 2016, Vol 0, Issue 0

Abstract

Big data processing is a high-demand area that places a heavy burden on computation, communication, and storage in data centers, and therefore incurs considerable operational cost for data center providers. Minimizing this cost has become a key issue for big data services. Unlike conventional cloud services, a main feature of big data services is the tight coupling between data and computation: a computation task can be carried out only when the corresponding data are available. As a result, three factors (communication cost, computation cost, and operational cost) affect the expenditure of data centers. Clustering, which groups a selected set of objects into classes of similar objects, is used to minimize this cost. Feature selection removes irrelevant features, which occurs in batch processing (scheduling algorithm), and redundant features, which occurs in cluster formation (data-centric algorithm). The joint optimization has two steps: first, features are divided into clusters (subsets) using an MST; second, cluster representatives are selected. The selection should be efficient, effective, and independent. Based on these criteria, a feature clustering method based on a selection algorithm is proposed and experimentally evaluated on a sample cancer dataset. This work identifies the effective attributes and removes redundancy.
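The two steps named in the abstract can be illustrated with a minimal sketch. It assumes that "MST" stands for minimum spanning tree, and it uses absolute Pearson correlation as a stand-in for both the relevance and the redundancy measure, since the abstract does not say which measure the authors use. The function names, thresholds, and synthetic data below are hypothetical; this is not the authors' implementation and it does not involve FiDoop or any distributed processing.

```python
import math


def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    if va == 0 or vb == 0:
        return 0.0
    return cov / math.sqrt(va * vb)


def prim_mst(nodes, weight):
    """MST edges (u, v, w) of the complete graph over `nodes`, via Prim's algorithm."""
    if not nodes:
        return []
    in_tree = {nodes[0]}
    edges = []
    while len(in_tree) < len(nodes):
        # Cheapest edge leaving the current tree.
        u, v = min(
            ((a, b) for a in in_tree for b in nodes if b not in in_tree),
            key=lambda e: weight(*e),
        )
        edges.append((u, v, weight(u, v)))
        in_tree.add(v)
    return edges


def select_features(columns, target, relevance_min=0.3, cut_above=0.5):
    """Hypothetical helper: filter irrelevant features, cluster the rest, pick representatives."""
    # Irrelevant-feature removal: keep features sufficiently correlated with the target.
    relevance = {name: abs(pearson(col, target)) for name, col in columns.items()}
    kept = [name for name in columns if relevance[name] >= relevance_min]

    # Step 1a: MST over the kept features; distance = 1 - |correlation| (assumed measure).
    def dist(a, b):
        return 1.0 - abs(pearson(columns[a], columns[b]))

    mst = prim_mst(kept, dist)

    # Step 1b: drop long MST edges; the remaining connected components are the clusters.
    parent = {name: name for name in kept}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for u, v, w in mst:
        if w <= cut_above:
            parent[find(u)] = find(v)

    clusters = {}
    for name in kept:
        clusters.setdefault(find(name), []).append(name)

    # Step 2: the most relevant feature of each cluster is its representative.
    return sorted(max(members, key=relevance.get) for members in clusters.values())


# Synthetic data, for illustration only (not the cancer dataset from the paper).
a = [1, 1, -1, -1, 1, 1, -1, -1]
b = [1, -1, 1, -1, 1, -1, 1, -1]
c = [x * y for x, y in zip(a, b)]               # uncorrelated with the target
columns = {
    "f_a": a,
    "f_a_noisy": [x + 0.5 * z for x, z in zip(a, c)],  # redundant with f_a
    "f_b": b,
    "f_c": c,
}
target = [2 * x + y for x, y in zip(a, b)]

print(select_features(columns, target))  # ['f_a', 'f_b']
```

In this toy run, `f_c` is dropped as irrelevant, `f_a` and `f_a_noisy` fall into one cluster because they are strongly correlated, and `f_a` is kept as the representative because it is more relevant to the target. The two thresholds are arbitrary choices for the example.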

Authors and Affiliations

Surekha C., Vijayalakshmi S., Natteshan N. V. S

How To Cite

Surekha C., Vijayalakshmi S., Natteshan N. V. S (2016). FEATURE CLUSTERING USING SUBSELECTION ALGORITHM IN BIG DATA USING FIDOOP. International Research Journal of Computer Science, 0(0), 5-10. https://europub.co.uk/articles/-A-180576