Moderm Techniques used for Big Data Clustering:A Review

Journal Title: International Journal of Engineering and Science Invention - Year 2018, Vol 7, Issue 6

Abstract

Huge quantity of data is generated in each second in the modern digital era. It became almost impossible for the conventional data analytics frameworks to handle this ever-exploding big data. Storage spaces for this huge amount of data is also running out. Big data clustering algorithms help up to huge extent where dimensionality of enormous big data is reduced to a great extent without any significant loss of relevant information. Big data clustering algorithms need not only be robust in working but also need to be compliant for distributed and parallel processing. This paper is reviewing the most modern and robust big data clustering algorithms. Four algorithms are chosen, one from each category of Partitioning based, Density based, Grid based, Model-based big data clustering algorithms. Algorithms considered for in depth review are partition based big data clustering in a distributed Apache spark framework, Density Based Spatial Clustering of Applications with Noise, grid based OptiGrid clustering algorithm and model based Self-organizing Map clustering algorithm. Detailed algorithm working and shortcomings of each of these algorithms are discussed in detail.

Authors and Affiliations

Sharon Susan Jacob, Prof. R. Vijaya kumar

Keywords

Related Articles

Urban Rain Water Logging In Greater Hyderabad - A Spatial Perspective

Urban Centers In India Face An Ironical Situation I.E. On One Hand There Is An Acute Water Scarcity And On The Other Hand The Streets Are Flooded With Heavy Downpour Throwing The Traffic Out Of Gear. Problems Associated...

Evaluation of Natural Radioactivity and Physico-Chemical Characteristics alongEl-Salam Canal, Egypt

The presence of primordial radionuclides in human habitats has always been a source of prolonged nuclear radiation exposure. Measurements of naturally occurring radionuclides in the environment can be used as a baseline...

Evaluation of self-directed learning impact in higher education

The role of the university instructor is becoming more varied, more complex and at the same time more and more oriented to specific tasks. On the other hand, the scientific progress and the organizational and cultural te...

Study of Variation of R.M.S. Strain with Average Particle Size of Different Phases Evolved During High Energy Ball Milling Of Anatage Tio2 Suggesting Simultaneous Occurrence of Hall-Petch Effect and Inverse Hall-Petch Effect

Average particle size (Grain size) plays an important role in controlling material strength. In polycrystalline materials with grain sizes in the micrometre range, strength increases with decreasing size obeying Hall–Pet...

Dynamic Model of Windings Parameters for Electrical Machines with a Non-Symmetric Magnetic System

Electrical Machines that are having clearly expressed poles in rotor or in stator possess nonsymmetric magnetic systems. In this paper we study main winding inductances at the example of magnetic system with salient roto...

Download PDF file
  • EP ID EP397353
  • DOI -
  • Views 67
  • Downloads 0

How To Cite

Sharon Susan Jacob, Prof. R. Vijaya kumar (2018). Moderm Techniques used for Big Data Clustering:A Review. International Journal of Engineering and Science Invention, 7(6), 1-5. https://europub.co.uk/articles/-A-397353