BIG Data – A Review

Abstract

As more data becomes available from an abundance of sources both within and outside, organizations are seeking to use those abundant resources to increase innovation, retain customers, and increase operational efficiency. At the same time, organizations are challenged by their end users, who are demanding greater capability and integration to mine and analyze burgeoning new sources of information. Big Data provides opportunities for business users to ask questions they never were able to ask before. How can a financial organization find better ways to detect fraud? How can an insurance company gain a deeper insight into its customers to see who may be the least economical to insure? How does a software company find its most at-risk customers those who are about to deploy a competitive product? They need to integrate Big Data techniques with their current enterprise data to gain that competitive advantage. Heterogeneity, scale, timeliness, complexity, and privacy problems with Big Data impede progress at all phases of the pipeline that can create value from data. The problems start right away during data acquisition, when the data tsunami requires us to make decisions, currently in an ad hoc manner, about what data to keep and what to discard, and how to store what we keep reliably with the right metadata. Much data today is not natively in structured format; for example, tweets and blogs are weakly structured pieces of text, while images and video are structured for storage and display, but not for semantic content and search: transforming such content into a structured format for later analysis is a major challenge. The value of data explodes when it can be linked with other data, thus data integration is a major creator of value. Since most data is directly generated in digital format today, we have the opportunity and the challenge both to influence the creation to facilitate later linkage and to automatically link previously created data. Data analysis, organization, retrieval, and modelling are other foundational challenges. Data analysis is a clear bottleneck in many applications, both due to lack of scalability of the underlying algorithms and due to the complexity of the data that needs to be analyzed. Finally, presentation of the results and its interpretation by non-technical domain experts is crucial to extracting actionable knowledge.

Authors and Affiliations

Anuradha Bhatia

Keywords

Related Articles

 Skin Detection for Face Recognition Based on HSV Color Space

 Skin color has proven to be a useful and robust cue for face detection, localization and tracking. Image content filtering, content aware video compression and image color balancing applications can also benefit f...

 Experimental Techniques to find Thermal Residual Stress in Composite Materials –A study of the Literature Review

 If two dissimilar materials are joined at high temperature, a very high residual stresses developed near the interface of materials. The mismatch between the co-efficient of thermal expansion leads to stress conce...

 CLUSTER BASED BOOSTING TECHNIQUES FOR ACCURACY PREDICTION

 In recent years encroachment of social media in human life is inexplicable. The greatest and most significant issue in latest technology is the fast retrieval of information from large databases. To overcome this,...

FiWi – FUTURE GENERATION BR OADBAND ACCESS TECHN OLOGY

As today’s generation is now became the slave of different internet services at every step of their life. Hence, different researches are carried out continuously sin ce past decade to enhance the performan...

 Modeling and Controlling of MC-UPQC to Enhance Power Quality

 This paper presents a new Unified Power-Quality Conditioning System (MC-UPQC), capable of simultaneous compensation for voltage and current in multi-bus/multi-feeder systems. By using one shunt VoltageSource Conv...

Download PDF file
  • EP ID EP117339
  • DOI -
  • Views 83
  • Downloads 0

How To Cite

Anuradha Bhatia (30). BIG Data – A Review. International Journal of Engineering Sciences & Research Technology, 2(9), 2102-2106. https://europub.co.uk/articles/-A-117339