BIG Data – A Review

Abstract

As more data becomes available from an abundance of sources both within and outside, organizations are seeking to use those abundant resources to increase innovation, retain customers, and increase operational efficiency. At the same time, organizations are challenged by their end users, who are demanding greater capability and integration to mine and analyze burgeoning new sources of information. Big Data provides opportunities for business users to ask questions they never were able to ask before. How can a financial organization find better ways to detect fraud? How can an insurance company gain a deeper insight into its customers to see who may be the least economical to insure? How does a software company find its most at-risk customers those who are about to deploy a competitive product? They need to integrate Big Data techniques with their current enterprise data to gain that competitive advantage. Heterogeneity, scale, timeliness, complexity, and privacy problems with Big Data impede progress at all phases of the pipeline that can create value from data. The problems start right away during data acquisition, when the data tsunami requires us to make decisions, currently in an ad hoc manner, about what data to keep and what to discard, and how to store what we keep reliably with the right metadata. Much data today is not natively in structured format; for example, tweets and blogs are weakly structured pieces of text, while images and video are structured for storage and display, but not for semantic content and search: transforming such content into a structured format for later analysis is a major challenge. The value of data explodes when it can be linked with other data, thus data integration is a major creator of value. Since most data is directly generated in digital format today, we have the opportunity and the challenge both to influence the creation to facilitate later linkage and to automatically link previously created data. Data analysis, organization, retrieval, and modelling are other foundational challenges. Data analysis is a clear bottleneck in many applications, both due to lack of scalability of the underlying algorithms and due to the complexity of the data that needs to be analyzed. Finally, presentation of the results and its interpretation by non-technical domain experts is crucial to extracting actionable knowledge.

Authors and Affiliations

Anuradha Bhatia

Keywords

Related Articles

 NUMERICAL SIMULATION BY USING ANOVA ANALYSIS FOR THERMAL COMFORT IN CAR

 A car will be selected depends upon the majority of occupants. Then on the basis of that factors and levels are decided. Factors are those characteristics which really affects our observation. Levels are the diffe...

 FPGA Implementations of Tiny Mersenne Twister

 Random number generators are essential in many computing applications, such as Artificial Intelligence like genetic algorithms and automated opponents, random game content, simulation of complex phenomena such as...

ELIMINATING FOUR WAVE MIXING WITH DYNAMIC CHANNEL SHUFFLING IN DWDM OPTICAL NETWORK

Optical Fiber systems are major telecommunication base for worldwide broadband networks. The bandwidth capacity is increased by using DWDM optical networks. But in DWDM Optical Communication System, the nonlinear effect...

 Review Paper on Vibration Reduction in Hand Held Power Machinery

 Hand-arm vibration (HAV) is vibration transmitted from a work processes into workers’ hands and arms. It can be caused by operating hand-held power tools, hand-guided equipment, or by holding materials being proce...

 An Efficient Instruction-Level Energy Estimation Model for Embedded Systems

 To optimize the energy consumption embedded systems, the estimation of energy consumption of the embedded applications are very important. This paper proposes a simple but effective instruction-level energy estim...

Download PDF file
  • EP ID EP117339
  • DOI -
  • Views 81
  • Downloads 0

How To Cite

Anuradha Bhatia (30). BIG Data – A Review. International Journal of Engineering Sciences & Research Technology, 2(9), 2102-2106. https://europub.co.uk/articles/-A-117339