High performance computing in big data analytics

Journal Title: Applied Medical Informatics - Year 2019, Vol 41, Issue 0

Abstract

For long time High-Performance Computing (HPC) has been critical for running large-scale modeling and simulation using numerical models. The big data analytics domain (BDA) has been rapidly developed over the last years to process torrents of data now being generated in various domains. But, in general, the data analytics software was not developed inside the scientific computing community, and new approches were adopted by BDA specialists. Data-intensive applications are needed in varied field ranges from advanced research— as genomics, proteomics, epidemiology and systems biology—to commercial initiatives to develop new drugs and medical treatments, agricultural pesticides and other bio-products. Big data processing is still needed in the more HPC traditional domains as physics, climate, and astronomy, but even there adopting data-driven paradigms could bring important advantages. On the other side BDA needs the infrastructure and the fundamentals of HPC in order to face with the needed computational challenges. There are important differences in the approaches of these two domains: those that are working in BDA focus on the 4Vs of big data which are: volume, velocity, variety, and veracity, while HPC scientists tend to focus on performance, scaling, and the power efficiency of a computation. As we are heading towards extreme-scale HPC coupled with data intensive analytics, the integration of BDA and HPC is a necessity and a current hot topic of research.

Authors and Affiliations

Virginia NICULESCU

Keywords

Related Articles

Satistical Graphical User Interface Plug-In for Survival Analysis in R Statistical and Graphics Language and Environment.

[i][b]Introduction[/b][/i]: R is a statistical and graphics language and environment. Although it is extensively used in command line, graphical user interfaces exist to ease the accommodation with it for new users. Rcmd...

Sources of information on medicines: a comparation between Romania and other European countries

Doctors are required to discriminate between different types of online information sources. The official source of information on medicines in Romania is the Nomenclatorul medicamentelor published online by the National...

National training system for simulation in anesthesia and intensive therapy and other specialties – SimLab

Introduction: The SimLab project addresses quality performance issues in emergency care,through a systemic approach to the lifelong learning program. Aim: The main educationalobjective is to train the residents of anesth...

Tooth Enamel, the Result of the Relationship between Matrix Proteins and Hydroxyapatite Crystals

Enamel, a structure of epithelial origin, represents a protective tooth cover. The cells responsible for the formation of enamel, ameloblasts, are lost at the time of tooth eruption, so that enamel becomes an acellular s...

Patient data security in the era of medical connected devices

Internet of Things (IoT) is a domain that includes embedded devices connected to a network and used in multiple applications such as: transport, telecommunications, medicine, industrial field and many others. Medical Con...

Download PDF file
  • EP ID EP655035
  • DOI -
  • Views 77
  • Downloads 0

How To Cite

Virginia NICULESCU (2019). High performance computing in big data analytics. Applied Medical Informatics, 41(0), 2-2. https://europub.co.uk/articles/-A-655035