A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations

Abstract

NGS (Next Generation Sequencing) technology has resulted in huge amount of proteomics data that exists in the form of interactions (protein-protein, gene-protein, and gene-disease). ETL (Extraction, Transformation, and Loading) techniques are very useful for Databases. Existing Rational Databases are not unified and having SQL (Structured Query Language). Proteomics data requires improvement for Integration of different Data sources. With the usage of NoSQL (not only SQL), improve the efficiency and performance. For this, a novel based unified model has been designed for protein interactions data (P-P, G-G, and G-D) by using Apache HBase to evaluate given the model, different case studies have been used.

Authors and Affiliations

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood

Keywords

Related Articles

 Hybrid Denoising Method for Removal of Mixed Noise in Medical Images

 Nowadays, Digital image acquisition and processing techniques plays a very important role in current day medical diagnosis. During the acquisition process, there could be distortions in the images, which will negat...

Audio Augmentation for Traffic Signs: A Case Study of Pakistani Traffic Signs

Augmented Reality (AR) extend the appearance of real-world by adding digital information to the scene using computer graphics and image processing techniques. Various approaches have been used to detect, identify and tra...

Evaluation of OLSR Protocol Implementations using Analytical Hierarchical Process (AHP)

Adhoc networks are part of IEEE 802.11 Wireless LAN Standard also called Independent Basic Service Set (IBSS) and work as Peer to Peer network by default. These work without the requirement of an Infrastructure (such as...

Lung-Deep: A Computerized Tool for Detection of Lung Nodule Patterns using Deep Learning Algorithms Detection of Lung Nodules Patterns

The detection of lung-related disease for radiologists is a tedious and time-consuming task. For this reason, automatic computer-aided diagnosis (CADs) systems were developed by using digital CT scan images of lungs. The...

Implementation of Efficient Speech Recognition System on Mobile Device for Hindi and English Language

Speech recognition or speech to text conversion has rapidly gained a lot of interest by large organizations in order to ease the process of human to machine communication. Optimization of the speech recognition process i...

Download PDF file
  • EP ID EP258801
  • DOI 10.14569/IJACSA.2017.080543
  • Views 102
  • Downloads 0

How To Cite

M. Atif Sarwar, Hira Yaseen, Javed Ferzund, Hina Farooq, Azka Mahmood (2017). A Novel Big Data Storage Model for Protein-Protein Interaction and Gene-Protein Associations. International Journal of Advanced Computer Science & Applications, 8(5), 346-357. https://europub.co.uk/articles/-A-258801